By Tying Embeddings You Are Assuming the Distributional Hypothesis

Conference proceedings contribution
Publication date:
2024
Citation:
By Tying Embeddings You Are Assuming the Distributional Hypothesis / F. Bertolotti, W. Cazzola (PROCEEDINGS OF MACHINE LEARNING RESEARCH). - In: ICML'24 / [edited by] R. Salakhutdinov, Z. Kolter, K. Heller, A. Weller, N. Oliver, J. Scarlett, F. Berkenkamp. - [s.l.] : PMLR, 2024 Jul. - pp. 3584-3610 (41st International Conference on Machine Learning, July 21-27, 2024, Wien (Österreich)) [10.5555/3692070.3692213].
Abstract:
In this work, we analyze both theoretically and empirically the effect of tied input-output embeddings, a popular technique that reduces the model size while often improving training. Interestingly, we found that this technique is connected to the distributional hypothesis of Harris (1954), often summarized by the famous quote of Firth (1957), “a word is characterized by the company it keeps”. Specifically, our findings indicate that words (or, more broadly, symbols) with similar semantics tend to be encoded in similar input embeddings, while words that appear in similar contexts are encoded in similar output embeddings (thus explaining the semantic space arising in the input and output embeddings of foundation language models). As a consequence of these findings, the tying of the input and output embeddings is encouraged only when the distributional hypothesis holds for the underlying data. These results also provide insight into the embeddings of foundation language models (which are known to be semantically organized). Further, we complement the theoretical findings with several experiments supporting the claims.
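For readers unfamiliar with the technique named in the abstract, the following is a minimal sketch of what tying input and output embeddings means. It is not the authors' code: the class `TinyLM`, the vocabulary and embedding sizes, and the absence of any hidden layer are assumptions chosen purely for illustration.

```python
import random


def make_matrix(rows, cols, rng):
    """Return a rows x cols matrix of small random floats."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


class TinyLM:
    """Hypothetical toy model: logits = E_out @ E_in[token], no hidden layer."""

    def __init__(self, vocab_size, dim, tied, seed=0):
        rng = random.Random(seed)
        self.E_in = make_matrix(vocab_size, dim, rng)
        # Tied: the output matrix is the very same object as the input matrix.
        # Untied: a second, independently initialised matrix is allocated.
        self.E_out = self.E_in if tied else make_matrix(vocab_size, dim, rng)

    def logits(self, token):
        """Score every vocabulary item against the input embedding of `token`."""
        h = self.E_in[token]
        return [sum(w * x for w, x in zip(row, h)) for row in self.E_out]

    def num_parameters(self):
        """Count parameters, counting a shared matrix only once."""
        if self.E_out is self.E_in:
            matrices = [self.E_in]
        else:
            matrices = [self.E_in, self.E_out]
        return sum(len(m) * len(m[0]) for m in matrices)


tied = TinyLM(vocab_size=5, dim=3, tied=True)
untied = TinyLM(vocab_size=5, dim=3, tied=False)
print(tied.num_parameters(), untied.num_parameters())  # prints "15 30"
```

In this toy setting the tied model stores one 5 x 3 matrix instead of two, which is the reduction in model size the abstract refers to. Because the same vectors serve as both input and output embeddings, the model cannot represent a symbol differently in its two roles.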
IRIS type:
03 - Contribution in volume
Author list:
F. Bertolotti, W. Cazzola
University authors:
CAZZOLA WALTER (author)
Link to full record:
https://air.unimi.it/handle/2434/1231755
Link to full text:
https://air.unimi.it/retrieve/handle/2434/1231755/3294104/icaml24-published.pdf
Book title:
ICML'24
Project:
Typeful Language Adaptation for Dynamic, Interacting and Evolving Systems
Research areas

Sectors

Sector INFO-01/A - Informatica (Computer Science)