2. (Large) language models
The first probabilistic language models date back several decades and were inspired by work on modeling human and computer languages with Markov chains. Such models have been used in natural language processing and speech recognition since the 1980s and 1990s, where they estimate the probability of a word appearing given the words that precede it. Because they rely on a short history (no previous word for unigram models, one and two previous words for bigram and trigram models), these models were unable to take into account dependencies beyond a few words, and had difficulty handling rare and new words.
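To make the idea of estimating a word's probability from its history concrete, here is a minimal sketch of a bigram model based on simple counting (maximum-likelihood estimation, with no smoothing). The function names `train_bigram` and `bigram_prob` and the toy corpus are illustrative assumptions, not taken from the article.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count how often each word follows each preceding word."""
    counts = defaultdict(Counter)
    for prev, word in zip(tokens, tokens[1:]):
        counts[prev][word] += 1
    return counts

def bigram_prob(counts, prev, word):
    """Maximum-likelihood estimate of P(word | prev).

    Returns 0.0 when `prev` was never seen or `word` never followed it.
    """
    followers = counts.get(prev, Counter())
    total = sum(followers.values())
    return followers[word] / total if total else 0.0

# Toy corpus, invented for illustration only.
corpus = "the cat sat on the mat the cat ran".split()
model = train_bigram(corpus)

print(bigram_prob(model, "the", "cat"))  # "cat" follows "the" in 2 of 3 cases
print(bigram_prob(model, "the", "dog"))  # unseen pair: probability 0.0
```

The second call illustrates the weakness with rare and new words: any pair absent from the training data receives a probability of zero unless some form of smoothing is added.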
Large Language Models (LLMs) emerged in the late 2010s. They have benefited both from theoretical advances in machine learning, with deep neural networks
Directory
Manufacturers – Suppliers – Distributors (non-exhaustive list)
Elastic
Emvista
Hugging Face