2. (Large) language models

The first probabilistic language models were created many years ago, and were inspired by work to model human and computer languages using Markov chains. Models have been used in automatic language processing and speech recognition since the 1980s-1990s, making it possible to estimate the probability of a word's appearance based on previous words. Using a short history (a single previous word for unigram models, two and three for bigram and trigram models), these models were unable to take into account dependencies beyond a few words, and had difficulty handling rare and new words.

Large Language Models (LLMs) emerged in the late 2010s. They have benefited both from theoretical advances in machine learning, with deep neural networks

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed? Log in!

Ongoing reading
(Large) language models

Previous
page Classic information search

Generative process enhanced by information retrieval

Article included in this offer

"Digital documents and content management"

( 75 articles )

Complete knowledge base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

View offer details

Bibliography

(1) - AMINI (M.-R.), GAUSSIER (E.) - Recherche d'information : Applications, modèles et algorithmes-Fouille de données, décisionnel et big data. - Éditions Eyrolles (2013).
(2) - ROBERTSON (S.E.), WALKER (S.) - Some simple effective approximations to the 2-poisson...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed? Log in!