1. Typewriters: language models
1.1 Spam filtering
Let's start with an elementary language processing task: spam filtering. A probabilistic treatment of this task involves three steps:
1. collection of a representative set of e-mails, containing a set D_ok of acceptable e-mails and a set D_ko of undesirable e-mails;
2. construction of a numerical representation for texts. A very simple representation transforms each e-mail d into a large binary vector h in ...
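The representation in step 2 can be illustrated with a minimal sketch. It assumes, as is common for binary text representations, that each coordinate of h corresponds to one word of a fixed vocabulary and is set to 1 when that word occurs in the e-mail. The tokenizer, the function names, and the toy e-mails below are hypothetical and chosen only for illustration; the source does not specify them.

```python
import re


def tokenize(text):
    """Split a text into lowercase word tokens (an assumed, deliberately naive tokenizer)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def build_vocabulary(emails):
    """Assign a fixed index to every distinct token seen in the collection."""
    tokens = set()
    for text in emails:
        tokens.update(tokenize(text))
    # Sorting makes the word-to-index mapping reproducible across runs.
    return {tok: i for i, tok in enumerate(sorted(tokens))}


def binary_vector(text, vocab):
    """Return h with h[i] = 1 if vocabulary word i occurs in the text, else 0."""
    h = [0] * len(vocab)
    for tok in tokenize(text):
        idx = vocab.get(tok)
        if idx is not None:  # words outside the vocabulary are ignored
            h[idx] = 1
    return h


# Toy stand-ins for the two sets of e-mails (hypothetical data).
d_ok = ["Meeting moved to Monday", "See you Monday"]
d_ko = ["Win money now", "Free money, win now"]

vocab = build_vocabulary(d_ok + d_ko)
h = binary_vector("Win free money", vocab)
print(h)
```

With a realistic collection the vocabulary holds tens of thousands of words, so h is large and mostly zeros. Note that this representation records only whether a word is present, not how often it occurs or in what order.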