1. Typewriters: language models

1.1 Spam filtering

Let's start with an elementary language processing task: spam filtering. Its probabilistic processing involves three steps:

1. collection of a representative set of e-mails, containing a set D _ok of acceptable e-mails and a set D _ko of undesirable e-mails ;

2. construction of a numerical representation for texts. A very simple representation transforms each e-mail d into a large binary vector h in ${0, 1}^{| V |}$ ...

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed? Log in!

Ongoing reading
Typewriters: language models

Previous
page Transformer: neural networks for automatic language processing

Transformer model

Article included in this offer

"Technological innovations"

( 187 articles )

Complete knowledge base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

View offer details

Bibliography

(1) - AHARONI (R.), JOHNSON (M.), FIRAT (O.) - Massively multilingual neural machine translation. - Proceedings of the 2019 conference of the north American chapter of the association for computational linguistics : Human language technologies, volume 1 (long and short papers), Association for Computational Linguistics, p. 3874-3884 (2019)....

You do not have access to this resource.

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed? Log in!