Typewriters: language models
Transformer: neural networks for automatic language processing
Archive REF: IN195 V1
Typewriters: language models
Transformer: neural networks for automatic language processing

Author : François YVON

Publication date: March 10, 2022, Review date: November 20, 2024 | Lire en français

Logo Techniques de l'Ingenieur You do not have access to this resource.
Request your free trial access! Free trial

Already subscribed?

1. Typewriters: language models

1.1 Spam filtering

Let's start with an elementary language processing task: spam filtering. Its probabilistic processing involves three steps:

1. collection of a representative set of e-mails, containing a set D ok of acceptable e-mails and a set D ko of undesirable e-mails ;

2. construction of a numerical representation for texts. A very simple representation transforms each e-mail d into a large binary vector h in {0,1}|V|...

You do not have access to this resource.
Logo Techniques de l'Ingenieur

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed?


Article included in this offer

"Technological innovations"

( 187 articles )

Complete knowledge base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

View offer details
Contact us