Transformer model
Transformer: neural networks for automatic language processing
Archive REF: IN195 V1

Author: François YVON

Publication date: March 10, 2022, Review date: November 20, 2024

2. Transformer model

2.1 Attention, a fundamental mechanism

With these first concepts of probabilistic modeling established, this section introduces the Transformer model, which relies on a more general mechanism, attention, for encoding the context of each decision.

2.1.1 Context vector calculation

The central idea of the Transformer model
