4. Resources for automatic text processing

NLP systems rely on several types of resources:

textual resources: textual data, known as corpora, are used to create test benches, to train learning systems, to extract lexical data, etc.;
lexical resources: lexicons form the core of the linguistic information used by a system. They vary in nature depending on the application, and incorporate more or less complex information, from simple word lists to structured semantic resources. Given the cost involved in building a lexicon for a given application, the trend is towards reusability and automatic acquisition of lexical data;
software resources: lemmatizers, segmenters and labelers are the basic building blocks of text processing. The complexity of an NLP application calls for the reusability of existing components; this...
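
As a rough illustration of how the three resource types fit together, the sketch below uses a toy corpus, a hand-built lemma lexicon and two minimal software components (a segmenter and a lemmatizer) to derive a simple word list automatically. Everything in it is an assumption made for illustration: the names CORPUS, LEMMA_LEXICON, segment, lemmatize and extract_word_list are hypothetical, the data is invented, and real systems rely on much larger resources and more robust components.

```python
import re
from collections import Counter

# Textual resource: a hypothetical toy corpus (real corpora are far larger).
CORPUS = [
    "The cats sat on the mat.",
    "A cat sits on mats.",
]

# Lexical resource: a hypothetical hand-built lexicon mapping
# inflected forms to their lemmas.
LEMMA_LEXICON = {"cats": "cat", "mats": "mat", "sat": "sit", "sits": "sit"}


def segment(text):
    """Segmenter: split a text into lowercase alphabetic tokens."""
    return re.findall(r"[a-z]+", text.lower())


def lemmatize(token):
    """Lemmatizer: look the token up in the lexicon; unknown tokens are kept as-is."""
    return LEMMA_LEXICON.get(token, token)


def extract_word_list(corpus):
    """Automatic acquisition: count lemma frequencies over the whole corpus."""
    counts = Counter()
    for sentence in corpus:
        counts.update(lemmatize(token) for token in segment(sentence))
    return counts


word_list = extract_word_list(CORPUS)
```

The segmenter and lemmatizer are kept as separate functions so that either one could be swapped for an existing component, which is the kind of reuse the list above describes.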


