4. Resources for automatic text processing
NLP systems rely on several types of resources:
textual resources: textual data, known as corpora, are used to create test benches, to train learning systems, to extract lexical data, etc. ...;
lexical resources: lexicons form the core of the linguistic information used by a system. They vary in nature depending on the application, and incorporate more or less complex information, from simple word lists to structured semantic resources. Given the cost involved in building a lexicon for a given application, the trend is towards reusability and automatic acquisition of lexical data;
software resources: lemmatizers, segmenters and labelers are the basic building blocks of text processing. The complexity of a NLP application calls for the reusability of existing components; this...
Exclusive to subscribers. 97% yet to be discovered!
Already subscribed? Log in!
Resources for automatic text processing
Article included in this offer
"Software technologies and System architectures"
(
227 articles
)
Updated and enriched with articles validated by our scientific committees
A set of exclusive tools to complement the resources
Bibliography
Exclusive to subscribers. 97% yet to be discovered!
Already subscribed? Log in!