Indexing engine
Search Engines: Google, Bing and their competitors
Quizzed article REF: H7240 V3
Indexing engine
Search Engines: Google, Bing and their competitors

Author : Olivier ANDRIEU

Publication date: April 10, 2022, Review date: February 29, 2024 | Lire en français

Logo Techniques de l'Ingenieur You do not have access to this resource.
Request your free trial access! Free trial

Already subscribed?

3. Indexing engine

3.1 Index

Once the web pages have been crawled, the spider sends the collected information to the indexing engine. Indexing is carried out in full text: all the words on a page, and more generally its HTML code, are then taken into account.

The indexing systems then identify, in "full text", all the words in the texts contained on the pages, as well as their position within the page. However, some engines may limit their indexing capacity. For many years, for example, Google limited its indexing to the first 101 kilobytes of a page (which was, however, quite a substantial size). Today, this limit no longer applies. Other engines can select according to document format (Excel, Powerpoint, PDF...).

Finally, as with documentary software...

You do not have access to this resource.
Logo Techniques de l'Ingenieur

Exclusive to subscribers. 97% yet to be discovered!

You do not have access to this resource. Click here to request your free trial access!

Already subscribed?


Article included in this offer

"Digital documents and content management"

( 71 articles )

Complete knowledge base

Updated and enriched with articles validated by our scientific committees

Services

A set of exclusive tools to complement the resources

View offer details