Practical sheet | REF: FIC1275 V1

Scraping, methods and tools for business intelligence

Author: David COMMARMOND

Publication date: August 10, 2024

Automatically translated using artificial intelligence technology (note that only the original version is binding).

    2. Organizing your data

    To organize your data, you need to understand the notions of structured, unstructured and semi-structured data. Data homogeneity is imperative, and this is where structuring comes into play.
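
    As a minimal sketch of these three notions, the Python example below carries the same record in a structured form (CSV), a semi-structured form (JSON) and an unstructured form (free text), then brings all three to one homogeneous shape. The sample values, the field names (city, country, location) and the helper functions are assumptions chosen for illustration; they do not come from the original sheet.

```python
import csv
import io
import json
import re

# Illustrative sample data (assumed, not from the original sheet).
STRUCTURED = "city,country\nParis,France\n"                                # fixed columns
SEMI_STRUCTURED = '{"location": {"city": "Paris", "country": "France"}}'   # nested keys
UNSTRUCTURED = "Our office is located in Paris, France."                   # free text


def from_structured(text: str) -> list[dict]:
    """CSV already has one value per named column: read it row by row."""
    return [dict(row) for row in csv.DictReader(io.StringIO(text))]


def from_semi_structured(text: str) -> list[dict]:
    """JSON carries its own keys, but the nesting must be flattened."""
    location = json.loads(text)["location"]
    return [{"city": location["city"], "country": location["country"]}]


def from_unstructured(text: str) -> list[dict]:
    """Free text has no schema: a pattern has to be imposed on it."""
    match = re.search(r"in (\w+), (\w+)", text)
    if match is None:
        return []
    return [{"city": match.group(1), "country": match.group(2)}]


records = (
    from_structured(STRUCTURED)
    + from_semi_structured(SEMI_STRUCTURED)
    + from_unstructured(UNSTRUCTURED)
)
```

    After this step every source yields rows with the same two keys, which is the homogeneity the paragraph above calls for. The regular expression is deliberately naive and would need to be adapted to real text.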

    Aggregating data from several sources raises the question of formats and platforms (Mac, PC, Linux). There are thousands of formats in this field, more or less compatible with one another depending on the publisher and on whether they are proprietary or royalty-free.

    In addition, variations in user input must be taken into account: for example, Paris, PARIS, paris and apris (a typing error) can all appear as values under a heading that may itself be named CITY, Ville, VILLE_France, VILLE_FRANCE or PAYS_VILLE. The same applies to numerical inputs such as 0.1; 1.0; 1.
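
    The variants listed above can be reduced to a single form with a small normalization layer. The sketch below is one possible approach using only the Python standard library; the alias table, the reference list of cities, the 0.75 similarity cutoff and the handling of a decimal comma are all assumptions made for illustration, not the method of the original sheet.

```python
import difflib
import unicodedata

# Hypothetical alias table: every known heading variant maps to one name.
HEADER_ALIASES = {
    "city": "city",
    "ville": "city",
    "ville_france": "city",
    "pays_ville": "city",
}

# Hypothetical reference list of accepted spellings.
KNOWN_CITIES = ["Paris", "Lyon", "Marseille"]


def normalize_header(raw: str) -> str:
    """Map CITY, Ville, VILLE_France, ... to a single canonical heading."""
    key = raw.strip().lower()
    return HEADER_ALIASES.get(key, key)


def normalize_city(raw: str, cutoff: float = 0.75) -> str:
    """Return the reference spelling, tolerating case and small typos."""
    cleaned = unicodedata.normalize("NFKC", raw).strip().lower()
    by_lower = {city.lower(): city for city in KNOWN_CITIES}
    if cleaned in by_lower:
        return by_lower[cleaned]
    # Fuzzy fallback for inputs such as "apris"; the cutoff is an assumption.
    close = difflib.get_close_matches(cleaned, list(by_lower), n=1, cutoff=cutoff)
    return by_lower[close[0]] if close else raw.strip()


def normalize_number(raw: str) -> float:
    """Parse a numeric string; accepting a decimal comma is an assumption."""
    return float(raw.strip().replace(",", "."))


cities = [normalize_city(value) for value in ["Paris", "PARIS", "paris", "apris"]]
numbers = [normalize_number(value) for value in ["0.1", "1.0", "1"]]
```

    With these assumptions, the four spellings resolve to the same city, and "1.0" and "1" parse to the same number. Fuzzy matching can produce false positives on short names, so values that fall through to the fallback are worth logging for manual review.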

    The difficulty comes with the desire to automate the process, when updating data means constantly...
