Preparing data
Datavisualisation, a data mining and exploration tool
Practical sheet REF: FIC1404 V1

Author: Véronique MESGUICH

Publication date: October 10, 2021

3. Preparing data

It is unrealistic to expect that data gathered from various sources will be ready to use as is.

The analyst will have to apply several successive processes:

  • checking for outliers and extreme values, and removing them where appropriate;

  • data homogenization where possible (e.g., converting numerical data or standardizing names); see figure "Example of Open Refine's name homogenization processing". OpenRefine is a tool for reprocessing, standardizing and deduplicating data;

  • data structuring, for example using segmentation solutions (web scraping) or named-entity extraction;

  • data exploitation (statistical processing, spatialization,...
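
The first two steps in the list above (outlier control and name homogenization) can be illustrated with a minimal Python sketch. It is not the author's method. The interquartile-range rule, the threshold k=1.5, the function names and the sample data are all assumptions chosen for illustration. The fingerprint function is loosely inspired by the key-collision idea used by clustering tools such as OpenRefine and does not reproduce that tool's actual implementation.

```python
import re
import statistics
import unicodedata
from collections import defaultdict


def flag_outliers(values, k=1.5):
    """Return the values lying outside [Q1 - k*IQR, Q3 + k*IQR] (assumed rule)."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def fingerprint(name):
    """Normalized key: accents and punctuation removed, lowercase, sorted unique tokens."""
    ascii_name = unicodedata.normalize("NFKD", name).encode("ascii", "ignore").decode("ascii")
    tokens = re.sub(r"[^\w\s]", " ", ascii_name.lower()).split()
    return " ".join(sorted(set(tokens)))


def group_name_variants(names):
    """Group the spellings that share the same fingerprint."""
    groups = defaultdict(list)
    for name in names:
        groups[fingerprint(name)].append(name)
    return dict(groups)


# Hypothetical sample data, for illustration only.
readings = [12.1, 11.8, 12.4, 12.0, 11.9, 98.0, 12.2]
names = ["Dupont, Hélène", "Hélène DUPONT", "helene dupont", "Jean Martin"]

print(flag_outliers(readings))     # candidate values to review or remove
print(group_name_variants(names))  # spellings grouped under a common key
```

Flagged values should be reviewed by the analyst rather than deleted automatically, since the text above speaks only of possible removal. Likewise, each group of name variants still needs a human decision on which spelling to keep.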
