2. Organizing your data
To organize your data, you need to understand the notions of structured, unstructured and semi-structured. Data homogeneity is imperative, and this is where the notion of structuring comes into play.
Aggregating data from several sources brings us face to face with the notions of formats, Mac, PC, Linux. In this field, there are thousands of formats that are more or less compatible with each other, depending on the publisher, and whether they are proprietary or royalty-free.
In addition, each user's input must be taken into account: for example, Paris, PARIS, paris, apris, are all variants in the creation of a CITY, Ville, VILLE_France or VILLE_FRANCE or PAYS_VILLE heading, as are numerical inputs such as 0.1; 1.0; 1.
The difficulty comes with the desire to automate the process, when updating data means constantly...
Exclusive to subscribers. 97% yet to be discovered!
You do not have access to this resource.
Click here to request your free trial access!
Already subscribed? Log in!
The Ultimate Scientific and Technical Reference
This article is included in
Management and innovation engineering
This offer includes:
Knowledge Base
Updated and enriched with articles validated by our scientific committees
Services
A set of exclusive tools to complement the resources
Practical Path
Operational and didactic, to guarantee the acquisition of transversal skills
Doc & Quiz
Interactive articles with quizzes, for constructive reading
Organizing your data
Bibliography
Also in our database
Bibliography
Using Scrapy to acquire online data and export to multiple output files , Matthew J. Holland.
Exclusive to subscribers. 97% yet to be discovered!
You do not have access to this resource.
Click here to request your free trial access!
Already subscribed? Log in!
The Ultimate Scientific and Technical Reference