Source extraction
Data warehouses
Article REF: H3870 V1

Authors: Claude CHRISMENT, Geneviève PUJOLLE, Franck RAVAT, Olivier TESTE, Gilles ZURFLUH

Publication date: February 10, 2005, Review date: April 28, 2016

3. Source extraction

Data warehouses are fed from multiple sources that are autonomous (managed by different, independent systems), heterogeneous (structurally or semantically), and possibly semi-structured or unstructured.

A first approach to building a warehouse from such sources is to write an ad hoc program for each one, which selects the relevant data from the source and adapts it to the requirements of the system managing the data warehouse. This approach is particularly demanding, especially when it comes to refreshing the warehouse and adapting to its evolution.
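As an illustration of such an ad hoc program, here is a minimal Python sketch for a single source. The CSV layout, the field names (`client`, `montant`, `date`) and the target row format are all assumptions made for this example; they do not come from the article.

```python
import csv
import io

# Hypothetical export from one operational source (layout is illustrative).
SALES_CSV = """id;client;montant;date
1;Dupont;120,50;2004-03-01
2;Martin;;2004-03-02
3;Durand;89,90;2004-03-02
"""


def extract_sales(text):
    """Ad hoc extractor for this one source.

    Selects the relevant fields and adapts them to an assumed warehouse
    format: customer name, amount as a float, ISO date string.
    """
    rows = []
    for rec in csv.DictReader(io.StringIO(text), delimiter=";"):
        if not rec["montant"]:
            continue  # selection: skip records without an amount
        rows.append({
            "customer": rec["client"],
            # adaptation: decimal comma in the source, float in the warehouse
            "amount": float(rec["montant"].replace(",", ".")),
            "date": rec["date"],
        })
    return rows


warehouse_rows = extract_sales(SALES_CSV)
```

Everything in `extract_sales` is specific to this one source, so a second source with a different layout would need its own program, and any change to the warehouse format would have to be repeated in each of them.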

An intermediate approach, designed to accommodate the heterogeneity of source management systems while preserving their autonomy, is to generate an image for each source (the role of the adapter) in a model compatible with that of the warehouse. A generic unification process then feeds the...
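The adapter idea can be sketched in Python as follows. This is only one possible reading of the approach: the `Record` type standing in for the warehouse-compatible model, the two adapters, the `unify` function and the sample data are hypothetical, and the sketch simply returns a list as the output of the unification step.

```python
import csv
import io
import json
from dataclasses import dataclass
from typing import Dict, Iterable, List


@dataclass
class Record:
    """One item of a source image, in the (assumed) common model."""
    source: str
    attributes: Dict[str, str]


def csv_adapter(name: str, text: str) -> List[Record]:
    """Adapter for a hypothetical source that exports CSV."""
    reader = csv.DictReader(io.StringIO(text))
    return [Record(name, dict(row)) for row in reader]


def json_adapter(name: str, text: str) -> List[Record]:
    """Adapter for a hypothetical source that exports a JSON array of objects."""
    return [Record(name, {key: str(value) for key, value in obj.items()})
            for obj in json.loads(text)]


def unify(images: Iterable[List[Record]]) -> List[Record]:
    """Generic unification: works on images only, never on source formats."""
    unified = []
    for image in images:
        unified.extend(image)
    return unified


# Each source keeps its own format; only its adapter knows about it.
images = [
    csv_adapter("crm", "customer,city\nDupont,Toulouse\n"),
    json_adapter("billing", '[{"customer": "Martin", "amount": 42}]'),
]
unified = unify(images)
```

In this sketch, supporting a new source means writing one more adapter, while `unify` stays unchanged because it only ever sees records in the common model.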
