Related Communities:

Combined Virtual and Materialized Environment for Integration of Large Heterogeneous Data Collections

Combined Virtual and Materialized Environment for Integration of Large Heterogeneous Data Collections.

Author(s): Sergey Stupnikov, Alexey Vovchenko.
Created:2014/10/13
Published:16th Russian Conference on Digital Libraries RCDL 2014 Proceedings. CEUR Workshop Proceedings 1297:339-348. (In Russian)
Abstract:
Architecture of a combined virtual and materialized environment for integration of heterogeneous data collections is provided. Collections are assumed to contain structured, semi-structured or unstructured data. Combination of virtual and materialized integration is motivated by advantages and disadvantages of both approaches. Virtual integration is provided by subject mediation technology. Materialized integration is provided by Hadoop (open source software framework for storage and distributed processing of large datasets) accompanied by a system implementing relational warehouse over Hadoop (as examples, Hive and Big SQL are considered).
Download: [ Adobe PDF ]

Supported by Synthesis Group