An Extensible Approach to Searching and Selecting Data Sources for Materialized Big Data Integration in Distributed Computing Environments
| Author(s): | Sazontev V., Stupnikov S. A. |
| Published: | Pattern Recognition and Image Analysis, 2023. Vol. 33. Iss. 2. P. 147–156. |
| Abstract: | This work addresses big data integration in distributed computing environments. One of the challenges of data integration is searching for and selecting relevant data sources. Many existing systems act as registries that contain descriptions of, and links to, data sources. These registries implement various types of search, such as keyword search and semantic search. An extensible approach is proposed for embedding various types of data source retrieval systems into a materialized big data integration system deployed in a distributed computing environment. An automated process for searching for and selecting relevant data sources within the integration system is described. The implemented software components are described, and the approach is illustrated by embedding one of the search systems into a prototype data integration system. |
| Download: | https://link.springer.com/article/10.1134/S1054661823020141 |