A Transformation of the RDF Mapping Language into a High-Level Data Analysis Language for Execution in a Distributed Computing Environment
| Author(s): | Tang W., Stupnikov S. A. |
| Published: | Communications in Computer and Information Science: 22nd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2020 (Virtual Online, 13-16 October 2020). – Springer Science and Business Media Deutschland GmbH, 2021. Vol. 1427. P. 74 – 91. |
| Abstract: | |
| Nowadays scientific data should be FAIR, that is, Findable, Accessible, Interoperable and Reusable. The recently proposed reference implementation of the FAIR data management principles treats RDF as the unifying data model and the RDF Mapping Language (RML) as the basic language for data integration. This paper aims to develop methods and tools for scalable data integration within this architecture. A mapping from RML into Pig Latin, a high-level data analysis language that runs on Hadoop, is considered. The mapping is implemented using model transformation technologies. This allows RML programs to be executed in the Hadoop distributed computing environment. According to the experimental evaluation, the developed RML implementation scales with respect to data volume and outperforms related implementations. |
| Download: |
[ https://link.springer.com/chapter/10.1007/978-3-030-81200-3_6 ]