Related Communities:

A Cloud-Native Serverless Approach for Implementation of Batch Extract-Load Processes in Data Lakes

A Cloud-Native Serverless Approach for Implementation of Batch Extract-Load Processes in Data Lakes

Author(s): Bryzgalov A., Stupnikov S. A.
Published:Communications in Computer and Information Science: 22nd International Conference on Data Analytics and Management in Data Intensive Domains, DAMDID/RCDL 2020 (Virtual Online, 13-16 October 2020). – Springer Science and Business Media Deutschland GmbH, 2021. Vol. 1427. P. 27 – 42.
Abstract:
The paper presents an approach to deal with batch extract-load processes for cloud data lakes. The approach combines multiple data ingestion techniques, provides advanced failover strategies and adopts cloud-native implementation. The suggested approach. The prototype implementation utilizes Amazon Web Services platform and is based on its serverless features. The approach can be implemented also using other cloud platforms like Google Cloud Platform or Microsoft Azure.
Download: [ https://link.springer.com/chapter/10.1007/978-3-030-81200-3_3 ]

Supported by Synthesis Group