Thursday, July 22, 2021

Data Platform: Data Ingestion Engine for Data Lake

Introduction

This article is a follow-up to Data Platform as a Service and Data Platform: The New Generation Data Lakes. Here, I describe how to design and build an automated Data Ingestion Engine based on Spark and Databricks features.

The most important principle in designing a Data Ingestion Engine is to follow an automation paradigm. Automation provides a set of key advantages, some of which are shown in the following diagram:



from DZone.com Feed https://ift.tt/2UCbjEZ
