Friday, May 3, 2019

Intelligent Governance for Big Data

Data governance in traditional data warehouses is often responsible for many aspects of the data, such as:

  • Data Quality – Consumable data should be valid.
  • Identifying PII elements.
  • Identifying critical data elements.
  • User roles and access permissions.

When you have data and data which is flowing fast with variety into the ecosystem, the biggest challenge is to govern the data. But in a big data environment, where data flows fast with inferred run time schema, the need to govern data is often realized at run time. How can we find out if the data contains PII, if it’s valid data, if it’s critical data, which domain it belongs to, etc.?



from DZone.com Feed http://bit.ly/2DOCDEu

No comments:

Post a Comment