Exploring historical data changes
Beginner | 10 Minutes
Overview
Enterprise data is becoming increasingly cloud-native and scattered across different systems of records and storage. This makes it challenging to keep track of how its history changes, especially once it is transformed and loaded into reporting-ready data warehouses and marts. This presents a host of business and governance challenges when auditing is required, as well as limiting factors when historical context is eventually needed for machine learning and data science applications.
This tutorial walks you through how to view historical changes in data in the Datalake via various tools.
📋 Requirements
|
Tutorial
The Infor Data Lake, as a central repository for enterprise data, and by virtue of being an immutable object storage architecture, can retain the full history of records ingested into it. With its built-in data versioning abilities and synthetic functions to expose previous or deleted versions of records stored in it, the Data Lake allows building queries that travel back in time to investigate data changes, deleted records from systems of records, and unlocks a host of use cases for data science and machine learning applications.
As data is ingested in the Data Lake, data objects are indexed for future retrieval and querying. A number of properties are added and are specifically used to create what we commonly refer to as synthetic columns . These columns exist as queryable platform metadata and can be useful in data processing and exploration. Particularly, in this scenario, these can be used to effectively query across time and review version changes in data replicated from systems of records.
In this tutorial, we'll focus on tracking historical changes for specific items in the MITBAL table which comes from Infor M3. The table contains item details per warehouse. This table is part of a provisioned replication set for CloudSuite tenants where changes to the table are replicated to Data Lake at preset intervals.
Additional Resources:
This video provides an overview of the infor.lastmodified() function covering most of the concepts included in this tutorial:
Overview of infor.lastmodified() query function
Best Practices
What made this section unhelpful for you?
On this page
- Exploring historical data changes