Data Quality
Gathering statistics about data quality. For example, a telecom company might determine the correctness of customer data by comparing two sources or validating the data using a set of business rules.Data Credibility
Analysis of the credibility of data. For example, an investor might evaluate a set of historical social media data to see if there is any useful correlation between social media chatter and stock prices.Data Lineage
Tracing data to its sources and calculation methods.Compliance & Risks
Analysis of data for compliance and risk purposes. For example, verifying that a dataset doesn't contain personally identifiable data.Information Security
Analysis of data for purposes of information security such as verifying that fields are properly encrypted.Capacity Management
Looking at how data is growing in order to plan capacity and budget.Retention
Evaluating data in order to determine a retention schedule. For example, a team may have mysterious pools of dark data that it would like to purge but seek statistics to confirm the data isn't used.Overview: Data Profiling | ||
Type | ||
Definition | Analysis of datasets to determine information and statistics related to the data itself. | |
Related Concepts |