7 Examples of Data Proliferation

Data proliferation is the rapid growth of data. The term tends to have negative connotations, as it is often used to describe data that is replicated and of low quality. Such data can be expensive to clean up, manage and govern. In many cases, data repositories become compliance and operational risks that have little value to an organization but are difficult to discard, as analysis may be required to understand their structure, sources and uses. The following are illustrative examples of data proliferation.

Customer Data

It is common for multiple systems in an organization to maintain customer data. Such data is commonly out of sync between systems with no clear single source of truth. This can cause operational failures such as sending a bill to the wrong address.
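As a minimal sketch of how such drift might be detected, the following compares customer addresses held in two systems and reports the customers whose records disagree. The system names (`billing`, `crm`), the record layout and the helper functions are hypothetical and chosen only for illustration; real customer matching usually needs far more careful normalization than this.

```python
def normalize(address):
    """Lowercase and collapse whitespace so trivial formatting
    differences are not reported as conflicts."""
    return " ".join(address.lower().split())


def find_address_conflicts(billing, crm):
    """Return {customer_id: (billing_address, crm_address)} for customers
    that exist in both systems but have different addresses.

    Both arguments are hypothetical dicts mapping customer ID to address.
    """
    conflicts = {}
    # Only customers present in both systems can be compared.
    for customer_id in billing.keys() & crm.keys():
        if normalize(billing[customer_id]) != normalize(crm[customer_id]):
            conflicts[customer_id] = (billing[customer_id], crm[customer_id])
    return conflicts


# Hypothetical sample data for two systems that both hold customer records.
billing = {"c1": "1 Main St", "c2": "5 Oak Ave", "c3": "9 Elm Rd"}
crm = {"c1": "1  main st", "c2": "7 Pine Ln", "c4": "3 Birch Ct"}

conflicts = find_address_conflicts(billing, crm)
```

Here only `c2` is reported: `c1` differs in formatting alone, while `c3` and `c4` each exist in just one system and so cannot be compared. A report like this shows where systems disagree but not which one is correct, which is why a single source of truth matters.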

Documents

Knowledge workers tend to create a lot of documents that get checked into a document management system. In many cases, such documents become completely unused with time but are retained as a precaution.

Communication

Communications such as emails can accumulate at a rate of hundreds per employee per day. Most communications lose their value almost immediately but are often retained for an extended period of time.

Backups

Backups of data, documents and communications often need to be retained in case something important was deleted from the source systems. If someone deletes a critical email, the only copy may be in a backup from a particular day last year. As such, backups are commonly stored for long periods of time. This can consume considerable resources despite the fact that backups are rarely used.
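One way to limit the resources that backups consume is to keep them only for a fixed window. The sketch below assumes an organization has already decided on such a window; the function name, the 365-day figure and the sample dates are hypothetical, and a real policy would also need to account for legal and compliance requirements.

```python
from datetime import date, timedelta


def expired_backups(backup_dates, today, retention_days):
    """Return, in chronological order, the backup dates that fall
    outside the retention window ending on `today`."""
    cutoff = today - timedelta(days=retention_days)
    # A backup taken exactly on the cutoff date is still retained.
    return sorted(d for d in backup_dates if d < cutoff)


# Hypothetical example: daily backups identified by the day they were taken.
backups = [date(2023, 1, 5), date(2023, 6, 1), date(2024, 1, 2)]
to_delete = expired_backups(backups, today=date(2024, 1, 10), retention_days=365)
```

With a 365-day window ending on 10 January 2024, only the backup from 5 January 2023 is selected. The trade-off is the one described above: a shorter window saves storage but increases the chance that the only copy of a deleted item has already been discarded.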

Transactional Data

Transactional data such as market trades and ecommerce purchases can grow extremely quickly. Transactional data is often viewed as valuable for historical research. For example, it is common to look at patterns in stock trades going back decades.

Social Data

Social data is data shared by people on a public or private social network. It is often viewed as valuable for purposes such as market research and machine learning.

Sensors & Machines

Machines and sensors generate data automatically. Sensors have become cheap to the extent that they can be embedded in everyday objects in great numbers. Such data may generally be less valuable than human-generated data. For example, video of a train tunnel or data from a tire pressure sensor isn't interesting for long. Nevertheless, sensor data potentially represents a gigantic source of data that is far larger than all other sources combined.

Overview: Data Proliferation
Definition: Rapid data growth.