What is Data Dredging?

Data dredging is the automated statistical analysis of large sets of data.
This is similar to data mining. The key difference, as framed here, is that data mining starts with a hypothesis, or something that you expect to find in the data, whereas data dredging is an automated search for statistical patterns that doesn't start with one.
Data dredging tests large sets of data against known statistical models to generate matches. Because so many comparisons are run, it risks finding coincidental patterns that have no real meaning. In other words, it is a process of finding a pattern that fits the data rather than confirming a pattern with data. Data dredging therefore raises ethical issues, because it's an easy way to create a research paper that looks valid but is essentially auto-generated. Nevertheless, the technique does have the potential to discover meaningful patterns in data.
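To make the risk of coincidental patterns concrete, here is a minimal sketch in Python. It is an illustration rather than a reference implementation: the function names (pearson, dredge, chance_of_false_positive), the sample sizes and the 0.05 threshold are all assumptions chosen for the example. It fills a table with pure random noise, scans every column for correlation with a random target, and reports the strongest "pattern" it finds.

```python
import random
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def dredge(n_rows=30, n_features=1000, seed=0):
    """Correlate many pure-noise features with a pure-noise target.

    Hypothetical helper: every value is random, so any strong
    correlation it returns is coincidental by construction.
    """
    rng = random.Random(seed)
    target = [rng.gauss(0, 1) for _ in range(n_rows)]
    correlations = []
    for _ in range(n_features):
        feature = [rng.gauss(0, 1) for _ in range(n_rows)]
        correlations.append(pearson(feature, target))
    return correlations


def chance_of_false_positive(n_tests, alpha=0.05):
    """Probability that at least one of n_tests independent tests passes
    the alpha threshold by chance alone (assumes independence)."""
    return 1 - (1 - alpha) ** n_tests


correlations = dredge()
first = abs(correlations[0])                      # one test chosen up front
strongest = max(abs(r) for r in correlations)     # best of 1000 after the fact

print(f"first feature tested: |r| = {first:.2f}")
print(f"strongest of {len(correlations)} features: |r| = {strongest:.2f}")
print(f"chance of a false positive in 20 tests: {chance_of_false_positive(20):.2f}")
```

The strongest correlation is picked after looking at all 1000 columns, so it can look convincing even though no relationship exists. Under the independence assumption, running just 20 tests at a 0.05 threshold already gives roughly a 64% chance of at least one chance "finding".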
Overview: Data Dredging
Function
Definition: An automated search for statistical patterns in data.
Value: Pattern discovery as a starting point of analysis.
Risk: Data dredging tends to produce patterns that arise by chance and have no meaning. In other words, results that look statistically significant may be spurious.