Data Outliers: 10 Ways To Prevent Big Data Damage - InformationWeek

InformationWeek is part of the Informa Tech Division of Informa PLC

This site is operated by a business or businesses owned by Informa PLC and all copyright resides with them.Informa PLC's registered office is 5 Howick Place, London SW1P 1WG. Registered in England and Wales. Number 8860726.

IoT
IoT
Data Management // Big Data Analytics
News
4/18/2016
07:06 AM
Lisa Morgan
Lisa Morgan
Slideshows
Connect Directly
Twitter
RSS
E-Mail

Data Outliers: 10 Ways To Prevent Big Data Damage

Most business decision-makers aren't trained to understand data outliers, but they can learn the basics. Executives, managers, and employees without math degrees can ask smarter questions about analyses they're basing crucial judgments on. Here are some things to know.
2 of 11

There's A Data Quality Problem 

People and machines may be responsible for poor quality data that makes its way into an analysis. Someone may have typed a wrong number or transposed digits in a number. Alternatively, a piece of equipment may report an erroneous value that skews the analysis. It is also possible for data to be corrupted while it is transported across a network. 
'The importance of removing or fixing an outlier depends on how extreme it is. If it was caused by a minor mistake, it may not matter very much. But if the outlier is extreme, it may negatively impact your analysis and lead to the wrong conclusion,' said Spencer Greenberg. 'If your outlier is caused by a mistake, you want to remove the value or fix it. And the more extreme it is, the more important it is to do that.'     
(Image: stevepb via Pixabay)

There's A Data Quality Problem

People and machines may be responsible for poor quality data that makes its way into an analysis. Someone may have typed a wrong number or transposed digits in a number. Alternatively, a piece of equipment may report an erroneous value that skews the analysis. It is also possible for data to be corrupted while it is transported across a network.

"The importance of removing or fixing an outlier depends on how extreme it is. If it was caused by a minor mistake, it may not matter very much. But if the outlier is extreme, it may negatively impact your analysis and lead to the wrong conclusion," said Spencer Greenberg. "If your outlier is caused by a mistake, you want to remove the value or fix it. And the more extreme it is, the more important it is to do that."

(Image: stevepb via Pixabay)

2 of 11
Comment  | 
Print  | 
News
Rethinking IT: Tech Investments that Drive Business Growth
Jessica Davis, Senior Editor, Enterprise Apps,  10/3/2019
Slideshows
IT Careers: 12 Job Skills in Demand for 2020
Cynthia Harvey, Freelance Journalist, InformationWeek,  10/1/2019
Commentary
Six Inevitable Technologies and the Milestones They Unlock
Guest Commentary, Guest Commentary,  10/3/2019
White Papers
Register for InformationWeek Newsletters
Video
Current Issue
Data Science and AI in the Fast Lane
This IT Trend Report will help you gain insight into how quickly and dramatically data science is influencing how enterprises are managed and where they will derive business success. Read the report today!
Slideshows
Flash Poll