Q:

# How do outliers affect mean, median, mode and range in a set of data?

A:

A mathematical outlier, which is a value vastly different from the majority of data, causes a skewed or misleading distribution in certain measures of central tendency within a data set, namely the mean and range, according to About Statistics. The affected mean or range incorrectly displays a bias toward the outlier value. The median and mode values, which express other measures of central tendency, are largely unaffected by an outlier.

## Keep Learning

Credit: Dave Dugdale CC-BY-SA 2.0

The purpose of analyzing a set of numerical data is to define accurate measures of central tendency, also called measures of central location. The Engineering Statistics Handbook defines an outlier as “an observation that lies an abnormal distance from the other values in a random sample from a population.”

Lærd Statistics explains that the mean is the single measurement most influenced by the presence of outliers because its result utilizes every value in the data set. The median, which is the middle score within a data set, is the least affected. The interquartile range, which breaks the data set into a five number summary (lowest value, first quartile, median, third quartile and highest value) is used to determine if an outlier is present. The Engineering Statistics Handbook suggests that outliers should be investigated before being discarded to potentially uncover errors in the data gathering process.

Sources:

## Related Questions

• A: Finding the mode of a data set involves finding the data values that appear most often in the set. You need all the values of the data in the set.... Full Answer >
Filed Under:
• A: The mode of a set of numbers is the value which occurs the most often in the set. For example, the mode of {1, 2, 3, 2, 4} is 2, since it occurs twice.... Full Answer >
Filed Under:
• A: The box-and-whisker plot is a technique in statistics that graphically shows the distribution of a set of data involving the minimum and maximum values, as... Full Answer >
Filed Under:
• A: In a grouped frequency distribution, data is sorted and separated into groups called classes, whereas in an ungrouped frequency distribution, a listing is ... Full Answer >
Filed Under:
PEOPLE SEARCH FOR