Quantile-Quantile Plots

Some example QQ plots

Author
Affiliation

Jon Yearsley

School of Biology and Environmental Science, UCD

Published

January 1, 2024

Introduction

To help you identify different types of distributions from a quantile-quantile plot, we give examples of histograms and quantile-quantile plots for five qualitatively different distributions:

  • A normal distribution
  • A right-skewed distribution
  • A left-skewed distribution
  • An under-dispersed distribution
  • An over-dispersed distribution

Normally distributed data

Below is an example of data (150 observations) that are drawn from a normal distribution. The normal distribution is symmetric, so it has no skew (the mean is equal to the median).

On a Q-Q plot normally distributed data appears as roughly a straight line (although the ends of the Q-Q plot often start to deviate from the straight line).

Histogram (left) and QQ-plot (right) for normal data

Right-skewed data

Below is an example of data (150 observations) that are drawn from a distribution that is right-skewed (in this case it is the exponential distribution). Right-skew is also known as positive skew.

On a Q-Q plot right-skewed data appears as a concave curve.

You can start to understand this pattern by considering the smallest and largest observations (left-hand and right-hand sides of the QQ-plot, respectively):

  • the smallest observations are larger than you would expect from a normal distribution (i.e. the points are above the line on the QQ-plot). This means the lower tail of the data’s distribution has been reduced, relative to a normal distribution.
  • the largest observations are larger than you would expect from a normal distribution (i.e. the points are above the line on the QQ-plot). This means the upper tail of the data’s distribution has been extended, relative to a normal distribution.

Histogram (left) and QQ-plot (right) for right-skewed data

Left-skewed data

Below is an example of data (150 observations) that are drawn from a distribution that is left-skewed (in this case it is a negative exponential distribution). Left-skew is also known as negative skew.

On a Q-Q plot left-skewed data appears as a concave curve (the opposite of right-skewed data).

You can start to understand this pattern by considering the smallest and largest observations (left-hand and right-hand sides of the QQ-plot, respectively):

  • the smallest observations are smaller than you would expect from a normal distribution (i.e. the points are below the line on the QQ-plot). This means the lower tail of the data’s distribution has been extended, relative to a normal distribution.
  • the largest observations are smaller than you would expect from a normal distribution (i.e. the points are below the line on the QQ-plot). This means the upper tail of the data’s distribution has been reduced, relative to a normal distribution.

Histogram (left) and QQ-plot (right) for left-skewed data

Under-dispersed data

Below is an example of data (150 observations) that are drawn from a distribution that is under-dispersed relative to a normal distribution (in this case it is the uniform distribution). Under-dispersed data has a reduced number of outliers (i.e. the distribution has thinner tails than a normal distribution). Under-dispersed data is also known as having a platykurtic distribution and as having negative excess kurtosis.

On a Q-Q plot under-dispersed data appears S shaped.

You can start to understand this pattern by considering the smallest and largest observations (left-hand and right-hand sides of the QQ-plot, respectively):

  • the smallest observations are larger than you would expect from a normal distribution (i.e. the points are above the line on the QQ-plot). This means the lower tail of the data’s distribution has been reduced, relative to a normal distribution.
  • the largest observations are less than you would expect from a normal distribution (i.e. the points are below the line on the QQ-plot). This means the upper tail of the data’s distribution has been reduced, relative to a normal distribution.

Histogram (left) and QQ-plot (right) for under-dispersed data

Over-dispersed data

Below is an example of data (150 observations) that are drawn from a distribution that is over-dispersed relative to a normal distribution (in this case it is a Laplace distribution). Over-dispersed data has an increased number of outliers (i.e. the distribution has fatter tails than a normal distribution). Over-dispersed data is also known as having a leptokurtic distribution and as having positive excess kurtosis.

On a Q-Q plot over-dispersed data appears as a flipped S shape (the opposite of under-dispersed data).

You can start to understand this pattern by considering the smallest and largest observations (left-hand and right-hand sides of the QQ-plot, respectively):

  • the smallest observations are smaller than you would expect from a normal distribution (i.e. the points are below the line on the QQ-plot). This means the lower tail of the data’s distribution has been extended, relative to a normal distribution.
  • the largest observations are larger than you would expect from a normal distribution (i.e. the points are above the line on the QQ-plot). This means the upper tail of the data’s distribution has been extended, relative to a normal distribution.

Histogram (left) and QQ-plot (right) for overdispersed data