Welcome to the next part of the course. In the previous part, we learned how to visualize the distribution of one categorical variable using a bar chart. Now we'll learn how to do the same for a numerical variable.

Our dataset has an interesting numerical variable, consumption, which represents how many liters of alcohol were consumed per person in each country:

              country        pattern consumption
1             Albania   medium risky        5.28
2             Algeria somewhat risky        0.45
3             Andorra    least risky       10.60
4              Angola   medium risky        7.80
5 Antigua and Barbuda   medium risky        7.84

To understand this variable better, we can pose some specific questions about its distribution:

  • What was the lowest amount of per-capita alcohol consumption in 2010? What was the highest?
  • Is there one very frequent value? Is there a narrow range of frequent values? Or do all values appear equally often?
  • If there is one value or range of values that is very frequent, what is it?
  • How many countries consumed more than 14 liters of alcohol per person per year?
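
Each of these questions maps to a simple computation. The following is a minimal sketch in Python, using only the five preview rows shown above, so its results describe that small sample and not the full dataset. The names sample, BIN_WIDTH, and THRESHOLD are illustrative assumptions, and the 2-liter range width is an arbitrary choice made only to show how values can be grouped.

```python
from collections import Counter

# The five preview rows shown above (NOT the full dataset).
sample = [
    ("Albania", 5.28),
    ("Algeria", 0.45),
    ("Andorra", 10.60),
    ("Angola", 7.80),
    ("Antigua and Barbuda", 7.84),
]

values = [consumption for _, consumption in sample]

# Question 1: lowest and highest per-capita consumption.
lowest = min(values)
highest = max(values)

# Questions 2 and 3: group values into equal-width ranges and count
# how many countries fall into each one.
BIN_WIDTH = 2  # assumed width in liters, chosen for illustration
bin_counts = Counter(int(v // BIN_WIDTH) * BIN_WIDTH for v in values)
most_common_start, most_common_count = bin_counts.most_common(1)[0]

# Question 4: how many countries exceed a given threshold.
THRESHOLD = 14  # liters per person per year, from the question above
above_threshold = sum(1 for v in values if v > THRESHOLD)

print("lowest:", lowest)
print("highest:", highest)
print("most frequent range starts at:", most_common_start,
      "with", most_common_count, "countries")
print("countries above threshold:", above_threshold)
```

On these five rows, the range from 6 to 8 liters contains two countries and no country exceeds 14 liters. Counting values per range like this is the same idea a chart of a numerical distribution presents visually.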