Welcome to the next part of the course. In the previous part, we learned how to visualize the distribution of one categorical variable using a bar chart. Now we'll learn how to do the same for a numerical variable.
Our dataset has an interesting numerical variable, consumption, which represents how many litres of alcohol were consumed per person in each country:
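|   | country             | pattern        | consumption |
|---|---------------------|----------------|-------------|
| 1 | Albania             | medium risky   | 5.28        |
| 2 | Algeria             | somewhat risky | 0.45        |
| 3 | Andorra             | least risky    | 10.60       |
| 4 | Angola              | medium risky   | 7.80        |
| 5 | Antigua and Barbuda | medium risky   | 7.84        |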
To understand this variable better, we can pose some specific questions about its distribution:
- What was the lowest amount of per-capita alcohol consumption in 2010? What was the highest?
- Is there one very frequent value? Is there a narrow range of frequent values? Or do all values appear equally often?
- If there is one value or range of values that is very frequent, what is it?
- How many countries consume more than 14 litres of alcohol per person per year?
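
To make these questions concrete, here is a minimal Python sketch of how each one could be turned into a computation. It uses only the five rows shown in the table above, so the answers describe that small sample and not the full dataset. The variable names and the 5-litre bin width are assumptions chosen for illustration; they are not part of the course material.

```python
from collections import Counter

# The five sample rows shown above (NOT the full dataset).
rows = [
    ("Albania", "medium risky", 5.28),
    ("Algeria", "somewhat risky", 0.45),
    ("Andorra", "least risky", 10.60),
    ("Angola", "medium risky", 7.80),
    ("Antigua and Barbuda", "medium risky", 7.84),
]

# Keep only the numerical variable we are interested in.
consumption = [litres for _, _, litres in rows]

# Question 1: the lowest and highest values.
lowest = min(consumption)
highest = max(consumption)

# Questions 2 and 3: group values into ranges (bins) and count each range.
# The bin width of 5 litres is an assumption made for this sketch.
bin_width = 5
bins = Counter(int(value // bin_width) * bin_width for value in consumption)
most_frequent_start, most_frequent_count = bins.most_common(1)[0]

# Question 4: how many countries exceed a given threshold.
threshold = 14
above_threshold = sum(1 for value in consumption if value > threshold)

print("lowest:", lowest, "highest:", highest)
print("most frequent range starts at:", most_frequent_start,
      "with", most_frequent_count, "countries")
print("countries above", threshold, "litres:", above_threshold)
```

Counting how many values fall into each range is the same idea that a chart of a numerical distribution presents visually, which is why the choice of range width matters for what the picture shows.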