Introduction
Missing values
Duplicate rows
6. Introduction to duplicates
Outliers
Joining datasets
Summary

Instruction

Good! Another frequent problem with datasets is duplicate rows, i.e. rows that contain the same data as some other rows.

This time, we'll work with a dataset from some mysterious company. The dataset contains sales values for each state of the United States. Let's load and examine it.

Exercise

Load data from states_sales.csv into a variable called states_sales.

As you can see, there are two identical rows for Alaska and two identical rows for Idaho.

Stuck? Here's a hint!

Use the pd.read_csv function as before.