Rainbow Deals - hours only!Up to 80% off on all courses and bundles.-Close
Missing values
Duplicate rows
6. Introduction to duplicates
Joining datasets


Good! Another frequent problem with datasets is duplicate rows, i.e. rows that contain the same data as some other rows.

This time, we'll work with a dataset from some mysterious company. The dataset contains sales values for each state of the United States. Let's load and examine it.


Load data from states_sales.csv into a variable called states_sales.

As you can see, there are two identical rows for Alaska and two identical rows for Idaho.

Stuck? Here's a hint!

Use the pd.read_csv function as before.