Introduction
Missing values
Duplicate rows
Outliers
Joining datasets
Summary
18. Quiz Question

Instruction

Alright, this time, instead of a few questions, we've prepared a longer task for you!

Exercise

You are given two CSV files:

  • patient_info.csv,
  • patient_results.csv.

You can study their contents by clicking the "CSV" button on the right.

For patient_info.csv:

  • Delete obvious outlier(s).
  • Remove duplicates.

For patient_results.csv:

  • Fill empty values with 0.0.

Create a new variable all_patient_data. This variable will be a DataFrame with data from both CSV files. To merge both datasets, use the patient_id and id columns of the respective datasets. Do not include patient info with no results, and vice versa – do not show results without patient info.

Good luck!