Dealing with duplicated data
Dealing with duplicated data Data Science Project

Project Activities

All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.

All our activities include solutions with explanations on how they work and why we chose them.


Check if any of the record of the dataset is duplicated


Your task is to check for duplicates in a specific column of a given DataFrame. Select the correct code


Drop duplicated

Your task is to drop duplicate rows from a given DataFrame and retain the first occurrence of each duplicated row. Select the correct code.

Dealing with duplicated dataDealing with duplicated data

Verónica Barraza

This project is part of

Introduction to Supervised Learning with scikit-learn

Explore other projects