Dealing with duplicated data
Dealing with duplicated data Data Science Project
Introduction to Supervised Learning with scikit-learn

Dealing with duplicated data

In this lab, we will explore and analyze the data on calls for service from New Orleans. The dataset used in this example is available at the following link: https://data.nola.gov/. Specifically, we will focus on the dataset from the year 2015. During this lab, our main focus will be on dealing with duplicated data.
Start this project
Dealing with duplicated dataDealing with duplicated data
Project Created by

Verónica Barraza

Project Activities

All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.

All our activities include solutions with explanations on how they work and why we chose them.

multiplechoice

Check if any of the record of the dataset is duplicated

multiplechoice

Your task is to check for duplicates in a specific column of a given DataFrame. Select the correct code

multiplechoice

Drop duplicated

Your task is to drop duplicate rows from a given DataFrame and retain the first occurrence of each duplicated row. Select the correct code.

Dealing with duplicated dataDealing with duplicated data
Project Created by

Verónica Barraza

This project is part of

Introduction to Supervised Learning with scikit-learn

Explore other projects