From Raw to Refined: Hands-On Mobile Phone Data Cleaning
From Raw to Refined: Hands-On Mobile Phone Data Cleaning Data Science Project
Data Cleaning with Pandas

From Raw to Refined: Hands-On Mobile Phone Data Cleaning

Begin your practical journey of data refinement with our hands-on project focused on mobile phone data. Engage in a series of interactive activities designed to empower you in cleaning invalid values. Get your hands dirty!
Start this project
From Raw to Refined: Hands-On Mobile Phone Data CleaningFrom Raw to Refined: Hands-On Mobile Phone Data Cleaning
Project Created by

Lohith Unnam

Project Activities

All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.

All our activities include solutions with explanations on how they work and why we chose them.

codevalidated

Remove Irrelevant Column

Remove the column labeled Unnamed: 0 from the dataframe df as it does not contain any meaningful information related to mobile phone specifications or prices.

This is how your result will look like:

Activity 01

codevalidated

Rename Column Name: `Brand me` to `Model_Name`

Renamed the column Brand me to Model Name to clarify the data it represents within the dataset.

This is how your result will look like:

Activity 02

codevalidated

Remove Duplicate Entries

Eliminate duplicate entries from the dataframe df to ensure each record is unique and maintain data consistency.

This is how your result will look like:

Activity 03

codevalidated

Remove Null Values

Remove all null values from the dataframe df to uphold data integrity.

This is how your result will look like:

Activity 04

codevalidated

Remove Invalid Ratings

Filter out ratings less than 1 and ratings equal to #FIELD!.

This is how your result will look like:

Activity 05

codevalidated

Strip `GB` Unit from `RAM` and `ROM` Columns

Remove the GB from the RAM and ROM columns (do not forget to remove the trailing whitespace).

This is how your result will look like:

Activity 06

codevalidated

Convert `RAM` and `ROM` Columns to Float Data Type

This is how your result will look like:

Activity 07

codevalidated

Convert `RAM` and `ROM` Columns to Integer Data Type

This is how your result will look like:

Activity 08

codevalidated

Exclude the mobiles which have `RAM` greater than `16GB`

This is how your result will look like:

Activity 09

codevalidated

Remove the `inches` in the `Mobile_Size` column and Convert it into appropriate data type

Do not forget to remove the trailing whitespace.

This is how your result will look like:

Activity 10

codevalidated

Primary Camera Resolution Refinement: Strip `MP` and Convert to Integer

Remove MP from Primary_Cam column (do not forget to remove the trailing whitespace) then convert it from object to integer.

This is how your result will look like:

Activity 11

codevalidated

Selfie Camera Cleaning: Strip `MP` and Convert to Integer

Remove MP from Selfi_Cam column (do not forget to remove the trailing whitespace) then convert it from object to integer. Remember to first convert it to float and then to int.

This is how your result will look like:

Activity 12

codevalidated

Extract Battery Power Values from String Format

Remove mAh from Battery_Power column (do not forget to remove the trailing whitespace).

This is how your result will look like:

Activity 13

codevalidated

Battery Power: Remove Underscores and Convert to Integer

Remove the underscore that are present in few records of Battery_Power column and then convert it to integer.

This is how your result will look like:

Activity 14

codevalidated

Exclude Invalid Battery Power Values

Remove values in the Battery_Power column that are less than or equal to zero and greater than 6000.

This is how your result will look like:

Activity 15

codevalidated

Format Price Data: Remove Currency Symbol and Underscores

Remove the Rs. prefix from the Price column (do not forget to remove the trailing whitespace), then eliminate any underscores present in the data, and finally convert it into numerical format from object.

This is how your result will look like:

Activity 16

codevalidated

Filter Invalid Price Values

Exclude all records with prices less than or equal to zero and greater than 140300.

This is how your result will look like:

Activity 17

codevalidated

Enhance Model Names: Removing Parentheses and Their Contents

Removing any text enclosed within parentheses, including the parentheses themselves.

This is how your result will look like:

Activity 18

From Raw to Refined: Hands-On Mobile Phone Data CleaningFrom Raw to Refined: Hands-On Mobile Phone Data Cleaning
Project Created by

Lohith Unnam

This project is part of

Data Cleaning with Pandas

Explore other projects