DataFrames practice: working with English Words
DataFrames practice: working with English Words Data Science Project

DataFrames practice: working with English Words

In this project you'll use several skills related to DataFrames to create new columns, analyze a DataFrame and answer questions about the dataset we're using: a huge list of words of the English Language.

Project Activities

All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.

All our activities include solutions with explanations on how they work and why we chose them.

How many elements does this dataframe have?

Basically, how many words have we read from the file?

input
What is the value of the word `microspectrophotometries`?
input
What is the highest possible value of a word?
input
Which of the following words have a Char Count of `7` and a Value of `87` ?
multiplechoice
What is the highest possible length of a word?
input
What is the word with the value of `319`?

The word microspectrophotometries is the only word with a value of 317. Find the only word with a value of 319:

input
What is the most common value?

There are 1921 words with a value of 100, making it the second most common letter sum. What value is most common?

input
What is the shortest word with value `274`?
input
Create a column `Ratio` which represents the 'Value Ratio' of a word

The "Value Ratio" of a word is defined as the Value of a word divided by its length. That is: ValueRatio(word) = Value(word) / len(word).

Create a new column in your DataFrame named Ratio that contains such computation.

codevalidated
What is the maximum value of `Ratio`?
input
What word is the one with the highest `Ratio`?

In the previous activity you compute the maximum Ratio. Now answer, which word has such Ratio?

input
How many words have a `Ratio` of `10`?
input
What is the maximum `Value` of all the words with a `Ratio` of `10`?
input
Of those words with a `Value` of `260`, what is the lowest `Char Count` found?
input
Based on the previous task, what word is it?

What word has a Value of 260 and has the lowest Char Count?

input
DataFrames practice: working with English WordsDataFrames practice: working with English Words
Author
Santiago Basulto

This Project is part of our

Intro to Pandas for Data Analysis

Skill Track

Explore other projects