All our Data Science projects include bite-sized activities to test your knowledge and practice in an environment with constant feedback.
All our activities include solutions with explanations on how they work and why we chose them.
Perform the dropping on the original dataframe movies_df
.
Store the resulting dataframe in the variable movies_ratings_df
.
Store the result in the variable avg_ratings
.
Your result should look like this (title is unique for all movies as we have already dropped duplicates in activity 6):
Store the resulting dataframe in the variable movies_tags_df
.
Store the result in the variable movies_with_no_tags
.
Merge tags_df
& ratings_df
using an outer join on 'movieId' and 'userId'
. Use suffixes '_tags'
and '_ratings'
.
Store the resulting dataframe in the variable tags_ratings_df
.
The result should look something like:
Store the resulting dataframe in the variable movies_tags_counts_df
.
inner
join.Store the resulting dataframe in the variable movies_ratings_counts_df
.
Store the resulting dataframe in the variable movies_with_no_ratings
.