Project D10: KAGGLE - Movie Ratings

This is the project repository for the Introduction to Data Science 2022 course.

Authors:

Kevin Kliimask
Taavi Eistre
Jens Jäger

Datasets used:

movies_metadata.csv
keywords.csv

Data retrieved from: https://www.kaggle.com/datasets/rounakbanik/the-movies-dataset

Our goals for the project:

Train models on two different datasets (movies_metadata.csv and keywords.csv) to predict movie ratings (vote averages)
Analyze the data and find interesting facts

To run our project:

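Clone our repository
Install the following libraries:
- pandas
- NumPy
- Matplotlib
- scikit-learn (imported as sklearn)
- ast (part of the Python standard library, so no installation is needed)
- TensorFlow
Open the desired notebook (D10_notebook1.ipynb or D10_notebook2.ipynb) and run it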
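
Before opening a notebook, it can help to confirm that the libraries listed above are importable. The following is a minimal sketch under stated assumptions: the helper name `missing_modules` is hypothetical, and the import names in `REQUIRED` are assumed to be the ones the notebooks use.

```python
import importlib.util

# Import names assumed for the libraries listed above.
REQUIRED = ["pandas", "numpy", "matplotlib", "sklearn", "ast", "tensorflow"]


def missing_modules(names):
    """Return the names from `names` that cannot be found in this environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED)
    if missing:
        print("Missing libraries:", ", ".join(missing))
    else:
        print("All required libraries are available.")
```

Any name reported as missing would need to be installed before the notebooks can run.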

Summary

We created two different notebook files for our models.

In the first one, we used the movies metadata dataset and trained regression models that we had learned about during the course.

In the second one, we tried out TensorFlow on the keywords dataset, combined with the genres column from the metadata dataset.
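
The ast library is likely needed because columns such as genres and keywords in this Kaggle dataset are stored as strings that look like Python lists of dictionaries. Below is a minimal sketch of parsing one such cell, assuming that format. The sample string and the helper name `extract_names` are illustrative and not taken from the notebooks.

```python
import ast


def extract_names(cell):
    """Parse a stringified list of dicts and return the 'name' values."""
    try:
        items = ast.literal_eval(cell)
    except (ValueError, SyntaxError, TypeError):
        # Malformed or missing cells are treated as having no entries.
        return []
    if not isinstance(items, list):
        return []
    return [item["name"] for item in items
            if isinstance(item, dict) and "name" in item]


# Illustrative cell, assumed to match the dataset's format.
sample = "[{'id': 16, 'name': 'Animation'}, {'id': 35, 'name': 'Comedy'}]"
print(extract_names(sample))
```

Using `ast.literal_eval` rather than `eval` means only Python literals are evaluated, which is the safer choice for text read from a CSV file.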

Our goal was to train a model with an RMSE (root mean squared error) of about 0.5 or lower, but unfortunately we were not able to accomplish that. We did, however, manage to train models with an RMSE of about 1, which we still consider a reasonable result.
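
For reference, RMSE is the square root of the mean of the squared differences between actual and predicted ratings. Here is a minimal sketch of the computation. The ratings below are made up for illustration and are not results from the notebooks, and the function name `rmse` is our own.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between two equal-length sequences."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


# Made-up ratings for illustration only.
actual = [6.0, 7.5, 5.0, 8.0]
predicted = [7.0, 6.5, 6.0, 7.0]
print(rmse(actual, predicted))  # every prediction is off by exactly 1, so this prints 1.0
```

An RMSE of about 1 therefore means that predictions are typically off by roughly one rating point on the vote average scale.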
