This project is a comprehensive data analysis and visualization of over 30,000 movies from 1960 to 2024, focusing on key metrics such as:
✅ Global Box Office Revenue Trends
✅ Academy Award-Winning Films
✅ Top-Grossing Genres & Production Companies
✅ Most Voted Movies & Directors on IMDb
The dataset includes films from various countries, genres, and production houses, providing valuable insights into industry shifts and audience preferences. The project is powered by Tableau, SQL, and Python, with interactive dashboards and data-driven storytelling.
- The global box office has grown exponentially, with revenues peaking in the last two decades.
- Most profitable movies include:
- 🎥 Avatar – Highest-grossing film of all time.
- 🎥 Avengers: Endgame – Leading the superhero era.
- 🎥 Titanic, The Lion King, and Frozen II – Strong performers in drama and animation genres.
- The most awarded movies (by total Oscars won) include:
- 🏆 The Color Purple (11 Oscars)
- 🏆 The Turning Point (11 Oscars)
- 🏆 The Banshees of Inisherin, American Hustle, Ragtime (8+ Oscars)
- The highest-grossing movie genres are:
- 🦸 Superhero, Sci-Fi, and Fantasy – Marvel & DC dominance.
- 🎭 Drama & Action Adventure – Consistent revenue generators.
- 🚀 Space Sci-Fi & Alien Invasion – Strong audience appeal.
- Top movie studios (by revenue) include:
- 🎬 Walt Disney Studios – The undisputed box office leader.
- 🎬 Warner Bros. – Thrives on franchise hits (Harry Potter, DC Universe).
- 🎬 Universal Studios – Major player with Fast & Furious, Jurassic World.
- 🎬 Marvel Studios – A massive revenue spike with MCU films.
- Most voted movies on IMDb include:
- 🎞️ Around the World in 80 Days, Public Access, Christmas with the Chosen
- Most voted directors include:
- 🎬 Alfred Vohrer, Antonio Margheriti, Gregory Dark
📁 IMDb-Dashboard/
│-- 📊 Tableau_Dashboard.twbx # Tableau interactive dashboard
│-- 🗄️ Data/ # Raw & cleaned datasets
│ │-- movies_data.csv # Processed IMDb dataset
│ │-- box_office_revenue.csv # Global box office revenue data
│-- 📄 Data_Cleaning_Scripts/ # Python scripts for preprocessing
│-- 🗄️ SQL_Queries/ # SQL queries for data analysis
│-- 📈 Data_Visualization/ # Matplotlib & Seaborn visualizations
│-- 📑 Reports/ # Findings & insights summary
│-- 📜 README.md # Documentation
🔹 Tableau Interactive Dashboard Highlights:
✅ Global Box Office Trends (1960-2024) – Line chart showing revenue evolution.
✅ Top Oscar-Winning Movies – Bar chart of films with most Academy Awards.
✅ Top 15 Highest-Grossing Movies – Revenue comparison of Avatar, Avengers, Titanic.
✅ Most Profitable Genres – Action, Sci-Fi, Superhero, and Fantasy.
✅ Top 15 Most Successful Production Companies – Disney, Warner Bros., Universal.
The dataset combines information from:
📌 IMDb – Ratings, votes, and reviews data.
📌 Box Office Mojo – Revenue & earnings statistics.
📌 Academy Awards – Oscar-winning films dataset.
📌 Kaggle Datasets – Movie industry trends from various sources.
Technology | Purpose |
---|---|
🎨 Tableau | Interactive visualizations & dashboards |
🗄️ SQL (PostgreSQL/MySQL) | Data extraction & transformation |
🐍 Python (Pandas, Matplotlib, Seaborn) | Data cleaning & statistical analysis |
📊 Excel | Data pre-processing & structuring |
git clone https://github.com/your-username/repository-name.git
- Load the
Tableau_Dashboard.twbx
file in Tableau Desktop.
- Navigate to
/SQL_Queries/
and execute scripts for insights.
- Use Python scripts in
/Data_Cleaning_Scripts/
to clean & analyze data.
🔹 Want to contribute? Open a pull request for:
✅ New visualizations & insights
✅ Performance improvements
✅ Bug fixes or dataset additions
💡 Issues & Suggestions? Open a GitHub issue or reach out!
📌 GitHub: GitHub Repository
📌 LinkedIn: Your Profile
📌 Email: [email protected]