Skip to content

ExperienceIsKey/Video-Game-Genre-Trend-Analysis

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

15 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐ŸŽฎ Video Game Genre Trend Analysis

๐Ÿ“Œ Overview

This project explores trends in video game genres using data from 30,000+ video games (2016-2024) sourced from the Steam API and Kaggle. We analyze revenue, review scores, ownership trends, and genre evolution to provide actionable insights for developers, marketers, and industry stakeholders.

๐ŸŽฏ Objectives

  • Identify popular and emerging game genres.
  • Analyze review scores, revenue, and ownership trends.
  • Utilize TF-IDF and Cosine Similarity for content-based recommendations.
  • Offer data-driven insights for the gaming industry.

๐Ÿ“Š Dataset

  • Source: Steam API, Kaggle
  • Games Analyzed: 30,000+
  • Attributes: Revenue, review scores, ownership data, genres, tags, release dates
  • Processing: Data cleaning, standardization, genre classification

๐Ÿ“Œ Dataset Overview

  • Total Number of Games: ๐Ÿ•น๏ธ 65,112
  • Total Number of Distinct Games: ๐Ÿ•น๏ธ 38,471

1๏ธโƒฃ Genre Distribution (Genre):

  • Battle Royale: 1,030 games
  • Multiplayer: 318 games
  • Role-Playing Games (RPG): 15,758 games
  • Racing: 1,754 games
  • Strategy: 10,281 games
  • Sports: 1,606 games

2๏ธโƒฃ Game Distribution (Pricing Model):

  • Free to Play: 605 games
  • Paid: 38,399 games

๐Ÿ“Œ Dataset Columns & Description

1๏ธโƒฃ Game Identification

  • App ID ๐Ÿท๏ธ โ€“ Unique identifier assigned to each game in the Steam database
  • Title ๐ŸŽฎ โ€“ Name of the game

2๏ธโƒฃ Reviews & Ratings

  • Reviews Total ๐Ÿ“ โ€“ Total number of reviews submitted by users
  • Reviews Score Fancy โญ โ€“ Steam's formatted rating based on user reviews
  • Reviews D7 ๐Ÿ“† โ€“ Number of reviews received in the last 7 days
  • Reviews D30 ๐Ÿ“† โ€“ Number of reviews received in the last 30 days
  • Reviews D90 ๐Ÿ“† โ€“ Number of reviews received in the last 90 days

3๏ธโƒฃ Game Release & Pricing

  • Release Date ๐Ÿ—“๏ธ โ€“ Date when the game was launched on Steam
  • Launch Price ๐Ÿ’ฐ โ€“ Initial price of the game at release

๐Ÿ” Methodology

  • Data Cleaning & Transformation: Handled missing values, standardized titles, converted data types.
  • Ownership Estimation: Applied the Boxleiter method for estimating game ownership.
  • Trend Analysis: Visualized genre trends over time.
  • Genre Similarity Search: Implemented TF-IDF + Cosine Similarity to recommend similar games.

๐Ÿ“ˆ Key Insights

โœ” RPG & Multiplayer games remain dominant in ownership and revenue.

โœ” Paid games have higher review scores than Free-to-Play.

โœ” Strategy & Sports games show steady, high ratings over time.

โœ” Battle Royale & Multiplayer genres are growing rapidly.

โœ” Revenue is highly influenced by top-performing titles.

๐Ÿš€ Technologies Used

  • Python, PySpark, AWS, Spark-SQL, Docker
  • Jupyter Notebook, Pandas, Matplotlib, Seaborn
  • TF-IDF, Cosine Similarity for content-based recommendations

๐Ÿ›  Future Enhancements

  • Cross-platform analysis (PC, console, mobile).
  • Impact of emerging technologies (AR/VR, cloud gaming).
  • Sentiment analysis of user reviews.
  • Influencer impact tracking on game success.

About

Leveraged AWS, PySpark, and Power BI to analyze trends in PC video game genres. Optimized ETL processes and utilized datasets and the Steam API to reveal nuanced genre frequencies and distributions. Delivered insights driving decisions in game development, marketing, and platform enhancement.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages