Skip to content

amertu/social-media-data-analyzer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Social Media Analytics Platform

Python Jupyter Notebook Git Twitter API

This project analyzes global user engagement patterns by focusing on Greta Thunberg's Twitter followers, leveraging data from the Twitter API and Python for data processing and analysis.

Features

  • Developed a data analytics pipeline to collect and process Twitter user data from two countries.
  • Analyzed geographic distribution, user biographies, and posting behavior of followers.
  • Identified socio-cultural and geographic trends within follower groups, uncovering regional engagement styles and community structures.

Architecture

  BATCH DATA PIPELINE
───────────────────────
+---------------------+
|   Social Media API  |
+---------------------+
         |
         v
+---------------------+
|  Data Ingestion     |
+---------------------+
         |
         v
+---------------------+
|  Raw Data Storage   |
+---------------------+
         |
         v
+---------------------+
|   Preprocessing &   |    now            
|        EDA          |   ------
+---------------------+
         |
         v
+---------------------+
|   Trend Detection   |
|                     |
+---------------------+
         |
         v
+---------------------+
|   Reporting & Viz   |
|                     |
+---------------------+
         REAL-TIME DATA PIPELINE
────────────────────────────────────────
        +-------------------------+
        | Streaming API / Webhook |
        +-------------------------+
                    |
                    v
      +------------------------------+
      |  Stream Ingestion Engine     |
      +------------------------------+
                    |
                    v
          +-----------------------+
          | Message Queue System  |
          |  (Kafka / RabbitMQ)   |
          +-----------------------+
                    |
                    v
next    +-------------------------+
----->  |  Stream Processing Uni  |
        +-------------------------+
                    |
                    v
+-----------------------------------------+
|           Real-time Data Store          |
| (ElasticSearch / MongoDB / TimescaleDB) |
+-----------------------------------------+
                    |
                    v
+------------------------------------------+
|          Real-Time Dashboards            |
| (Kibana / Grafana / Custom Python Dash)  |
+------------------------------------------+

Project Report

The full report is available via Overleaf

Releases

No releases published

Packages

 
 
 

Contributors