This project analyzes global user engagement patterns by focusing on Greta Thunberg's Twitter followers, leveraging data from the Twitter API and Python for data processing and analysis.
- Developed a data analytics pipeline to collect and process Twitter user data from two countries.
- Analyzed geographic distribution, user biographies, and posting behavior of followers.
- Identified socio-cultural and geographic trends within follower groups, uncovering regional engagement styles and community structures.
BATCH DATA PIPELINE
───────────────────────
+---------------------+
| Social Media API |
+---------------------+
|
v
+---------------------+
| Data Ingestion |
+---------------------+
|
v
+---------------------+
| Raw Data Storage |
+---------------------+
|
v
+---------------------+
| Preprocessing & | now
| EDA | ------
+---------------------+
|
v
+---------------------+
| Trend Detection |
| |
+---------------------+
|
v
+---------------------+
| Reporting & Viz |
| |
+---------------------+
|
REAL-TIME DATA PIPELINE
────────────────────────────────────────
+-------------------------+
| Streaming API / Webhook |
+-------------------------+
|
v
+------------------------------+
| Stream Ingestion Engine |
+------------------------------+
|
v
+-----------------------+
| Message Queue System |
| (Kafka / RabbitMQ) |
+-----------------------+
|
v
next +-------------------------+
-----> | Stream Processing Uni |
+-------------------------+
|
v
+-----------------------------------------+
| Real-time Data Store |
| (ElasticSearch / MongoDB / TimescaleDB) |
+-----------------------------------------+
|
v
+------------------------------------------+
| Real-Time Dashboards |
| (Kibana / Grafana / Custom Python Dash) |
+------------------------------------------+
|