Skip to content
View rafabelokurows's full-sized avatar

Block or report rafabelokurows

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
rafabelokurows/README.md

Hi, I'm Rafael Belokurows 👋

Email LinkedIn Website

📈 Data Scientist & Analytics Engineer @ Primetag 🇵🇹
🎓 MSc Data Science — Universidade do Porto
🤖 Currently exploring: LLM engineering · RAG pipelines · Agentic AI


🔍 What I work on

  • Building and automating data pipelines that save real time and money
  • Forecasting, causal inference, and geospatial analysis applied to business problems
  • Deploying ML models end-to-end — not just training them
  • Extracting value from unstructured data using Azure AI and LLMs

🛠 Tools & Stack

Languages: Python · R · SQL
ML & AI: scikit-learn · XGBoost · LangChain · ChromaDB · Streamlit
BI & Viz: Power BI · Tableau · R/Shiny
Cloud & MLOps: Azure (ML · Vision · LLM) · Docker · GitHub Actions
Other: dbt · Git · JIRA · Agile


🖥 Latest Data Projects

Project Description Stack
Household Finance Copilot AI-powered personal finance platform: ingests bank statements via Gmail, extracts transactions with Groq/Llama 3, review queue, analytics dashboard — fully deployed FastAPI · React · Supabase · Groq · Docker
GeoAI Airbnb Intelligence Platform Geospatial ML platform explaining what drives Airbnb performance in Porto — spatial feature engineering, LightGBM, SHAP explainability, H3 hexagon maps, FastAPI backend Python · LightGBM · SHAP · FastAPI · DuckDB · Deck.gl
TikTok Brand/Creator Classifier Cost-sensitive ML pipeline separating brands from creators using profile signals: bios, usernames, emojis, engagement patterns Python · XGBoost · scikit-learn
Influencer Research RAG Local RAG assistant to query academic papers with cited answers — no API keys, runs on-device Python · ChromaDB · Qwen2.5 · Streamlit
Data Analyst Job Skills What skills and salaries actually look like for data analysts — scraped from LinkedIn Python · Jupyter · GitHub Pages
Causal Effect of Layoffs on Stock Price Event study + DiD to measure how layoff announcements move stock prices R · causalimpact

📊 Data Visualizations

Time Series Explorer Map Variation Inflation
Flights map Brazil Brazil endangered languages
Game of Thrones IMDB reviews Ryanair domination
Europe gender pay gap Porto Starmap

Tuga inflation animation

Pinned Loading

  1. influencer-marketing-research-rag influencer-marketing-research-rag Public

    Tired of ctrl+F-ing through PDFs? Ask natural language questions across your research paper library and get cited answers — no API keys, no cloud, runs on your laptop.

    Python

  2. tiktok-brand-creator-classifier tiktok-brand-creator-classifier Public

    Cost-sensitive machine learning pipeline that separates TikTok brands from creators using the small signals hidden obtained via webscraping: bios, usernames, emojis, engagement patterns, and commer…

    Python

  3. household-finance-copilot household-finance-copilot Public

    A personal AI-powered household finance platform. Ingests bank statements via Gmail or manual upload, extracts transactions using Google Gemini Flash, and provides a full review + analytics suite.

    Python

  4. shiny_time_series_forecasting shiny_time_series_forecasting Public

    R/Shiny app showcasing forecasting methods for multiple time series

    Jupyter Notebook 6 2

  5. sports-odds sports-odds Public

    Obtaining odds for MLB and NFL games through an API

    Python 7

  6. data-analyst-job-skills data-analyst-job-skills Public

    Insights on skills and salaries using real data (scraped from Linkedin) - https://rafabelokurows.github.io/data-analyst-job-skills/

    Jupyter Notebook 3