Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)
-
Updated
Jan 3, 2026 - Python
Code for the paper "Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching" (COLING 2025)
The official implementation of [Quality over Quantity: Boosting Data Efficiency Through Ensembled Multimodal Data Curation] in AAAI2025.
Our project for the "Data Intelligence Applications" exam at Politecnico di Milano. The project was about Social Influence and Pricing techniques applied to networks.
Decision intelligence platform for industrial manufacturing. Connects to CRM, ERP, and CMMS systems and monitors industry and macroeconomic conditions to compute leading indicators, generate predictions, and deliver daily executive briefings.
List of various ethical hacking tools
Cyber attack surface report for any domain — enter a target and get a full external risk assessment in minutes.
AI-powered job screening system that helps match candidates with job openings
The Cognitive Node of the Automated Data Intelligence Platform (ADIP). An AI-powered analytical infrastructure that consumes raw data from the Ingestion Engine into automated insights, forecasts and applied LLM reasoning, all served via Streamlit.
DomainKits connects Claude to live domain data. Built-in workflows help verify domains from multiple angles before making decisions. Supports domain search, analysis, brand conflict detection, valuation, trend discovery,
Trade-Based Money Laundering investigation reports generated in minutes — not weeks. This actor is built for AML compliance officers, trade finance banks, and financial investigators who need a scored TBML risk assessment backed by data from 14+ authoritative sources and five independent forensic algorithms.
Sensorium is a real-time data intelligence platform that ingests, processes, and visualizes sensor data through a resilient Python service, a high-performance Node.js/Express API, and an intuitive React.js dashboard—transforming raw data into actionable insights for smarter decision-making.
Data visualizations through Tableau for insightful analytics and decision-making using Walmart retail data.
Data collection of LoRaWAN 1.1 packets. This work was carried out as part of a bachelor's thesis in cybersecurity on the analysis of the security layer of the LoRaWAN 1.1 IoT protocol.
AML entity screening that searches 13 public compliance databases simultaneously — so your BSA/AML team gets a complete risk picture in under two minutes instead of hours of manual lookups.
Integration repository for Google Cloud Conversational Analytics API with Antigravity IDE (with possible support for other IDE's in the future). Leverages Gemini, BigQuery, and Looker for intelligent conversational analytics.
Sanctions network analysis that screens any person or company across 8 authoritative sources simultaneously — OFAC SDN list, OpenSanctions, Interpol Red Notices, and five international corporate registries — then applies four specialized scoring algorithms to produce an actionable Evasion Probability Score with a five-tier verdict.
Export compliance screening gives trade compliance teams, legal counsel, and defense contractors a structured risk verdict — APPROVED, LICENSE_REQUIRED, ENHANCED_REVIEW, or DENIED — on any entity, technology, or trade corridor in minutes.
Interactive HR Analytics Dashboard in Power BI — tracking 1,470 employees across promotion eligibility, retrenchment risk, gender ratio, and job satisfaction
Production-ready data enrichment API with 9 AI-powered tools: web scraping, email intel, phone validation, company data, and more. SaaS-ready with OpenAPI docs.
Company due diligence screening across 18 data sources — in one automated run. This actor performs KYB (Know Your Business) checks against global corporate registries, OFAC and OpenSanctions watchlists, Interpol Red Notices, SEC EDGAR filings, insider trading records, CFPB consumer complaints, WHOIS data, and tech stack analysis.
Add a description, image, and links to the data-intelligence topic page so that developers can more easily learn about it.
To associate your repository with the data-intelligence topic, visit your repo's landing page and select "manage topics."