Leveraging LLM Pipelines to Generate Insurance Product Comparison Data

Project Summary

This repository implements a proof-of-concept, LLM-driven competitive market analysis system that generates structured insurance product comparison data across multiple companies in the German insurance market.

The project was developed as part of a Master-Praktikum at the Technical University of Munich and conducted within the Social Computing Research Group at the TUM School of Computation, Information and Technology.

At a high level, the system operationalizes a “crawl → clean → extract → classify → compare → evaluate” workflow through a reproducible pipeline architecture, aiming to improve accuracy, consistency, and coverage of LLM-generated product comparison outputs via prompt engineering and automated evaluation.

For more details about this project, please refer to the project report and poster included in this repository.

What the Project Builds

The repository describes an end-to-end pipeline designed to produce product comparisons from publicly available insurer webpages.

It begins with data collection by crawling insurer sites to gather product pages, followed by filtering and transformation steps to remove irrelevant content and convert or normalize content for downstream processing.
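The filtering and conversion steps above can be illustrated with a minimal sketch. It is not the repository's implementation: the keyword list, the skipped tags, the `min_hits` threshold, and the function names are all assumptions made for illustration. It shows one plausible way to strip non-content markup from a crawled page and keep only pages that look like product pages.

```python
from html.parser import HTMLParser

# Hypothetical keyword list; the project's real filtering criteria may differ.
PRODUCT_KEYWORDS = ("versicherung", "tarif", "leistungen")


class TextExtractor(HTMLParser):
    """Collects visible text while skipping script, style, nav, and footer blocks."""

    SKIPPED_TAGS = {"script", "style", "nav", "footer"}

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED_TAGS:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIPPED_TAGS and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.chunks.append(text)


def html_to_text(html: str) -> str:
    """Converts raw HTML into newline-separated visible text."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return "\n".join(parser.chunks)


def is_product_page(text: str, min_hits: int = 2) -> bool:
    """Keeps a page if at least min_hits distinct keywords occur in it."""
    lowered = text.lower()
    hits = sum(1 for keyword in PRODUCT_KEYWORDS if keyword in lowered)
    return hits >= min_hits
```

A keyword heuristic like this is cheap and easy to inspect, but it is only a stand-in; the actual pipeline may filter by URL patterns, page structure, or an LLM.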

Next, the pipeline performs LLM-based extraction and classification of product details.
The project explicitly states that GPT-4o is the primary model used for classification and comparison generation, and that multiple prompting strategies are evaluated. Finally, the system generates product comparisons by identifying similar products across providers using embeddings and cosine similarity.
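To make the matching step concrete, here is a small sketch of cosine-similarity matching between two providers. The embedding vectors, product names, the 0.8 threshold, and the `match_products` helper are assumptions for illustration; the source does not specify the embedding model or the threshold the project uses.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 for zero vectors)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_products(products_a, products_b, threshold=0.8):
    """For each product of provider A, returns its most similar product of
    provider B, provided the similarity reaches the (assumed) threshold.

    Both arguments map product names to embedding vectors.
    """
    matches = []
    for name_a, vec_a in products_a.items():
        best_name, best_score = None, -1.0
        for name_b, vec_b in products_b.items():
            score = cosine_similarity(vec_a, vec_b)
            if score > best_score:
                best_name, best_score = name_b, score
        if best_name is not None and best_score >= threshold:
            matches.append((name_a, best_name, best_score))
    return matches
```

In practice the vectors would come from an embedding model applied to the extracted product descriptions; the toy three-dimensional vectors one might pass in here only demonstrate the mechanics.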

Repository Structure and Deliverables

NLP Competitive Market Analysis/
│── .viz/
│── conf/
│── data/
│   ├── 1_insurances_crawled_data/
│   ├── 2_filtered_product_pages/
│   ├── 3_filtered_product_markdowns/
│   ├── 4_extracted_product_details/
│   ├── 5_classified_product_details/
│   ├── 6_product_comparisons_markdowns/
│   ├── 8_evaluation/
│── lab_competitive_analysis/
│   ├── pipelines/
│   │   ├── insurance_data_classification/
│   │   ├── insurance_data_comparison/
│   │   ├── insurance_data_crawling/
│   │   ├── insurance_data_evaluation/
│   │   ├── insurance_data_extraction/
│   │   ├── insurance_data_filtering/
│   │   ├── insurance_data_markdown/
│   │   ├── reference_data/
│── notebooks/
│── tests/
│── pyproject.toml
│── README.md
│── poetry.lock

Data Flow

The repository’s data/ directory reflects a multi-stage processing pipeline, where each stage produces artifacts used by subsequent steps.

The stages include:

Crawled data – raw data collected from insurance company webpages
Filtered product pages – webpages filtered to retain only relevant product information
Filtered product markdowns – webpages converted into normalized Markdown format
Extracted product details – structured product information extracted from text and stored in JSON format
Classified product details – product categories added to the structured data using LLM classification
Product comparison markdowns – comparison tables generated for insurance products that were matched as similar via cosine similarity
Evaluation artifacts – evaluation metrics and outputs, including the prompts used during the evaluation process

This staged design follows reproducible data engineering practices, enabling iterative improvement of upstream steps and clear observation of downstream impacts on comparison and evaluation results under different prompting strategies.
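The idea that each stage persists an artifact for the next one can be sketched in plain Python. In the project itself, Kedro and its data catalog take care of this; the helper below, the `records.json` file name, the record fields, and the rule-based classifier standing in for the LLM call are all hypothetical.

```python
import json
from pathlib import Path


def run_stage(name, func, input_path, data_dir):
    """Reads the upstream artifact, applies func, and writes this stage's artifact."""
    records = json.loads(Path(input_path).read_text(encoding="utf-8"))
    result = func(records)
    stage_dir = Path(data_dir) / name
    stage_dir.mkdir(parents=True, exist_ok=True)
    output_path = stage_dir / "records.json"
    output_path.write_text(
        json.dumps(result, ensure_ascii=False, indent=2), encoding="utf-8"
    )
    return output_path


def extract_details(records):
    """Toy extraction: treats the first word of the page text as the product name."""
    return [
        {"provider": r["provider"], "product_name": r["text"].split()[0]}
        for r in records
    ]


def classify_products(records):
    """Toy classifier standing in for the LLM call; adds a category field."""
    classified = []
    for r in records:
        category = "household" if "hausrat" in r["product_name"].lower() else "other"
        classified.append({**r, "category": category})
    return classified
```

Because every stage reads from and writes to disk, an upstream step can be rerun or swapped (for example with a different prompt) and the effect on later artifacts inspected directly.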

Methods

The project is orchestrated using Kedro, a framework designed to build robust, modular, and maintainable data pipelines with consistent project structure and pipeline assembly.

For pipeline visibility and debugging, the project references Kedro-Viz, an interactive visualization tool for Kedro pipelines that supports exploration of pipeline graphs and metadata.

Evaluation

For evaluation, the project uses the following TruLens metrics:

Groundedness – traceability of generated outputs to the original source information
Comprehensiveness – coverage of relevant key points in the generated output
Ground-truth agreement – similarity between generated output and reference answers

Conceptually, this evaluation strategy aligns with the broader pattern known as LLM-as-a-judge, where LLM-based systems are used to evaluate model outputs.

Research indicates that while LLM-as-judge approaches can be scalable and useful, they require careful design to ensure reliability and alignment with human intent, and should be considered decision-support tools rather than definitive ground truth.
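The LLM-as-a-judge pattern can be sketched generically as follows. This is not the TruLens API and not the project's evaluation code: the prompt wording, the 0 to 10 scale, and the `ask_llm` callable (any function that sends a prompt to a model and returns its text reply) are assumptions chosen for illustration.

```python
import re
from typing import Callable

# Hypothetical judge prompt; the prompts actually used are stored with the
# project's evaluation artifacts.
JUDGE_TEMPLATE = (
    "You are grading whether a statement is supported by a source text.\n"
    "Source:\n{source}\n\n"
    "Statement:\n{statement}\n\n"
    "Reply with a single integer from 0 (unsupported) to 10 (fully supported)."
)


def build_groundedness_prompt(source: str, statement: str) -> str:
    """Fills the judge template with the source text and the generated statement."""
    return JUDGE_TEMPLATE.format(source=source, statement=statement)


def parse_score(reply: str) -> float:
    """Extracts the first integer in 0..10 from the reply and scales it to 0..1."""
    match = re.search(r"\b(10|[0-9])\b", reply)
    if match is None:
        raise ValueError(f"No score found in judge reply: {reply!r}")
    return int(match.group(1)) / 10.0


def judge_groundedness(
    source: str, statement: str, ask_llm: Callable[[str], str]
) -> float:
    """Asks the judge model for a score and returns it normalized to 0..1."""
    return parse_score(ask_llm(build_groundedness_prompt(source, statement)))
```

Passing the model call in as a function keeps the scoring logic testable with a fake judge, which is useful given that judge outputs should be spot-checked against human review.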

TruLens: https://www.trulens.org/getting_started/

Key Findings

The repository reports three main findings:

Advanced classification prompts increase comprehensiveness but may reduce specificity, indicating a trade-off that affects downstream comparison usefulness.
LLM-as-a-judge metrics (via TruLens) can measure factual alignment and completeness, providing a more systematic alternative to purely manual inspection.
Combining structured prompting with metric-based evaluation improves reliability of generated product comparisons.

Configuration & Setup

Base Configuration

The conf/base/ folder contains shared configuration files used across the project.

These configurations include project-level settings such as dataset definitions, pipeline parameters, and other non-sensitive settings that can be safely shared among team members.

Environment Setup

The project uses Poetry for dependency management and virtual environment handling.

Install dependencies:

poetry install

Activate the virtual environment:

poetry shell

Running the Kedro Pipeline

Run the entire pipeline:

poetry run kedro run

This executes the pipeline in the order specified in: LAB-COMPETITIVE-ANALYSIS/lab_competitive_analysis/pipeline_registry.py

Run a specific node:

poetry run kedro run --nodes <node_name>

Run a specific pipeline:

poetry run kedro run --pipeline <pipeline_name>

Run a specific node inside a specific pipeline:

poetry run kedro run --pipeline <pipeline_name> --nodes <node_name>

Pipeline and Dataset Configuration

Pipeline names are located in: LAB-COMPETITIVE-ANALYSIS/lab_competitive_analysis/pipeline_registry.py

Dataset configurations are defined in: conf/base/catalog.yml

Node names refer to directory names under LAB-COMPETITIVE-ANALYSIS/lab_competitive_analysis/

Supported dataset types can be found in the Kedro datasets documentation:

https://docs.kedro.org/projects/kedro-datasets/en/kedro-datasets-6.0.0/api/kedro_datasets.html

For more information about Kedro configuration, see the official documentation:

https://docs.kedro.org/en/stable/configuration/configuration_basics.html

Acknowledgments

Team Members

Khalil Chikhaoui
Jaeyeop Chung
Umut Ekin Gezer

Advisor

Dr. Gerhard Johann Hagerer
