LLM Vending Machine Simulation

A modular Python-based simulation where Large Language Models (LLMs) act as autonomous economic agents, competing to optimize pricing and maximize profit in a dynamic market environment.

New in v2.0: Now featuring an Agentic Mode powered by LangGraph, enabling distinct reasoning, tool usage, and state management steps for the AI agents.

🚀 Overview

This project simulates a competitive vending machine market. It provides a platform to test how different "Strategic Business Manager" personas (implemented via LLMs) adapt to:

  • Competitor Actions: Surviving against a fixed-strategy "BasicMachine" or other LLMs.
  • Market Dynamics: Reacting to price sensitivity, random demand fluctuations, and stockouts.
  • Long-term Planning: Balancing immediate sales vs. profit margins over a 52-week simulation.

✨ Features

  • 🧠 Agentic Architecture: Uses LangGraph to model the agent's workflow: Simulate Market -> Reasoning (LLM) -> Tool Execution -> Next Week.
  • ⚔️ Multi-Model Competition: Pit top models (Mistral, DeepSeek, GPT-4o, etc.) against each other or against baseline algorithms.
  • 🛠️ Tool Use: Agents have discrete tools: change_price(product, new_price) to update pricing and get_market_data() to inspect market state.
  • 📈 Realistic Econ-Sim: Includes product elasticity, maintenance costs, restocking mechanics, and cumulative profit tracking.
  • 📊 Analytics & Benchmarking:
    • Automated profit plotting (matplotlib).
    • Detailed CSV logs of every decision and market event.
    • Benchmarking pipeline to run N trials and calculate win rates.
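
The tool interface mentioned above can be sketched as plain Python functions. The real definitions live in src/llm/ and may differ; the `machine` dictionary shape and return strings here are illustrative assumptions.

```python
# Hypothetical sketch of the agent's two tools (actual signatures in src/llm/ may differ).

def change_price(machine: dict, product: str, new_price: float) -> str:
    """Update the price of one product on the agent's machine."""
    if product not in machine["prices"]:
        return f"Unknown product: {product}"
    if new_price <= 0:
        return "Price must be positive"
    machine["prices"][product] = round(new_price, 2)
    return f"{product} now priced at {new_price:.2f}"

def get_market_data(machine: dict) -> dict:
    """Return last week's sales, current stockouts, and current pricing."""
    return {
        "last_week_sales": machine["sales"],
        "stockouts": [p for p, qty in machine["stock"].items() if qty == 0],
        "prices": dict(machine["prices"]),
    }
```

Exposing tools as small, validated functions like these is what lets the LLM's tool calls be executed safely between reasoning steps.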

🏗️ Project Structure

LLM_Vending_Machine/
├── src/
│   ├── agentic_run.py     # 🧠 Agent Run: LangGraph-based agent simulation (Recommended)
│   ├── competitive_run.py # 📜 Legacy: Standard loop simulation
│   ├── benchmark.py       # 📊 Pipeline to run multiple trials & aggregate stats
│   ├── multi_model_run.py # ⚔️ AI vs AI competition script
│   ├── main.py            # 🟢 Simple single-machine baseline
│   ├── config.py          # ⚙️ Configuration (Products, Costs, API Keys)
│   ├── models/            # 📦 Domain Objects: Product, VendingMachine
│   ├── simulation/        # 🎲 Market Engine & Physics
│   ├── llm/               # 🤖 LLM Client & Tool Definitions
│   └── utils/             # 📉 Plotting & Helper functions
├── data/
│   ├── benchmarks/        # 📂 Raw & Summary CSVs from benchmarks
│   ├── results/           # 📂 Global simulation history
│   └── logs/              # 📝 Detailed weekly logs
├── .env                   # 🔑 Secrets (API Keys)
└── *.png                  # 🖼️ Generated profit graphs

🛠️ Getting Started

Prerequisites

  • Python 3 with pip (the commands below use python3)
  • An OpenRouter API key (set as OPENROUTER_API_KEY in .env)

Installation

  1. Clone the repository:

    git clone https://github.com/santos-sanz/LLM_Vending_Machine.git
    cd LLM_Vending_Machine
  2. Install Dependencies:

    pip install -r requirements.txt
    # Or manually:
    pip install python-dotenv openai matplotlib langgraph langchain-openai langchain-core
  3. Configure Environment: Create a .env file in the root directory:

    OPENROUTER_API_KEY=sk-or-your-key-here
    DEFAULT_MODEL=mistralai/mistral-7b-instruct:free
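
For reference, loading this file is what python-dotenv's load_dotenv() does at startup. A minimal stdlib-only sketch of the same behavior (the real code simply calls `from dotenv import load_dotenv; load_dotenv()`):

```python
# Minimal sketch of what load_dotenv() does for this project:
# read KEY=VALUE pairs from .env into os.environ.
import os

def load_env_file(path: str = ".env") -> None:
    with open(path) as f:
        for line in f:
            line = line.strip()
            # Skip blanks, comments, and malformed lines.
            if not line or line.startswith("#") or "=" not in line:
                continue
            key, _, value = line.partition("=")
            # Don't clobber variables already set in the shell.
            os.environ.setdefault(key.strip(), value.strip())
```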

🎮 Running Simulations

The project supports multiple modes depending on what you want to test.

1. Agentic Competitive Run (Recommended)

Runs the LangGraph agent against the Baseline Machine. This mode allows the agent to "think", use tools, and loop until it decides to proceed to the next week.

python3 src/agentic_run.py --model mistralai/mistral-large-latest --weeks 52
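
The think/tool/proceed loop this mode implements can be sketched in plain Python. agentic_run.py builds it as a LangGraph graph instead; the state fields and callback shapes here are assumptions for illustration.

```python
# Plain-Python sketch of the agentic week loop (the real version is a LangGraph graph).

def run_week_loop(weeks, reason, execute_tools, simulate_market):
    """Drive one season: each week the agent reasons and acts, then the market runs."""
    state = {"week": 0, "profit": 0.0}
    while state["week"] < weeks:
        # Agent turn: keep reasoning and executing tools until the agent
        # returns no tool calls, i.e. decides to proceed to the next week.
        while True:
            decision = reason(state)
            if not decision.get("tool_calls"):
                break
            execute_tools(state, decision["tool_calls"])
        state["profit"] += simulate_market(state)  # run 7 virtual days
        state["week"] += 1
    return state
```

In the LangGraph version, `reason`, `execute_tools`, and `simulate_market` would be graph nodes, with a conditional edge routing back to tool execution while tool calls remain.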

2. Standard Benchmarking

Run a batch of simulations to statistically validate a model's performance (e.g., 5 runs of 52 weeks).

# Run agentic benchmarks
python3 src/benchmark.py --mode agentic

# Run legacy benchmarks
python3 src/benchmark.py --mode legacy

3. Legacy One-vs-One

The original simulation loop without LangGraph. Good for quick debugging.

python3 src/competitive_run.py --model nex-agi/deepseek-v3 --weeks 20

4. AI vs AI (Battle Mode)

Two different LLMs compete in the same market.

python3 src/multi_model_run.py

🧩 How It Works

  1. Initialization: Two machines (BasicMachine & LLMMachine) start with 0 cash and full stock.
  2. The Week Loop:
    • Refill: Machines are restocked to capacity.
    • Agent Turn:
      • The Agent receives last week's sales data, stockouts, and current pricing.
      • It reasons about the market (e.g., "I sold out too fast, I should raise prices").
      • It executes tools to update prices.
    • Simulation: The engine runs 7 virtual days. Customers choose products based on Purchase Probability ~ (Base Utility / Price).
    • Accounting: Profits are calculated (Revenue - Cost of Goods - Maintenance).
  3. Winner: After N weeks, the machine with the highest cumulative Net Profit wins.
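
The purchase rule and accounting step above can be made concrete with a short sketch. The utility values, normalization, and walk-away behavior are assumptions; the actual engine lives in src/simulation/.

```python
# Illustrative customer-choice and accounting sketch (real engine in src/simulation/).
import random

def choose_product(prices, utilities, rng):
    """Pick a product with probability proportional to utility / price.
    If total weight < 1, the leftover probability mass means no purchase."""
    weights = {p: utilities[p] / prices[p] for p in prices}
    total = sum(weights.values())
    if total <= 0:
        return None
    r = rng.random() * max(total, 1.0)
    for product, w in weights.items():
        if r < w:
            return product
        r -= w
    return None  # customer walked away

def weekly_profit(revenue, cogs, maintenance):
    """Accounting step: Revenue - Cost of Goods - Maintenance."""
    return revenue - cogs - maintenance
```

Under this rule, halving a product's price doubles its choice weight, which is the lever the agent pulls when it reasons about stockouts and margins.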

📊 Monitoring

  • Terminal: Real-time "Thinking..." logs from the agent and weekly profit summaries.
  • Images: Check the root (or images/ folder) for profit_comparison_*.png plots.
  • Data: data/results/simulation_results.csv contains the high-level metrics of every run.
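
A quick way to tally outcomes from the results CSV listed above, using only the standard library. The column name "winner" is an assumption about the CSV schema; adapt it to the actual headers in data/results/simulation_results.csv.

```python
# Sketch: count wins per machine from the results CSV (assumes a "winner" column).
import csv

def summarize_results(path):
    """Return a {machine_name: win_count} tally from the results file."""
    wins = {}
    with open(path, newline="") as f:
        for row in csv.DictReader(f):
            winner = row.get("winner", "unknown")
            wins[winner] = wins.get(winner, 0) + 1
    return wins
```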

🛡️ License

MIT License. Free to use and modify.
