Quick Reference Guide

One-page reference for common tasks.

Installation

pip install -r requirements.txt
# Install FFmpeg from https://ffmpeg.org
# Install Ollama from https://ollama.ai
ollama pull gpt-oss:20b

Basic Usage

Web UI

python app.py
# Open: http://127.0.0.1:7860

Use the New Blank Campaign button on the Process Session tab to spin up a fresh profile; the dashboard, library, and import tools will reset to that campaign automatically.

CLI - Simple

python cli.py process recording.m4a

CLI - Full Options

python cli.py process recording.m4a \
  --session-id "session1" \
  --characters "Thorin,Elara,Zyx" \
  --players "Alice,Bob,Charlie,DM" \
  --num-speakers 4

CLI - With Party Config

# Use pre-configured party (easier!)
python cli.py process recording.m4a \
  --party default \
  --session-id "session1"

Commands

# Process audio
python cli.py process <file>

# Process with party config
python cli.py process <file> --party default

# Party management
python cli.py list-parties                           # List all parties
python cli.py show-party default                     # Show party details
python cli.py export-party default my_party.json     # Export party
python cli.py import-party my_party.json             # Import party
python cli.py export-all-parties backup.json         # Backup all parties

# Character profiles
python cli.py list-characters                        # List all characters
python cli.py show-character "Name"                  # Show character overview
python cli.py export-character "Name" file.json      # Export character
python cli.py import-character file.json             # Import character

# Map speakers
python cli.py map-speaker <session> <speaker_id> <name>

# View speakers
python cli.py show-speakers <session>

# Check setup
python cli.py check-setup

# View config
python cli.py config

File Outputs

After processing, each session gets its own timestamped folder:

output/
└── YYYYMMDD_HHMMSS_session_id/
    ├── session_id_full.txt       # Complete transcript
    ├── session_id_ic_only.txt    # Game narrative only
    ├── session_id_ooc_only.txt   # Banter only
    ├── session_id_data.json      # Structured data
    ├── session_id_full.srt       # Full subtitles
    ├── session_id_ic_only.srt    # IC subtitles
    └── session_id_ooc_only.srt   # OOC subtitles

Example: output/20251019_194750_session1/

Session Narratives: Transform transcripts into story formats (see Story Notebooks tab)

Configuration (.env)

# Backends
WHISPER_BACKEND=local      # or: groq, openai
LLM_BACKEND=ollama         # or: openai

# Models
WHISPER_MODEL=large-v3
OLLAMA_MODEL=gpt-oss:20b

# Processing
CHUNK_LENGTH_SECONDS=600
CHUNK_OVERLAP_SECONDS=10

# API Keys (optional)
GROQ_API_KEY=
OPENAI_API_KEY=
HF_TOKEN=                  # For PyAnnote

Common Options

--skip-diarization        # ~30% faster, no speaker labels, no HF token needed
--skip-classification     # ~20% faster, no IC/OOC separation
--skip-snippets           # ~10% faster, saves disk space, no audio clips
--num-speakers N          # Expected speaker count (improves diarization)
--output-dir PATH         # Custom base output location

Processing Times

4-hour session:

Local (CPU): ~8-12 hours
Local (GPU): ~2-4 hours
Groq API: ~30-60 minutes

Troubleshooting

Problem	Solution
FFmpeg not found	Install FFmpeg, add to PATH
Ollama error	Run `ollama serve`
PyAnnote error	Set HF_TOKEN in .env
Out of memory	Skip diarization/classification
Slow processing	Use Groq API or GPU

File Structure

VideoChunking/
├── src/               # Source code
├── cli.py             # Command-line interface
├── app.py             # Web interface
├── example.py         # Usage examples
├── output/            # Generated transcripts
├── temp/              # Temporary files
├── models/            # Speaker profiles
├── .env               # Your config (copy from .env.example)
└── requirements.txt   # Dependencies

Example Workflow

# 1. Process
python cli.py process session1.m4a --session-id s1

# 2. Identify speakers from output
cat output/s1_full.txt | head

# 3. Map speakers
python cli.py map-speaker s1 SPEAKER_00 "DM"
python cli.py map-speaker s1 SPEAKER_01 "Alice"
python cli.py map-speaker s1 SPEAKER_02 "Bob"
python cli.py map-speaker s1 SPEAKER_03 "Charlie"

# 4. Verify
python cli.py show-speakers s1

# 5. Use outputs
cat output/s1_ic_only.txt  # Session notes!

Python API

from src.pipeline import DDSessionProcessor

# Option 1: Manual entry
processor = DDSessionProcessor(
    session_id="my_session",
    character_names=["Thorin", "Elara"],
    player_names=["Alice", "Bob", "DM"]
)

# Option 2: Use party config (recommended)
processor = DDSessionProcessor(
    session_id="my_session",
    party_id="default"
)

result = processor.process(
    input_file="session.m4a",
    output_dir="./output"
)

print(result['statistics'])

Output Format Examples

Full Transcript

[00:15:23] DM (IC): Je betreedt een donkere grot.
[00:15:45] Alice as Thorin (IC): Ik steek mijn fakkel aan.
[00:16:02] Bob (OOC): Haha, alweer een grot!

IC-Only

[00:15:23] DM: Je betreedt een donkere grot.
[00:15:45] Thorin: Ik steek mijn fakkel aan.

JSON

{
  "segments": [{
    "text": "Je betreedt een donkere grot.",
    "speaker": "DM",
    "classification": "IC",
    "start_time": 923.45,
    "end_time": 928.12
  }]
}

Documentation

README.md - Overview
SETUP.md - Installation
USAGE.md - Detailed guide
DEVELOPMENT.md - Technical details

Quick help: python cli.py --help

Diagnostics & Status Quick Reference

python app_manager.py ? launch the manager (auto-refreshing status, option recap, per-stage clocks).
Diagnostics tab (Gradio UI) ? Discover Tests lists pytest node IDs; Run Selected Tests executes chosen entries; Run All Tests runs pytest -q.
Recent events in the manager include "Next" hints when a stage finishes, making it easy to track long pipelines at a glance.

Story Notebook Cheatsheet

Document Viewer ? Fetch Google Doc (share link set to view) to refresh notebook context.
Story Notebooks ? Refresh Sessions ? select session ? generate narrator or character POV (files land in output/<session>/narratives/).
Use Refresh Notebook Context in the tab if you update the source doc mid-session.
Dashboard idle indicator: if you see "Status tracker: idle", start the app from the manager to refresh; live stage data only appears during an active run.

Campaign Dashboard Cheatsheet

Access: Campaign Dashboard tab → Select campaign → Click "Refresh Campaign Info"
Health Indicators: 🟢 90-100% | 🟡 70-89% | 🟠 50-69% | 🔴 0-49%
Components: Party config, settings, knowledge base, character profiles, sessions, narratives
Status: ✅ Configured | ⚠️ Needs attention | ❌ Missing

Campaign Library Cheatsheet

Access: Campaign Library tab → Select campaign → Click "Load Knowledge Base"
Auto-extraction: Enabled by default when processing sessions
Disable: Uncheck "Skip Campaign Knowledge Extraction" when processing
Tracked: 🎯 Quests | 👥 NPCs | 🔓 Plot hooks | 📍 Locations | ⚡ Items

Import Session Notes Cheatsheet

Access: Import Session Notes tab
Purpose: Backfill early sessions from written notes (no recording needed)
Process: Session ID → Campaign → Paste/upload notes → Enable options → Import
Options: Extract Knowledge | Generate Narrative
Output: models/knowledge/{campaign}_knowledge.json + output/imported_narratives/

MCP Toolchain (Future)

python tools/make_pipeline_flowchart.py ? generates pipeline flowchart (Graphviz).
Agent commands (LangChain) to be added for automated runs.
Retrieval/LLM backends selectable via config.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quick Reference Guide

Installation

Basic Usage

Web UI

CLI - Simple

CLI - Full Options

CLI - With Party Config

Commands

File Outputs

Configuration (.env)

Common Options

Processing Times

Troubleshooting

File Structure

Example Workflow

Python API

Output Format Examples

Full Transcript

IC-Only

JSON

Documentation

Diagnostics & Status Quick Reference

Story Notebook Cheatsheet

Campaign Dashboard Cheatsheet

Campaign Library Cheatsheet

Import Session Notes Cheatsheet

MCP Toolchain (Future)

FilesExpand file tree

QUICKREF.md

Latest commit

History

QUICKREF.md

File metadata and controls

Quick Reference Guide

Installation

Basic Usage

Web UI

CLI - Simple

CLI - Full Options

CLI - With Party Config

Commands

File Outputs

Configuration (.env)

Common Options

Processing Times

Troubleshooting

File Structure

Example Workflow

Python API

Output Format Examples

Full Transcript

IC-Only

JSON

Documentation

Diagnostics & Status Quick Reference

Story Notebook Cheatsheet

Campaign Dashboard Cheatsheet

Campaign Library Cheatsheet

Import Session Notes Cheatsheet

MCP Toolchain (Future)