Commit ad5af03 ("Add docs")
Parent: 878df0d

17 files changed: 1237 additions and 1172 deletions

File tree

- assets/favicon.png (2.99 KB)
- docs/assets/favicon.png (2.99 KB)
- docs/assets/logo.png (622 KB)

docs/getting-started/index.md (19 additions, 22 deletions)

````diff
@@ -1,52 +1,49 @@
-# Getting Started
+# 🚀 Getting Started
 
-Welcome to **RLM Code** -- the Research Playground and Evaluation OS for Recursive Language Model (RLM) agentic systems. RLM Code provides an interactive TUI-based development environment for building, benchmarking, and optimizing agent workflows through slash commands and natural language.
+Welcome to **RLM Code**, the Research Playground and Evaluation OS for Recursive Language Model (RLM) agentic systems. RLM Code provides a unified TUI-based development environment for building, benchmarking, and optimizing agent workflows through slash commands and natural language.
 
 ---
 
-## What is RLM Code?
+## 🧪 What is RLM Code?
 
 RLM Code implements the **Recursive Language Model** paradigm from the research paper *"Recursive Language Models"* (Zhang, Kraska, Khattab, 2025). It extends the paper's concepts with:
 
-- **Context-as-variable**: Context is stored as a REPL variable rather than in the token window, enabling unbounded output and token-efficient processing.
-- **Deep recursion**: Support for recursion depth > 1, exceeding the paper's original limitation.
-- **Multi-paradigm execution**: Pure RLM, CodeAct, and Traditional paradigms with side-by-side comparison.
-- **Pluggable observability**: MLflow, OpenTelemetry, LangSmith, LangFuse, and Logfire integrations.
-- **Sandbox runtimes**: Local, Docker, Apple Container, Modal, E2B, and Daytona execution environments.
+- 🧠 **Context-as-variable**: Context is stored as a REPL variable rather than in the token window, enabling unbounded output and token-efficient processing
+- 🔁 **Deep recursion**: Support for recursion depth > 1, exceeding the paper's original limitation
+- 🔀 **Multi-paradigm execution**: Pure RLM, CodeAct, and Traditional paradigms with side-by-side comparison
+- 📊 **Pluggable observability**: MLflow, OpenTelemetry, LangSmith, LangFuse, and Logfire integrations
+- 📦 **Sandbox runtimes**: Local, Docker, Apple Container, Modal, E2B, and Daytona execution environments
 
 ---
 
-## Where to Go Next
+## 📚 Where to Go Next
 
 | Guide | Description |
 |-------|-------------|
-| [Installation](installation.md) | System requirements, package installation, optional dependencies, and verification |
-| [Quick Start](quickstart.md) | Launch the TUI, connect a model, run your first benchmark, and explore session replay |
-| [CLI Reference](cli.md) | Complete reference for both entry points and all 50+ slash commands |
-| [Configuration](configuration.md) | Full `rlm_config.yaml` schema, environment variables, and ConfigManager API |
+| [📦 Installation](installation.md) | System requirements, package installation, optional dependencies, and verification |
+| [Quick Start](quickstart.md) | Launch the TUI, connect a model, run your first benchmark, explore the Research tab |
+| [💻 CLI Reference](cli.md) | Complete reference for the entry point and all 50+ slash commands |
+| [⚙️ Configuration](configuration.md) | Full `rlm_config.yaml` schema, environment variables, and ConfigManager API |
 
 ---
 
-## Quick Overview
+## Quick Overview
 
 ```bash
 # Install
 pip install rlm-code
 
-# Launch the standard TUI
+# Launch the unified TUI
 rlm-code
 
-# Launch the Research TUI directly
-rlm-research
-
 # Connect to a model and run a benchmark
 /connect anthropic claude-sonnet-4-20250514
 /rlm bench preset=dspy_quick
 /leaderboard
 ```
 
-!!! tip "First Time?"
-    Start with the [Installation](installation.md) guide to set up your environment, then follow the [Quick Start](quickstart.md) for a hands-on walkthrough.
+!!! tip "🆕 First Time?"
+    Start with the [📦 Installation](installation.md) guide to set up your environment, then follow the [Quick Start](quickstart.md) for a hands-on walkthrough.
 
-!!! info "Two TUI Modes"
-    RLM Code ships with two TUI modes: the **Standard TUI** (multi-pane workspace with chat, files, details, and shell panels) and the **Research TUI** (dark-themed research lab interface with file browser, code viewer, and metrics bar). Use `rlm-code` for the standard mode or `rlm-research` (or `rlm-code --research`) for the research mode.
+!!! info "🖥️ Unified TUI"
+    RLM Code ships a **single TUI** with 5 tabs: **💬 Chat**, **📁 Files**, **📋 Details**, **⚡ Shell**, and **🔬 Research**. Use `rlm-code` to launch, and press `Ctrl+5` to access the Research tab for experiment tracking, benchmarks, and session replay.
````
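The "context-as-variable" bullet in this file is the diff's one genuinely novel technical claim, so a sketch may help: the long context lives in a REPL namespace as an ordinary variable, and the model emits short code snippets that inspect it, so only small results ever pass through the token window. This is an illustrative toy, not RLM Code's actual API; every name below is hypothetical.

```python
# Illustrative sketch of "context-as-variable": the full log text never
# enters the prompt; the model's generated snippet queries it instead.
repl_env = {"context": "error: disk full\n" * 1000 + "warning: retry\n" * 5}

def run_snippet(code: str, env: dict) -> str:
    """Execute a model-written snippet against the env; return its `result`."""
    scope = dict(env)
    exec(code, {}, scope)  # snippet assigns to `result` in the local scope
    return str(scope.get("result", ""))

# Instead of reading ~16 KB of logs through its token window, the model asks:
print(run_snippet("result = context.count('error')", repl_env))  # prints 1000
```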

docs/getting-started/installation.md (3 additions, 3 deletions)

````diff
@@ -1,4 +1,4 @@
-# Installation
+# 📦 Installation
 
 This guide covers how to install RLM Code, its optional dependencies, and how to verify your installation.
 
@@ -88,8 +88,8 @@ The multi-pane terminal interface requires Textual:
 pip install rlm-code[tui]
 ```
 
-!!! note "TUI Required for Interactive Mode"
-    The `textual` package (>= 0.86.0) is required for both the Standard TUI and the Research TUI. Without it, only headless/scripting usage is available.
+!!! note "🖥️ TUI Required for Interactive Mode"
+    The `textual` package (>= 0.86.0) is required for the TUI with all 5 tabs (Chat, Files, Details, Shell, Research). Without it, only headless/scripting usage is available.
 
 ### LLM Providers
````
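The note above says the app falls back to headless/scripting usage when `textual` is absent. A common way to implement that gate is `importlib.util.find_spec`, which detects a package without importing it; the function name below is hypothetical, not RLM Code's code.

```python
# Sketch of gating interactive mode on the optional `textual` dependency.
from importlib.util import find_spec

def pick_mode() -> str:
    # find_spec returns None when the package is not installed,
    # so we can choose a mode without paying the import cost.
    return "tui" if find_spec("textual") is not None else "headless"

print(pick_mode())
```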

docs/getting-started/quickstart.md (40 additions, 38 deletions)

````diff
@@ -1,18 +1,18 @@
-# Quick Start
+# Quick Start
 
-This guide walks you through launching RLM Code, connecting to an LLM, running your first benchmark, viewing the leaderboard, and exploring session replay -- all in under 10 minutes.
+This guide walks you through launching RLM Code, connecting to an LLM, running your first benchmark, viewing the leaderboard, and exploring the Research tab, all in under 10 minutes.
 
 ---
 
-## Prerequisites
+## Prerequisites
 
 Before you begin, make sure you have:
 
-- [x] Python 3.10+ installed
-- [x] RLM Code installed (`pip install rlm-code[tui,llm-all]`)
-- [x] At least one LLM API key (OpenAI, Anthropic, or Gemini) or a local Ollama instance
+- [x] 🐍 Python 3.10+ installed
+- [x] 📦 RLM Code installed (`pip install rlm-code[tui,llm-all]`)
+- [x] 🔑 At least one LLM API key (OpenAI, Anthropic, or Gemini) or a local Ollama instance
 
-!!! tip "Local Models"
+!!! tip "🏠 Local Models"
     You can use RLM Code entirely with local models via [Ollama](https://ollama.com/). No API keys needed:
 
     ```bash
@@ -21,7 +21,7 @@ Before you begin, make sure you have:
 
 ---
 
-## Step 1: Launch the TUI
+## Step 1: 🚀 Launch the TUI
 
 Navigate to a project directory (not your home directory) and launch:
 
@@ -30,28 +30,14 @@ mkdir -p ~/projects/rlm-demo && cd ~/projects/rlm-demo
 rlm-code
 ```
 
-!!! warning "Directory Safety Check"
+!!! warning "⚠️ Directory Safety Check"
     RLM Code performs a safety check on startup. It will warn you if you are running from your home directory, Desktop, Documents, or a system directory. Always run from a dedicated project directory.
 
-You should see the RLM Code TUI with a multi-pane layout: a chat panel, file browser, details panel, and shell.
-
-### Alternative: Research TUI
-
-For a dark-themed research lab interface with file browser, code viewer, and metrics bar:
-
-```bash
-rlm-research
-```
-
-Or use the flag:
-
-```bash
-rlm-code --research
-```
+You should see the **RLM Research Lab** TUI with 5 tabs: 💬 Chat, 📁 Files, 📋 Details, ⚡ Shell, and 🔬 Research. The Chat tab is active by default.
 
 ---
 
-## Step 2: Initialize Your Project
+## Step 2: 📁 Initialize Your Project
 
 Initialize a project configuration file:
 
````
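The directory safety check described in the warning above can be sketched in a few lines; the exact directories RLM Code treats as unsafe are not specified in this diff beyond home, Desktop, Documents, and system directories, so the list and function name below are guesses for illustration.

```python
# Sketch of a startup directory safety check (hypothetical, not RLM Code's).
from pathlib import Path

UNSAFE = {Path.home(), Path.home() / "Desktop", Path.home() / "Documents", Path("/")}

def is_safe_project_dir(cwd: Path) -> bool:
    # Resolve symlinks on both sides before comparing.
    return cwd.resolve() not in {p.resolve() for p in UNSAFE}

print(is_safe_project_dir(Path.home()))                            # False
print(is_safe_project_dir(Path.home() / "projects" / "rlm-demo"))  # True
```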

````diff
@@ -63,7 +49,7 @@ This creates an `rlm_config.yaml` in your current directory with default setting
 
 ---
 
-## Step 3: Connect to a Model
+## Step 3: 🔗 Connect to a Model
 
 Use the `/connect` command to connect to an LLM provider:
 
@@ -125,7 +111,7 @@ This shows the current model, provider, connection status, sandbox runtime, and
 
 ---
 
-## Step 4: Run a Benchmark
+## Step 4: 🏆 Run a Benchmark
 
 RLM Code ships with 10+ built-in benchmark presets. Start with the quick DSPy smoke test:
 
@@ -178,7 +164,7 @@ Supported formats include explicit preset mappings, Pydantic-style dataset cases
 
 ---
 
-## Step 5: View the Leaderboard
+## Step 5: 📊 View the Leaderboard
 
 After running benchmarks, view aggregated results on the leaderboard:
 
@@ -209,7 +195,7 @@ Available ranking metrics: `reward`, `completion_rate`, `steps`, `tokens`, `cost
 
 ---
 
-## Step 6: Compare Paradigms
+## Step 6: 🔀 Compare Paradigms
 
 Run the same task through multiple paradigms and compare:
 
@@ -227,7 +213,7 @@ Use the comparison command for direct A/B analysis:
 
 ---
 
-## Step 7: Session Replay
+## Step 7: Session Replay
 
 Every RLM run generates a trajectory that can be replayed step by step.
 
````
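One hunk above lists the leaderboard's ranking metrics (`reward`, `completion_rate`, `steps`, `tokens`, `cost`). The aggregation behind such a leaderboard can be sketched as a stable sort over run records; the record shape here is hypothetical, not RLM Code's schema.

```python
# Illustrative leaderboard ranking over run records (hypothetical shape).
runs = [
    {"model": "claude-sonnet", "reward": 0.91, "steps": 7},
    {"model": "gpt-4o", "reward": 0.84, "steps": 5},
    {"model": "llama3", "reward": 0.91, "steps": 9},
]

def leaderboard(runs, metric="reward", descending=True):
    # sorted() is stable, so ties on the metric keep their input order.
    return sorted(runs, key=lambda r: r[metric], reverse=descending)

print(leaderboard(runs)[0]["model"])                               # claude-sonnet
print(leaderboard(runs, metric="steps", descending=False)[0]["model"])  # gpt-4o
```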

````diff
@@ -255,7 +241,7 @@ Session replay supports both JSONL trajectory files and JSON snapshot files.
 
 ---
 
-## Step 8: Explore Slash Commands
+## Step 8: ⌨️ Explore Slash Commands
 
 RLM Code has 50+ slash commands. Here are the most useful ones to explore next:
 
@@ -310,7 +296,22 @@
 
 ---
 
-## Full Workflow Example
+## 🔬 Step 9: Explore the Research Tab
+
+After running a benchmark, press `Ctrl+5` to switch to the **🔬 Research** tab:
+
+- **📊 Dashboard**: See run metrics, reward sparkline, and summary
+- **📈 Trajectory**: Step-by-step breakdown of agent actions and rewards
+- **🏆 Benchmarks**: Leaderboard table from all your runs
+- **⏪ Replay**: Step-through controls for time-travel debugging
+- **📡 Events**: Live event stream from the RLM event bus
+
+!!! tip "🔬 Research Tab"
+    The Research tab updates automatically when you run `/rlm bench` or `/rlm run` commands. No manual refresh needed!
+
+---
+
+## 🎯 Full Workflow Example
 
 Here is a complete workflow from start to finish:
 
@@ -363,10 +364,11 @@ rlm-code
 
 ---
 
-## What's Next?
+## 📚 What's Next?
 
-- **[CLI Reference](cli.md)**: Complete documentation for all commands and flags
-- **[Configuration](configuration.md)**: Customize every aspect of RLM Code via `rlm_config.yaml`
-- Explore the [Core Engine](../core/index.md) documentation for the RLM Runner, Environments, and Event System
-- Set up [Observability](../observability/index.md) with MLflow, OpenTelemetry, or your preferred platform
-- Learn about [Sandbox Runtimes](../sandbox/index.md) for isolated code execution with Docker, Modal, or E2B
+- 💻 **[CLI Reference](cli.md)**: Complete documentation for all commands and flags
+- ⚙️ **[Configuration](configuration.md)**: Customize every aspect of RLM Code via `rlm_config.yaml`
+- 🧠 **[Core Engine](../core/index.md)**: RLM Runner, Environments, and Event System
+- 🔬 **[Research Tab](../tui/research.md)**: Deep dive into the experiment tracking interface
+- 📊 **[Observability](../observability/index.md)**: MLflow, OpenTelemetry, LangSmith, LangFuse, Logfire
+- 📦 **[Sandbox Runtimes](../sandbox/index.md)**: Docker, Modal, E2B for isolated code execution
````
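The quickstart's session-replay hunk says trajectories are stored as JSONL (one JSON record per line). A toy replay loop over that format looks like the following; the record fields (`step`, `action`, `reward`) are hypothetical, since the diff does not show RLM Code's real trajectory schema.

```python
# Toy step-through replay of a JSONL trajectory (hypothetical record fields).
import io
import json

trajectory_jsonl = "\n".join(
    json.dumps({"step": i, "action": a, "reward": r})
    for i, (a, r) in enumerate(
        [("read_file", 0.1), ("run_tests", 0.5), ("submit", 1.0)], start=1
    )
)

def replay(fp):
    """Yield (step, action, reward) tuples, one JSONL record at a time."""
    for line in fp:
        rec = json.loads(line)
        yield rec["step"], rec["action"], rec["reward"]

for step, action, reward in replay(io.StringIO(trajectory_jsonl)):
    print(f"step {step}: {action} (reward={reward})")
```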
