Version: 2.0.0 Last Updated: December 2025
CodeSeeker is an intelligent CLI tool that enhances Claude Code with semantic search, knowledge graphs, and AI-powered code analysis.
- Installation & Quick Start
- Command-Line Usage
- Infrastructure Setup
- Project Initialization
- Semantic Search
- Project Management
- Code Analysis
- Natural Language Queries
- Built-in Commands
- Environment Variables
# Install globally
npm install -g codeseeker
# Or link from source
cd CodeSeeker
npm run build && npm link
# Verify installation
codeseeker --help# 1. Setup infrastructure (first time only)
codeseeker setup
# 2. Initialize your project
cd /path/to/your/project
codeseeker init
# 3. Ask questions about your code
codeseeker -c "what is this project about"codeseeker [options] [command]| Option | Description |
|---|---|
-V, --version |
Output the version number |
-p, --project <path> |
Specify project path |
-c, --command <cmd> |
Execute single command and exit |
-t, --transparent |
Skip interactive prompts, output context directly |
--no-color |
Disable colored output |
-h, --help |
Display help information |
# Start interactive REPL mode
codeseeker
# Start with specific project path
codeseeker -p /path/to/project
# Execute single command and exit
codeseeker -c "analyze main entry point"
# Execute in transparent mode (no prompts)
codeseeker -t -c "what files handle authentication"
# Direct command (same as -c)
codeseeker "what is this project about"One-time infrastructure setup for Docker containers and databases.
codeseeker setup [options]| Option | Description |
|---|---|
--force |
Force setup even if already configured |
--skip-docker |
Skip Docker container setup |
--skip-db, --skip-databases |
Skip database initialization |
--project-path <path> |
Setup from specific directory |
- Verifies Docker is installed and running
- Starts required containers (PostgreSQL, Neo4j, Redis)
- Initializes database schemas
- Creates required indexes and extensions (pgvector)
# Full infrastructure setup
codeseeker setup
# Force re-setup
codeseeker setup --force
# Setup without Docker (use existing databases)
codeseeker setup --skip-docker
# Setup with custom project path
codeseeker setup --project-path /path/to/projectInitialize a project for CodeSeeker analysis. Creates embeddings, knowledge graph, and configuration.
codeseeker init [options] [path]| Option | Description |
|---|---|
--reset |
Complete cleanup and reinitialization |
--quick |
Initialize without indexing (faster) |
--new-config |
Reset project configuration (for copied folders) |
- Registers project in database
- Creates
.codeseeker/project.jsonconfiguration - Indexes codebase for semantic search (unless
--quick) - Builds initial knowledge graph in Neo4j
- Creates
CODESEEKER.mdtemplate if missing
# Initialize current directory
codeseeker init
# Quick initialization (no indexing)
codeseeker init --quick
# Complete reset (clears all data)
codeseeker init --reset
# Reset config for copied project folder
codeseeker init --new-config🚀 Initializing CodeSeeker project...
📁 Project path: /path/to/project
📊 Setting up database...
✅ Database connection established
📋 Registering project...
✅ Project registered: MyProject (uuid)
🔍 Indexing codebase for semantic search...
✅ Indexed 150 files, 450 code segments
🕸️ Building knowledge graph...
✅ Knowledge graph created with triads
📝 Setting up project instructions...
✅ CODESEEKER.md already exists
🎉 CodeSeeker project initialized successfully!
Index codebase and perform semantic searches using vector embeddings.
codeseeker search [query] [options]| Option | Description |
|---|---|
--index |
Index/reindex the codebase |
--threshold=<value> |
Similarity threshold (default: 0.7) |
--limit=<value> |
Maximum results (default: 10) |
--verbose |
Show detailed content previews |
# Index codebase for semantic search
codeseeker search --indexIndexing features:
- Incremental indexing (only changed files)
- Content hashing for change detection
- Automatic cleanup of deleted files
- Uses Xenova/all-MiniLM-L6-v2 model (384 dimensions)
# Basic search
codeseeker search "authentication middleware"
# Search with options
codeseeker search "database connection" --threshold=0.5 --limit=20 --verbose# Index the codebase first
codeseeker search --index
# Search for specific functionality
codeseeker search "user validation"
# Search with lower threshold for more results
codeseeker search "error handling" --threshold=0.5
# Verbose search with content previews
codeseeker search "API endpoints" --verbose --limit=15🔍 Searching for: "authentication middleware"
🧠 Found 450 code segments to search
🔍 Search Results (5 found):
📄 Result 1:
File: src/middleware/auth.ts
Type: code
Similarity: 89.2%
📄 Result 2:
File: src/services/auth-service.ts
Type: code
Similarity: 76.5%
Manage project registration, identity, and duplicates.
codeseeker project <subcommand> [args]| Subcommand | Alias | Description |
|---|---|---|
list |
ls |
List all registered projects |
info |
- | Show detailed project information |
id |
- | Get deterministic ID for a path |
cleanup |
clean |
Clean up duplicate project entries |
duplicates |
dups |
Find all duplicate project entries |
help |
- | Show project command help |
# List all projects
codeseeker project list
codeseeker project ls
# Show project info
codeseeker project info
codeseeker project info /path/to/project
# Get deterministic ID
codeseeker project id
codeseeker project id /path/to/project
# Find duplicates
codeseeker project duplicates
codeseeker project dups
# Clean up duplicates for current project
codeseeker project cleanup
codeseeker project clean /path/to/project
# Show help
codeseeker project help=== Registered Projects ===
✓ MyProject
ID: 2b9a2a85-b3e4-47cc-ade1-2da2724fe0f4
Path: /workspace/myproject
Status: active
Embeddings: 450
✓ AnotherProject
ID: df588867-a7f4-4964-b0e2-f732cb3438f2
Path: /workspace/another
Status: active
Embeddings: 280
CodeSeeker uses deterministic project IDs based on SHA-256 hash of normalized path:
- Ensures consistent IDs across reinitializations
- Prevents duplicate entries for the same project
- Handles path normalization (Windows/Unix differences)
Perform AI-enhanced code analysis using the 11-step workflow.
codeseeker analyze <query>- Query Analysis - Detects assumptions and ambiguities
- Task Decomposition - Splits complex queries into focused sub-tasks
- User Clarification - Prompts for clarification when needed
- Hybrid Search - Semantic + text + path search with RRF fusion
- Knowledge Graph Query - Analyzes code relationships
- Sub-Task Context - Generates tailored context per sub-task
- Context Building - Combines all sources into enhanced context
- AI Analysis - Generates contextual recommendations
- File Approval - Confirms any file modifications
- Build/Test Verification - Validates code changes
- Database Sync - Updates embeddings and knowledge graph
# Analyze specific functionality
codeseeker analyze "how does user authentication work"
# Analyze code patterns
codeseeker analyze "where is validation logic implemented"
# Analyze architecture
codeseeker analyze "what design patterns are used"🔍 Analyzing: "how does user authentication work"
🚀 Starting enhanced AI workflow...
1️⃣ Detecting assumptions...
📝 Detected 3 contextual assumptions
• Project implements API/service architecture
• Security and authorization is important
• User seeks understanding of system behavior
2️⃣ Processing clarifications...
🎯 Enhanced query: "how does user authentication work (Context: ...)"
3️⃣ Performing semantic search...
🧠 Found 450 code segments to search
🔍 Found 5 relevant code segments
4️⃣ Querying knowledge graph...
🕸️ Found 12 relationships in knowledge graph
5️⃣ Building enhanced context...
📊 Context enhancement quality: 8/10
6️⃣ Generating AI analysis...
🤖 Generated contextual analysis with 4 recommendations
7️⃣ File modifications: Not implemented in MVP
8️⃣ Preparing summary...
📋 Analysis Summary:
Query: how does user authentication work
Code segments found: 5
Relationships found: 12
🔍 Key Insights:
1. Contains function/class definitions for auth
2. Handles imports and dependencies
3. Provides testing logic and validation scenarios
💡 Recommendations:
1. Review 5 code segments found for "authentication"
2. Check consistency across 3 files
3. Analyze 12 architectural relationships
✅ Enhanced analysis completed!
Ask questions directly without specific commands. CodeSeeker automatically detects natural language and triggers the enhanced workflow.
Before entering a prompt, you can toggle semantic search on/off. This is useful when:
- You want to skip file discovery and send prompts directly to Claude
- You know exactly what you want Claude to do without context gathering
- You want faster responses for simple queries
When using the menu-based prompt interface, you'll see:
Search: ON
? Options:
> Enter prompt (with search)
Turn OFF search (skip file discovery)
Cancel
Select "Turn OFF search" to disable semantic search, then enter your prompt.
When using the inline prompt interface:
[s] to toggle search | Search: ON
>
- Type
sand press Enter to toggle search mode - Or type your prompt directly
codeseeker -c "<natural language query>"
# or in interactive mode, just type your question# Understanding the project
codeseeker -c "what is this project about"
codeseeker -c "explain the main architecture"
# Finding code
codeseeker -c "where are the API endpoints defined"
codeseeker -c "show me files related to user authentication"
# Code changes (triggers approval workflow)
codeseeker -c "add error handling to the database service"
codeseeker -c "create a new middleware for request logging"
# Analysis requests
codeseeker -c "check this codebase for SOLID violations"
codeseeker -c "find duplicate code patterns"codeseeker> what is this project about
codeseeker> show me how authentication works
codeseeker> create a new API endpoint for users
Available in both slash-command (/command) and direct (command) formats.
/help # Show all available commands
help # Same as above/status # Show current project and service status
status # Same as aboveShows:
- Current project name, path, and ID
- File count and embeddings count
- Database connection status
- Workflow service status
/history # Show command history
/history show # Same as above
/history clear # Clear command history
/history help # Show history helpFeatures:
- Per-project command history
- Stored in
~/.codeseeker/history/ - Up to 100 commands preserved
/exit # Exit CodeSeeker
/quit # Same as above
exit # Same as above
quit # Same as aboveConfigure CodeSeeker behavior via environment variables or .env file.
| Variable | Default | Description |
|---|---|---|
DB_HOST |
localhost |
PostgreSQL host |
DB_PORT |
5432 |
PostgreSQL port |
DB_NAME |
codeseeker |
Database name |
DB_USER |
codeseeker |
Database user |
DB_PASSWORD |
codeseeker123 |
Database password |
| Variable | Default | Description |
|---|---|---|
NEO4J_URI |
bolt://localhost:7687 |
Neo4j connection URI |
NEO4J_USER |
neo4j |
Neo4j user |
NEO4J_PASSWORD |
codeseeker123 |
Neo4j password |
| Variable | Default | Description |
|---|---|---|
REDIS_HOST |
localhost |
Redis host |
REDIS_PORT |
6379 |
Redis port |
REDIS_PASSWORD |
(empty) | Redis password |
# Database
DB_HOST=localhost
DB_PORT=5432
DB_NAME=codeseeker
DB_USER=codeseeker
DB_PASSWORD=codeseeker123
# Neo4j
NEO4J_URI=bolt://localhost:7687
NEO4J_USER=neo4j
NEO4J_PASSWORD=codeseeker123
# Redis
REDIS_HOST=localhost
REDIS_PORT=6379Low semantic similarity scores:
# Regenerate embeddings with correct model
codeseeker search --indexProject not found:
# Initialize the project first
codeseeker initDatabase connection failed:
# Check Docker containers
docker ps
# Restart infrastructure
codeseeker setup --forcePath mismatch (copied project folder):
# Reset configuration for new location
codeseeker init --new-configDuplicate projects:
# Find duplicates
codeseeker project duplicates
# Clean up
codeseeker project cleanup| Shortcut | Action |
|---|---|
Ctrl+C (once) |
Show warning, press again to exit |
Ctrl+C (twice) |
Force exit |
Escape |
Interrupt current operation |
Ctrl+Z |
Interrupt current operation |
Up/Down |
Navigate command history |
CodeSeeker uses Xenova/all-MiniLM-L6-v2 for all embeddings:
- 384-dimensional vectors
- Consistent model across indexing and retrieval
- pgvector extension in PostgreSQL for similarity search
- Deterministic IDs using SHA-256 of normalized path
- Prevents duplicates across reinitializations
- Supports project migration and path changes
- PostgreSQL: Projects, embeddings, analysis results
- Neo4j: Knowledge graph (files, classes, relationships)
- Redis: Caching and session management
- Issues: https://github.com/anthropics/claude-code/issues
- Documentation:
codeseeker --help - Interactive Help:
/helpin REPL mode