Biological code organization system with 1,029+ production-ready snippets - 95% token reduction for Claude/GPT with AI-powered discovery & offline packs
Updated Nov 10, 2025 - Python
Reduce Claude AI token consumption by 5x-27x using prompt-native workflows and structural code manifests
Advanced token reduction and prompt optimization framework for LLMs, featuring linguistic, algorithmic, and architectural patterns.
Variance-stable routing for 2-bit quantized MoE models. Features dynamic phase correction (Armen Guard), syntactic stabilization layer, and recursive residual quantization for efficient inference.
Do dense LMs develop MoE-like specialization as they scale? Measure it, visualize it, and turn it into speed.
TokenCave is a browser extension for Claude AI that helps you monitor and optimize token usage with real-time counters, usage insights, and a “caveman mode” that dramatically reduces output length while preserving technical accuracy.