A comprehensive toolkit for training and running lightweight adapters for GGUF-based language models (ERNIE, Llama, Mistral, Phi-3, etc.) without modifying the base model.
Accurate VRAM calculator for local LLMs (Llama 4, DeepSeek V3, Qwen 2.5). Accounts for GGUF quantization, GQA KV-cache context overhead, and CPU/GPU offloading limits.
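The GQA context overhead mentioned above is dominated by the KV cache, which scales with the number of key/value heads rather than query heads. A minimal sketch of that part of the calculation (the function name and the Llama-3-8B-style config below are illustrative assumptions, not taken from the repository):

```python
def kv_cache_bytes(n_layers: int, n_kv_heads: int, head_dim: int,
                   ctx_len: int, bytes_per_elem: int = 2) -> int:
    """Estimate KV-cache size in bytes for a GQA transformer.

    The factor of 2 covers the separate K and V caches; with GQA,
    the cache is sized by n_kv_heads, not the (larger) query-head count.
    """
    return 2 * n_layers * n_kv_heads * head_dim * ctx_len * bytes_per_elem

# Example: Llama-3-8B-style config (32 layers, 8 KV heads, head_dim 128),
# fp16 cache (2 bytes/element) at an 8192-token context.
size = kv_cache_bytes(n_layers=32, n_kv_heads=8, head_dim=128, ctx_len=8192)
print(f"{size / 2**30:.2f} GiB")  # 1.00 GiB
```

A model without GQA (KV heads equal to the 32 query heads) would need 4x this cache at the same context length, which is why the distinction matters for offloading estimates.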