finetuning-vision-models

Here are 14 public repositories matching this topic...

ReinFlow / ReinFlow

[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., Pi0, Pi0.5, GR00TN1.5. Fully open-sourced.

flow robotics rl manipulation locomotion vla robot-learning fine-tuning post-training actorcritic pi0 policygradient finetuning-rl visuomotor finetuning-vision-models flowmatching onlinerl

Updated Mar 21, 2026
Python

SuyogKamble / simpleVLM

Star

building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2 Vision Model. KV-Caching is supported and implemented from scratch as well

nlp computer-vision deep-learning transformers pytorch vlm multimodal huggingface llm vision-language-model finetuning-llms finetuning-vision-models

Updated Feb 19, 2026
Jupyter Notebook

shreydan / simpleVLM

Star

building a simple VLM. Implementing LlaMA-SmolLM2 from scratch + SigLip2 Vision Model. KV-Caching is supported and implemented from scratch as well

nlp computer-vision deep-learning transformers pytorch vlm multimodal huggingface llm vision-language-model finetuning-llms finetuning-vision-models

Updated May 12, 2025
Jupyter Notebook

sidd707 / Aurigen-AI-Powered-Jewelry-Design-Studio

Star

AI-powered jewelry design studio using fine-tuned Stable Diffusion XL + ControlNet. Generate photorealistic rings, necklaces, earrings & bracelets from text prompts with a Streamlit interface.

image-generation text-to-image diffusion-models streamlit streamlit-webapp stable-diffusion generative-ai controlnet sdxl finetuning-vision-models

Updated Apr 22, 2026
Jupyter Notebook

Raxephion / loRA-Strength-Analyser

Star

A Python script to analyze images generated using a LoRA (Low-Rank Adaptation) model applied at various strength levels. This tool helps determine an optimal strength for a given LoRA by evaluating image quality and similarity to control images.

fine-tuning finetuning transformers-models safetensors low-rank-adaptation finetuning-large-language-models finetuning-vision-models

Updated May 24, 2025
Python

umair-hassan2 / paligemma-3b-finetuning

Star

Fine-tuned 3B parameters PaliGemma2 vision model on Valorant object detection improving IoU scores across all classes. Project is developed for research experimentation.

torch quantization huggingface vision-transformer vision-language-model siglip finetuning-vision-models

Updated Aug 29, 2025
Jupyter Notebook

carlos-h-Al / HouseCatVision

Star

Building models from scratch and tuning pre-trained models to recognise different house cats

python computer-vision cnn-for-visual-recognition finetuning-vision-models

Updated Nov 14, 2025
Jupyter Notebook

DURGESH716 / Fine_tuned_Multimodal_AI_Retinal_Diagnostic_System

Star

Multimodal Medical AI Fine-Tuned on Qwen-2.5-VL-7B with LoRA + Medical Distillation

ai-safety distillation medical-ai finetuning-vision-models qwen2-5-vl-7b

Updated Feb 16, 2026
Python

Absurd7550 / lfm2-vl-finetune-guide

Star

Fine-tuning LiquidAI/LFM2-VL-1.6B in Colab (LoRA/4-bit) + dataset template + probe test.

colab lora peft finetuning vision-language-model finetuning-vision-models liquidai lfm2 lfm2-vl

Updated Jan 5, 2026
Jupyter Notebook

surajrao2003 / DINO_Object_Detection

Star

Fine-tuning DINO object detection model on a COCO-annotated pedestrian dataset from IIT Delhi. Includes data prep, training, evaluation, and visualization scripts.

transformers pytorch dino pedestrian-detection finetuning-vision-models

Updated May 23, 2025
Jupyter Notebook

beingdutta / Gemma-3-Multimodal-Finetuning-Using-Native-PyTorch

Star

PyTorch Native finetuning of Multimodal Instruction tuned model (Gemma 3) from Google.

pytorch finetuning pytorch-implementation finetuning-vision-models gemma3

Updated Mar 10, 2026
Jupyter Notebook

MMDPROJECT / multi-tag-image-classification-with-fashion-product-images

Star

This repository includes of a Multi-Tag (acronyms are Multi-Task and Multi-Output as well) Image Classification on Fashion Products Images dataset on Kaggle using EfficientNetB0 with high accuracies

computervision imageclassification efficientnetb0 finetuning-vision-models

Updated Jul 19, 2025
Jupyter Notebook

JackTheProgrammer / Fine-tuned-YOLO11

Star

A fine tuned YOLO11 model up to 100 epochs. This custom dataset based fine tuned yolo11s is down streamed on the task of traffic signals detection in both images, videos. Furthermore, the model has been exported to the ONNX format as well. You may export it to your desired serialization format.

Updated Apr 17, 2026
Python

khadimhussain0 / kiffusion

Star

A toolkit for training and fine-tuning diffusion model LoRAs.

lora text-to-image finetuning huggingface diffusion-models flux2 finetuning-vision-models

Updated Feb 10, 2026
Python

Improve this page

Add a description, image, and links to the finetuning-vision-models topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the finetuning-vision-models topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

finetuning-vision-models

Here are 14 public repositories matching this topic...

ReinFlow / ReinFlow

SuyogKamble / simpleVLM

shreydan / simpleVLM

sidd707 / Aurigen-AI-Powered-Jewelry-Design-Studio

Raxephion / loRA-Strength-Analyser

umair-hassan2 / paligemma-3b-finetuning

carlos-h-Al / HouseCatVision

DURGESH716 / Fine_tuned_Multimodal_AI_Retinal_Diagnostic_System

Absurd7550 / lfm2-vl-finetune-guide

surajrao2003 / DINO_Object_Detection

beingdutta / Gemma-3-Multimodal-Finetuning-Using-Native-PyTorch

MMDPROJECT / multi-tag-image-classification-with-fashion-product-images

JackTheProgrammer / Fine-tuned-YOLO11

khadimhussain0 / kiffusion

Improve this page

Add this topic to your repo