Skip to content
View muhammad-ahmed-ghani's full-sized avatar
🤔
Thought Processing...
🤔
Thought Processing...

Block or report muhammad-ahmed-ghani

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Muhammad Ahmed Ghani — ML Lead @ ImagineArt


$ whoami --verbose

I'm Muhammad Ahmed Ghani — Machine Learning Lead at ImagineArt (formerly Vyro), where I build the models behind one of the fastest-growing creative AI platforms in the world.

I work across Speech, Vision, NLP, and Agentic AI — not as a timeline of past phases, but as four threads I'm actively pulling on right now. The interesting problems sit at the seams between them, and that's where I spend most of my time.

How I think about this work

Research is the starting line, not the trophy. Anyone can make a model produce one good output; making it produce ten million is a different discipline. That's where I live.

role:      ML Lead @ ImagineArt
           (formerly Vyro)
focus:     GenAI · CV · Agents
location:  Islamabad | Lahore 🇵🇰
education: BS CS, UCP
building:  ImagineArt
coffee:    ████████░░ 80%




$ cat ./domains.log

four domains I work across

Speech

Recognition, separation, and synthesis pipelines. Turning messy audio into structured signal — and back again — at production latency.

Vision

Detection, segmentation, generation, and editing. From frame-perfect video pipelines to real-time inference on tight budgets.

NLP

Language understanding, retrieval, and generation. Grounding LLMs in real data and real constraints, not just vibes.

Agentic AI

Planning, tool-use, memory, and recovery. Systems that do the work — not just describe how they'd do it if asked nicely.


Selected work

ImagineArt 2.0

Creative AI platform spanning image and video generation, serving millions of creators worldwide. Led the ML stack behind the 2.0 release — unifying modalities, tightening consistency, and shipping at scale.

Video AI Suite

Led the release of seven video-AI models at ImagineArt — pushing frame-to-frame consistency from "demo quality" to something creators can actually build with.

Open Source

Voice separation, video object removal pipelines, super-resolution ports, and inference tooling — quietly powering other people's projects on GitHub and Hugging Face.

Enterprise GenAI

Production-grade GenAI deployments in regulated industries — real customers, real compliance, real stakes. The kind of work that teaches you what "production" actually means.


$ ls -la ./arsenal

skill constellation

⌁ the full stack, unabbreviated
Domain Focus
Speech ASR · TTS · separation · voice cloning · audio pipelines
Vision detection · segmentation · generation · editing · super-resolution
NLP LLMs · RAG · fine-tuning · evaluation · prompt engineering
Agentic AI planning · tool-use · multi-agent orchestration · memory · MCP
Languages Python · C · C++ · JavaScript · Bash
Frameworks PyTorch · TensorFlow · 🤗 Transformers · FastAPI · Gradio
Infra ONNX · Docker · NVIDIA stack · AWS · GCP · HF Inference Endpoints
Data MongoDB · MySQL · SQLite · Pandas · NumPy

$ cat ./philosophy.md

from paper to production

Models are the easy part. Systems are the job.


$ ./connect.sh


signature wave

"Any sufficiently advanced neglect of monitoring is indistinguishable from magic — until it isn't."

↑ back to top

Pinned Loading

  1. svoice_demo svoice_demo Public

    A PyTorch Implementation of the paper Voice Separation with an Unknown Number of Multiple Speakers using gradio and Nvidia NEMO ASR model.

    Python 37 4

  2. transformers transformers Public

    Forked from huggingface/transformers

    🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

    Python

  3. RealESRGAN_ONNX RealESRGAN_ONNX Public

    High res Anime images by using the RealESRGAN onnx model. It will upscale 4x the input image by maintaining the original aspect ratio with resize algo.

    Python 21 3

  4. video-inpainting video-inpainting Public

    This repository provides a powerful AI-driven solution for removing objects from videos using text prompts. By integrating SAM2, Florence2, and ProPainter, the model enables precise and seamless ob…

    Python 1