Code to train a custom time-domain autoencoder to dereverb audio
Updated Nov 30, 2023 · Python
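As a sketch of what a time-domain dereverberation autoencoder can look like: a minimal PyTorch model with a strided `Conv1d` encoder and a `ConvTranspose1d` decoder. The layer sizes and the `TinyDereverbAE` name are assumptions for illustration, not this repository's actual architecture; training would minimize, e.g., MSE between the model's output on reverberant audio and the clean reference.

```python
import torch
import torch.nn as nn

class TinyDereverbAE(nn.Module):
    """Minimal time-domain autoencoder sketch (illustrative, not the repo's model)."""
    def __init__(self, hidden=32):
        super().__init__()
        # Encoder: two strided 1-D convolutions compress the waveform 64x in time.
        self.encoder = nn.Sequential(
            nn.Conv1d(1, hidden, kernel_size=16, stride=8, padding=4), nn.ReLU(),
            nn.Conv1d(hidden, hidden, kernel_size=16, stride=8, padding=4), nn.ReLU(),
        )
        # Decoder: transposed convolutions mirror the encoder back to sample rate.
        self.decoder = nn.Sequential(
            nn.ConvTranspose1d(hidden, hidden, kernel_size=16, stride=8, padding=4), nn.ReLU(),
            nn.ConvTranspose1d(hidden, 1, kernel_size=16, stride=8, padding=4), nn.Tanh(),
        )

    def forward(self, x):
        return self.decoder(self.encoder(x))

model = TinyDereverbAE()
wet = torch.randn(1, 1, 4096)   # (batch, channel, samples) of reverberant audio
out = model(wet)                # decoder output has the same length as the input
```

With kernel 16, stride 8, and padding 4 the transposed convolutions exactly invert the encoder's downsampling, so input and output lengths match without cropping.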
Dual-model speech AI toolkit for speaker verification and speaker-aware diarization, with streaming inference, meeting analysis, long-audio monitoring, and speaker-bank integration.
Real-time speech enhancement pipeline — custom-trained U-Net denoising model, ONNX inference, Overlap-Add synthesis, and virtual audio routing for Teams, Zoom, and DAW use. CPU-only, no cloud dependency.
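The Overlap-Add synthesis step can be sketched with NumPy alone (the `frames_ola` helper, window size, and hop are assumptions): frames are windowed with a periodic Hann window at 50% overlap, processed (identity here; an enhancement model would run per frame), and summed back. Because 50%-overlap periodic Hann windows sum to 1, interior samples reconstruct exactly.

```python
import numpy as np

def periodic_hann(n):
    """Periodic Hann window: adjacent 50%-overlapped copies sum to exactly 1."""
    return 0.5 - 0.5 * np.cos(2 * np.pi * np.arange(n) / n)

def frames_ola(x, win=512, hop=256):
    """Split x into windowed frames, (no-op) process, and overlap-add back."""
    w = periodic_hann(win)
    n_frames = 1 + (len(x) - win) // hop
    out = np.zeros(len(x))
    for i in range(n_frames):
        start = i * hop
        frame = x[start:start + win] * w   # a denoising model would run here
        out[start:start + win] += frame    # overlap-add synthesis
    return out

rng = np.random.default_rng(0)
x = rng.standard_normal(4096)
y = frames_ola(x)
# interior samples (away from the un-overlapped edges) reconstruct exactly
```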
A custom MCP server that separates a YouTube track into stems (vocals, drums, bass) and extracts a sonic signature: BPM, musical key, stereo width, transient punch, and a 512-dim CLAP semantic embedding. Runs locally on CPU via Demucs and librosa.
Engine identification using acoustic signal analysis and machine learning to classify 8 vehicle types. Audio signals are processed using FFT and feature extraction, and a multi-class model predicts vehicle categories based on their unique sound patterns.
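The FFT feature-extraction idea can be sketched as log band energies over the magnitude spectrum (the `band_energies` helper and 8-band layout are assumptions; a real pipeline would likely use richer descriptors such as MFCCs):

```python
import numpy as np

def band_energies(signal, n_bands=8):
    """Log power in n_bands equal-width frequency bands (hypothetical feature set)."""
    power = np.abs(np.fft.rfft(signal)) ** 2          # one-sided power spectrum
    bands = np.array_split(power, n_bands)            # equal-width frequency bands
    return np.log(np.array([b.sum() for b in bands]) + 1e-10)

sr = 16000
t = np.arange(sr) / sr
feat = band_energies(np.sin(2 * np.pi * 440 * t))     # one second of a 440 Hz tone
```

A feature vector like this, computed per recording, is what a multi-class model would consume; for the 440 Hz tone almost all energy lands in the lowest band.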
AI-powered audio summarisation pipeline — Whisper transcription, LLM key-insight extraction, and structured spoken summaries with TTS playback and a Streamlit interface.
ML-based speech emotion recognition system that analyzes audio features to classify emotions with a simple interface for testing.
Machine learning system for music genre classification using feature engineering, stratified evaluation, SVC/XGBoost modeling, and reproducible prediction export.
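A minimal scikit-learn sketch of such a pipeline, using synthetic features in place of real audio descriptors (the dataset, feature count, and hyperparameters are all assumptions):

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

# Synthetic stand-in for per-track audio features and genre labels.
X, y = make_classification(n_samples=200, n_features=20, n_classes=4,
                           n_informative=8, random_state=0)

# Scale features, then fit an RBF-kernel SVC; evaluate with stratified folds
# so every fold preserves the genre distribution.
clf = make_pipeline(StandardScaler(), SVC(kernel="rbf", C=1.0))
scores = cross_val_score(clf, X, y,
                         cv=StratifiedKFold(n_splits=5, shuffle=True, random_state=0))
```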
Neural TTS and voice-cloning application using XTTS/VITS. Supports 3–30 s reference audio for speaker adaptation, real-time pitch/speed control, and WAV/MP3 export.
Automated audio/video ML pipeline for detecting and transcribing jazz solos from live recordings. Runs nightly against Smalls Jazz Club archives: uses CLAP (instrument detection), Demucs (source separation), CLIP (performer identification), and basic-pitch (MIDI transcription). Results served via REST API.
Key features: simple VAE architecture with encoder/decoder; synthetic music-data generation for training; interactive training with progress tracking; music generation from latent-space sampling; audio conversion and playback; downloadable audio files.
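The latent-space sampling step can be illustrated with the VAE reparameterization trick (NumPy sketch; the 16-dim latent and the zero-valued statistics are placeholders that a trained encoder would supply):

```python
import numpy as np

rng = np.random.default_rng(0)
latent_dim = 16
mu = np.zeros(latent_dim)        # encoder mean (placeholder values)
log_var = np.zeros(latent_dim)   # encoder log-variance (placeholder values)

# Reparameterization trick: z = mu + sigma * eps, with eps ~ N(0, I),
# keeps the sampling step differentiable with respect to mu and log_var.
eps = rng.standard_normal(latent_dim)
z = mu + np.exp(0.5 * log_var) * eps   # a decoder would map z to an audio frame
```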
Audio file processing pipeline with GPT-4-powered error diagnosis — detects codec issues, sample rate mismatches, and corruption artefacts with automated remediation suggestions.
Music harmony AI — chord progression analysis with Roman numeral labelling, voice leading checker, style-conditioned progression generation (Baroque/Jazz/Pop), and MIDI export via music21.
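In a major key, Roman-numeral labelling reduces to mapping a chord root's scale degree to a numeral. A dependency-free sketch (the `roman_numeral` helper is hypothetical; the project itself uses music21):

```python
# Diatonic triads of a major key, labelled with Roman numerals.
MAJOR_SCALE = [0, 2, 4, 5, 7, 9, 11]                  # semitone offsets from tonic
NUMERALS = ["I", "ii", "iii", "IV", "V", "vi", "vii°"]

def roman_numeral(root_pc, tonic_pc=0):
    """Label a diatonic chord root (pitch class 0-11) relative to a major-key tonic."""
    degree = (root_pc - tonic_pc) % 12
    if degree not in MAJOR_SCALE:
        return None                                    # chromatic root: out of scope here
    return NUMERALS[MAJOR_SCALE.index(degree)]

roman_numeral(7)   # G in C major → "V"
```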
Audio analysis in JavaScript/TypeScript.