This project implements a Speech-to-Text (STT) system built using Transformers.
It provides modular components for audio preprocessing, model inference, and pipeline integration, making it easy to extend and adapt to different speech recognition tasks.
- Speech-to-Text transcription using Transformer-based models.
- Modular architecture:
components/– Core building blocksmodels/– Transformer-based speech recognition modelspipeline/– End-to-end pipeline for transcriptionutils/– Helper functions
- Configurable settings in
configuration/. - Logging and exception handling built-in.
STT/
│── cloud_storage/ # Cloud storage utilities
│── components/ # Core components of the STT system
│── configuration/ # Configuration files
│── constants/ # Constant values
│── entity/ # Data/entity definitions
│── exceptions/ # Custom exception handling
│── logger/ # Logging setup
│── models/ # Transformer models for STT
│── pipeline/ # Transcription pipeline
│── utils/ # Utility functions
│── templates/ # Templates (if needed for deployment/UI)
│── app.py # Main entry point
│── requirements.txt # Python dependencies
│── setup.py # Setup script
- Clone the repo:
git clone https://github.com/mohamedabbouda/STT.git
cd STT
- Install dependencies:
pip install -r requirements.txt
python pipeline.py
python app.py