Persistent CUDA Error (device-side assert) during VITS Fine-tuning with Bengali Dataset on RTX 3060 #4346
surjoshikha asked this question in General Q&A (unanswered)
Replies: 1 comment
This is likely the same issue as idiap#424. Set your character config as described there, replacing the characters with the Bengali ones. I'd also strongly recommend using our fork (available via
Goal:
I am trying to fine-tune the `tts_models/multilingual/multi-dataset/vits_vctk` model on a custom Bengali (Bangla) voice dataset using Coqui TTS.

Dataset:
- […] `filename|text|text`).
- […] `.wav` files (mono, 22050 Hz).
- […] `config.json`.

Configuration:
- […] `config.json` provided in troubleshooting (includes custom alphabet, `use_cuda: true`, `batch_size` tested at 16, 8, and 4).
- `restore_path`: `tts_models/multilingual/multi-dataset/vits_vctk`

Persistent Error:
Despite extensive troubleshooting, training consistently crashes with `RuntimeError: CUDA error: device-side assert triggered`. The traceback consistently points to low-level CUDA operations within PyTorch, often `torch.embedding` or `F.conv1d`. This occurs even with very small batch sizes (e.g., 4).

Troubleshooting Steps Taken:
- […] `venv`).
- […] `metadata.txt` format (3-column) and dataset path.
- […] `config.json`.
- […] `metadata.txt` and remove unknown/hidden characters.
- […] `batch_size` significantly (down to 4).
- […] `use_cuda: false` and `CUDA_VISIBLE_DEVICES=""`), but still encountered CUDA errors (indicating a deep conflict).
- […] `torch-2.9.0+cu121`) and `torchcodec` multiple times.
- […] `pip install TTS`).

System Environment:
OS: Windows 11
GPU: NVIDIA GeForce RTX 3060
NVIDIA Driver: 581.57 (Studio Driver)
Python: 3.11.9
PyTorch:
Name: torch
Version: 2.5.1+cu121
Summary: Tensors and Dynamic neural networks in Python with strong GPU acceleration
Home-page: https://pytorch.org/
Author: PyTorch Team
Author-email: packages@pytorch.org
License: BSD-3-Clause
Location: K:\Speech_to_text\my_own_model\venv_new\Lib\site-packages
Requires: filelock, fsspec, jinja2, networkx, sympy, typing-extensions
Required-by: encodec, torchaudio, torchvision, trainer, TTS
TTS:
Name: TTS
Version: 0.22.0
Summary: Deep learning for Text to Speech by Coqui.
Home-page: https://github.com/coqui-ai/TTS
Author: Eren Gölge
Author-email: egolge@coqui.ai
License: MPL-2.0
Location: K:\Speech_to_text\my_own_model\venv_new\Lib\site-packages
Requires: aiohttp, anyascii, bangla, bnnumerizer, bnunicodenormalizer, coqpit, cython, einops, encodec, flask, fsspec, g2pkk, gruut, hangul_romanize, inflect, jamo, jieba, librosa, matplotlib, nltk, num2words, numba, numpy, packaging, pandas, pypinyin, pysbd, pyyaml, scikit-learn, scipy, soundfile, spacy, torch, torchaudio, tqdm, trainer, transformers, umap-learn, unidecode
Required-by:
Request:
Could this persistent CUDA error be related to a known incompatibility with the RTX 3060, the specific driver version, or a subtle issue in the VITS model's handling of Bengali script during fine-tuning? Any insights or suggestions would be greatly appreciated.
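A device-side assert raised inside `torch.embedding` commonly means that an index fell outside the embedding table, so before suspecting the GPU or driver it may be worth re-checking the text data itself. The sketch below is one way to do that with the standard library only: it checks that every metadata row has three `|`-separated columns and counts characters that are not in the vocabulary. The function names `audit_metadata` and `describe`, the sample rows, and the idea that the vocabulary is simply the set of characters and punctuation marks from the config are assumptions made for illustration, not part of Coqui TTS.

```python
import unicodedata
from collections import Counter


def audit_metadata(lines, vocab, n_columns=3, sep="|"):
    """Return (line numbers with a wrong column count, Counter of characters not in vocab)."""
    malformed = []
    unknown = Counter()
    for line_no, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line:
            continue  # skip blank lines
        fields = line.split(sep)
        if len(fields) != n_columns:
            malformed.append(line_no)
            continue
        # Check every text column (everything after the file name).
        for text in fields[1:]:
            unknown.update(ch for ch in text if ch not in vocab)
    return malformed, unknown


def describe(ch):
    """Readable label for a character, useful for invisible code points."""
    return f"U+{ord(ch):04X} {unicodedata.name(ch, 'UNNAMED')}"


if __name__ == "__main__":
    # Hypothetical data; in practice read metadata.txt with encoding="utf-8"
    # and build vocab from the "characters" and "punctuations" config entries.
    sample = "আমি।"
    vocab = set(sample) | {" "}
    rows = [
        f"a1|{sample}|{sample}",
        f"a2|আমি\u200c।|{sample}",  # contains a zero-width non-joiner
        "a3|only two columns",
    ]
    malformed, unknown = audit_metadata(rows, vocab)
    print("lines with wrong column count:", malformed)
    for ch, count in unknown.most_common():
        print(describe(ch), count)
```

Printing the code point and Unicode name rather than the character itself makes hidden characters such as zero-width joiners or a byte-order mark visible in the report.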
config.json
{
"run_name": "MyPerfectBengaliModel_V1",
"output_path": "training_output/",
"model": "vits",
"phonemizer": {},
"use_phonemes": false,
"characters": {
"pad": "_",
"eos": "~",
"bos": "^",
"characters": "অআইঈউঊঋএঐওঔকখগঘঙচছজঝঞটঠডঢণতথদধনপফবভমযরলশষসহড়ঢ়য়ৎংঃঁািীুূৃেৈোৌ্ABCDFIMNSTacdefghilmnorstu۱۲৩৪৫৬৭৮৯০0123468*‘॥–—’\ufeffড়য়১২",
"punctuations": "।,-;? !'\"().:",
"phonemes": ""
},
"model_args": {
"num_speakers": 0,
"use_d_vector_file": false,
"d_vector_file": null,
"d_vector_dim": 0
},
"datasets": [
{
"formatter": "ljspeech",
"name": "FinalFormattedDataset",
"path": "FinalFormattedDataset/",
"meta_file_train": "metadata.txt"
}
],
"eval_split_size": 0.02,
"restore_path": "tts_models/multilingual/multi-dataset/vits_vctk",
"batch_size": 8,
"eval_batch_size": 8,
"epochs": 1000,
"learning_rate": 0.0001,
"num_loader_workers": 4,
"num_eval_loader_workers": 4,
"save_step": 1000,
"print_step": 25,
"use_cuda": true
}
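Because the `characters` string above is long and mixes Bengali letters, Latin letters, several digit systems, and an escaped `\ufeff`, a quick audit of the symbol set can help rule out vocabulary problems. The following is a minimal sketch, not Coqui TTS functionality: `audit_symbols` is a hypothetical helper that lists repeated symbols, invisible format characters (Unicode category `Cf`, which includes U+FEFF), and whether NFC normalization would change the string, which would suggest that some letters can be encoded in more than one way.

```python
import unicodedata
from collections import Counter


def audit_symbols(characters, punctuations=""):
    """Return (duplicated symbols, invisible format characters, changed_by_nfc)."""
    symbols = characters + punctuations
    counts = Counter(symbols)
    # Symbols that occur more than once across both strings.
    duplicates = sorted(ch for ch, n in counts.items() if n > 1)
    # Category "Cf" covers format characters such as U+FEFF or zero-width joiners.
    invisible = sorted(ch for ch in counts if unicodedata.category(ch) == "Cf")
    # True when the string is not already in NFC form.
    changed_by_nfc = unicodedata.normalize("NFC", symbols) != symbols
    return duplicates, invisible, changed_by_nfc


if __name__ == "__main__":
    # Hypothetical short symbol set; in practice pass the "characters" and
    # "punctuations" values loaded from config.json with the json module.
    duplicates, invisible, changed = audit_symbols("কখগক\ufeff", "।,")
    print("duplicates:", [f"U+{ord(c):04X}" for c in duplicates])
    print("invisible:", [f"U+{ord(c):04X}" for c in invisible])
    print("changed by NFC:", changed)
```

If the audit reports anything, applying the same cleanup and normalization to both the config and `metadata.txt` keeps the two consistent; whether that resolves the assert here is not something this sketch can confirm.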