[TTS] Text preprocessing - remove special characters before sending to TTS engine

## Problem
When generating TTS audio, special characters (HTML tags, markdown symbols, brackets, asterisks, etc.) are sent directly to the TTS engine, causing audio artifacts and mispronunciations.

## Current behavior
Only `UnbreakLine()` and JSON encoding are applied before sending text to TTS engines (in `TtsDownloadService.cs`).

## Proposed solution
Add a configurable `TtsTextPreprocessor` that:
- Strips HTML tags (`<i>`, `<b>`, `<font>`, etc.)
- Removes markdown formatting (`*`, `**`, `#`, etc.)
- Removes brackets and their content `[music]`, `(laughing)`
- Strips non-pronounceable characters
- Optionally converts numbers to words
- Configurable per-engine in `SeVideoTextToSpeech` settings

## Files affected
- `src/UI/Logic/Download/TtsDownloadService.cs` - add preprocessing call
- New: `TtsTextPreprocessor.cs`
- `src/UI/Logic/Config/SeVideoTextToSpeech.cs` - add settings

## Related upstream issues
- #10133 (AllTalk Czech diacritics lost - related encoding issue)

---
*Working on this: @Ironship*

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[TTS] Text preprocessing - remove special characters before sending to TTS engine #10395

Problem

Current behavior

Proposed solution

Files affected

Related upstream issues

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[TTS] Text preprocessing - remove special characters before sending to TTS engine #10395

Description

Problem

Current behavior

Proposed solution

Files affected

Related upstream issues

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions