You can also pass `--device mps` to have all the models set to device mps.
This setting:
The local mac optimal settings set the mode to be local as explained above and change the models to:
- Adds `--device mps` to use MPS for all models.
- LightningWhisperMLX
- Sets LightningWhisperMLX for STT
- MLX LM
- Sets MLX LM for language model
- MeloTTS
- Sets MeloTTS for TTS
### Recommended usage with Cuda
### Recommended usage with Cuda
...
@@ -117,6 +118,57 @@ python s2s_pipeline.py \
...
@@ -117,6 +118,57 @@ python s2s_pipeline.py \
For the moment, modes capturing CUDA Graphs are not compatible with streaming Parler-TTS (`reduce-overhead`, `max-autotune`).
For the moment, modes capturing CUDA Graphs are not compatible with streaming Parler-TTS (`reduce-overhead`, `max-autotune`).
### Multi-language Support
The pipeline supports multiple languages, allowing for automatic language detection or specific language settings. Here are examples for both local (Mac) and server setups: