Back to Candle

candle-metavoice

candle-examples/examples/metavoice/README.md

0.10.1627 B
Original Source

candle-metavoice

MetaVoice-1B is a text-to-speech model trained on 100K hours of speech, more details on the model card.

Note that the current candle implementation suffers from some limitations as of 2024-03-02:

  • The speaker embeddings are hardcoded.
  • The generated audio file quality is weaker than the Python implementation, probably because of some implementation discrepancies.

Run an example

bash
cargo run --example metavoice --release -- \
  --prompt "This is a demo of text to speech by MetaVoice-1B, an open-source foundational audio model."