I’m using https://github.com/rhasspy/piper mostly to create some audiobooks and read some posts/news, but the voices available are not always comfortable to listen to.

Do you guys have any recommendation for a voice changer to process these audio files?
Preferably it’ll have a CLI so I can include it in my pipeline to process RSS feeds, but I don’t mind having to work through an UI.
Bonus points if it can process the audio streams.

    • pe1uca@lemmy.pe1uca.devOP
      link
      fedilink
      English
      arrow-up
      11
      arrow-down
      2
      ·
      4 months ago

      Text to speech is what piper is doing.
      What I’m looking for is called voice changer since I want to change a voice which already read something.

      That’s exactly what I want: “the thing in the Darth Vader halloween masks” but for linux, preferably via CLI to ingest audio files and be able to configure it to change the voice as I want, not only Darth Vader.

      • catloaf@lemm.ee
        link
        fedilink
        English
        arrow-up
        20
        ·
        4 months ago

        Oh, I see. I think it would still be easier to either use a different voice in piper (the github page talks about this) or use a different tts program entirely.

      • bastion
        link
        fedilink
        English
        arrow-up
        4
        ·
        3 months ago

        So, all of the awkward pauses, the lack of inflection - you’re saying keep those, just change who it sounds like is speaking?