this post was submitted on 08 Oct 2024
201 points (98.1% liked)

Open Source

40624 readers
238 users here now

All about open source! Feel free to ask questions, and share news, and interesting stuff!

Useful Links

Rules

Related Communities

Community icon from opensource.org, but we are not affiliated with them.

founded 6 years ago
MODERATORS
 

I just found this.

APKs for TTS engines

Main page

This is huge!

As a german, I use thorsten medium as he simply made the best dataset.

Mixing english with german, speaking numbers, single letters, pausing without a "." but just a linebreak, all those can be essential.

And... it is nearly perfect! And all local!

This is crazy!

eSpeak can finally go to rest!

all 42 comments
sorted by: hot top controversial new old
[–] macattack@lemmy.world 32 points 11 months ago (1 children)

Might want to link to the official page for a bit more context

[–] macattack@lemmy.world 10 points 11 months ago* (last edited 11 months ago) (1 children)

To those late to the party, you can sample the piper-related voices here so that you're not in a crapshoot: https://rhasspy.github.io/piper-samples/

[–] boredsquirrel@slrpnk.net 2 points 11 months ago

Nice, thanks!

[–] BarHocker@discuss.tchncs.de 9 points 11 months ago

I there a way to install multiple languages? I, for example, would like to install both English and German so that I can switch between them depending on the language of the text.

[–] Lemongrab@lemmy.one 9 points 11 months ago
[–] Sir_Kevin@lemmy.dbzer0.com 8 points 11 months ago (1 children)

Can I install it via F-Droid?

[–] boredsquirrel@slrpnk.net 5 points 11 months ago (1 children)

It is FOSS, and the build instructions are here

https://k2-fsa.github.io/sherpa/onnx/android/build-sherpa-onnx.html

Those guys are incredibly active, look at this!

https://k2-fsa.github.io/sherpa/onnx/index.html

(And no, not on F-Droid, if you look at the crazy amount of packages that makes sense)

[–] Swedneck@discuss.tchncs.de 7 points 11 months ago (1 children)

that stupid huge amount of packages is honestly a pretty big UX issue, at least personally that just makes me chuckle at their optimism and close the page as every bit of energy i had vaporizes from the thought of figuring out what to download.

[–] boredsquirrel@slrpnk.net 0 points 11 months ago (1 children)

Just scroll down until the page is fully loaded and use the search function.

[–] PlexSheep@infosec.pub 1 points 11 months ago (1 children)

Yeah but search for what even?

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

your language abbreviation like "de" or "en"

[–] Rogers@lemmy.ml 7 points 11 months ago* (last edited 11 months ago) (2 children)

This is amazing! If you are looking for US EN and use a phone with arm64 I can recommend sherpa-onnx-1.10.27-arm64-v8a-en-tts-vits-piper-en_US-kristin-medium and sherpa-onnx-1.10.27-arm64-v8a-en-tts-vits-piper-en_US-norman-medium.

Edit: I don't seem to be able to get the engine to show up as a tts engine but it works well within the tts app itself. Hopefully I'll find a fix I've been searching for a good tts engine for android eBook apps.

Edit 2. Fixed I mistakenly downloaded the standalone version from https://k2-fsa.github.io/sherpa/onnx/tts/apk.html instead of https://k2-fsa.github.io/sherpa/onnx/tts/apk-engine.html

[–] boredsquirrel@slrpnk.net 2 points 11 months ago (2 children)

These names are a joke hahaha

[–] Rogers@lemmy.ml 7 points 11 months ago (1 children)

Yeah IMO they need an app that can download/manage models directly

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

That would then require running code from memory, which needs to be enabled in recent GrapheneOS. But all fine, they should absolutely do that as they obviously do that currently but during compilation.

Client app + wizard to select language and model type + downloaded model

[–] semperverus@lemmy.world 1 points 11 months ago* (last edited 11 months ago) (1 children)

You are clearly not the target audience then. There is a lot of extremely useful info in those model names.

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

Yes of course :) I use sherpa-onnx-1.10.26-arm64-v8a-de-tts-engine-vits-piper-de_DE-thorsten-medium and there is nothing irrelevant in that name.

Still after another commenter wrote how overwhelmed they are with that INSANE download page, I felt that when you dropped those 2 names

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

Yes they have 3 types of client apps

[–] xlash123@sh.itjust.works 4 points 11 months ago (1 children)

Lmao, there's a GLaDOS voice. I'm taking that

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

Whats that? Mind to share an audio file?

[–] Teppichbrand@feddit.org 4 points 11 months ago* (last edited 11 months ago) (1 children)

eSpeak is a damn good conversation starter. If I fire up my OsmAnd and some german robot with a dutch accent (or is it Kölsch?!) starts giving me directions noone ever not commented on it. Downloading Thorsten Medium right now.

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

Hahaha yes it is.

[–] justtobbi@feddit.org 3 points 11 months ago (1 children)

How do I apply these voices, so apps use it?

[–] justtobbi@feddit.org 2 points 11 months ago (1 children)

Go into Settings -> Accessability -> Speech (at the very bottom; at least on Android 13) and then TTS :) hope that helps

[–] boredsquirrel@slrpnk.net 2 points 11 months ago (1 children)

They have 3 types of clients, only one works as an Android TTS engine, the others are useless.

[–] justtobbi@feddit.org 2 points 11 months ago (2 children)

I downloaded the thorsten medium and wondered, what the difference to thorsten high would be?

[–] Teppichbrand@feddit.org 2 points 11 months ago

You can listen to them here

[–] boredsquirrel@slrpnk.net 1 points 11 months ago

Thorsten high is silly haha. Emotional is also not meant for TTS more for research I think.

I think thorsten made the only good model in German, I really want to make my own one! Or get some famous people on board?

[–] Eyck_of_denesle@lemmy.zip 1 points 10 months ago (1 children)

How do I install it, I don't understand

[–] boredsquirrel@slrpnk.net 1 points 10 months ago (2 children)

Search, click on link, install, open app, open android settings, search for tts, enable the app as tts

[–] Eyck_of_denesle@lemmy.zip 2 points 10 months ago* (last edited 10 months ago) (1 children)
[–] boredsquirrel@slrpnk.net 1 points 10 months ago (1 children)

Uhm that is the site I have linked

Trust me, I checked 3 times and that tiny link isnt making it easier XD

[–] Eyck_of_denesle@lemmy.zip 1 points 10 months ago* (last edited 10 months ago) (1 children)

You linked the download apk directly, not the index with all the apk's.

Edited my previous reply for you to see the url better

[–] boredsquirrel@slrpnk.net 1 points 10 months ago (1 children)

No the main link in the post is exactly the same.

[–] Eyck_of_denesle@lemmy.zip 1 points 10 months ago (1 children)

You edited it. Lmao why you messing with me.

[–] boredsquirrel@slrpnk.net 1 points 10 months ago* (last edited 10 months ago) (1 children)

I added the same link also to the text so that you can find it there too.

Click on either of these links, they are the same

[–] Eyck_of_denesle@lemmy.zip 1 points 10 months ago

I already found it. Thanks for sharing this. The quality is so damn good. The "high" ones lag a lot but medium works perfectly on my device.

[–] Eyck_of_denesle@lemmy.zip 1 points 10 months ago* (last edited 10 months ago) (1 children)

I don't know what to search. I downloaded a package from github releases android.bz2but it doesn't have any apk.

[–] boredsquirrel@slrpnk.net 1 points 10 months ago

Yeah well you need a .apk XD

just search for

  • your language
  • your android architecture (normally modern phones use arm64 or aarch64 or arm v8a
  • your preferred model, or just try out all

Example

https://huggingface.co/csukuangfj/sherpa-onnx-apk/resolve/main/tts-engine-new/1.10.29/sherpa-onnx-1.10.29-arm64-v8a-de-tts-engine-vits-piper-de_DE-thorsten-medium.apk