nicdex @nicdex

**voxel** @voxel@merveilles.town · 15h *

15h *

voxel @voxel@merveilles.town

Non-blind user wondering about TTS accessibility problems

**Nagaram** @Nagaram@hachyderm.io · 1d

Nagaram @Nagaram@hachyderm.io

Spotify isn't working with GrapheneOS.

There seems to be fixes, but I think I'm just going to use this as an excuse to cancel spotify and use other things.

Anyone have a recommendation for #foss #TTS #ereaders ?

If I can't get my audiobooks reliably from Spotify, then I think a decent TTS Engine with epub capabilities would solve the problem. I have one through Google Play I was using but it wasn't foss. Pretty nice though.

**Dan Gero** @dangero@vocalounge.cafe · 2d *

2d *

Dan Gero @dangero@vocalounge.cafe

I wrote this blueprint for a web app that would make it easier for people to build voices and languages for different TTS engines. It's vague, but it's a start if anyone wants to contribute to it or eventually create the real thing. Boosts appreciated, as always. https://github.com/lower-elements/Voice-Creator-Studio #TTS #Accessibility #AI #ML

An easy way to create voices for any TTS engine. Contribute to lower-elements/Voice-Creator-Studio development by creating an account on GitHub.

GitHubGitHub - lower-elements/Voice-Creator-Studio: An easy way to create voices for any TTS engineAn easy way to create voices for any TTS engine. Contribute to lower-elements/Voice-Creator-Studio development by creating an account on GitHub.

**Terence Eden’s Blog** @blog@shkspr.mobi · Jul 21, 2021 *

Jul 21, 2021 *

Terence Eden’s Blog @blog@shkspr.mobi

Synthetic Poetry

https://shkspr.mobi/blog/2021/07/synthetic-poetry/

I've been experimenting with Amazon's Polly service. It's their fancy text-to-sort-of-human-style-speech system. Think "Alexa" but with a variety of voices, genders, and accents.

Here's "Brian" - their English, male, received pronunciation voice - reading John Betjeman's poem "Slough":

https://shkspr.mobi/blog/wp-content/uploads/2021/07/slough.mp4

The pronunciation of all the words is incredibly lifelike. If you heard it on the radio, it might sound like a half-familiar BBC presenter. It has a calm, even tone which suits the poem splendidly.

The rhythm is also spot on. That's mostly a function of the short lines and helpful punctuation the poem contains. Much like iambic pentameter, or a limerick, the syllables lend themselves to a specific and identifiable cadence.

But the emphasis is all wrong. The poem just... ends. There's no sense of finality in the tone. You'd expect a competent reader to recognise "tinned minds" as being worthy of stressing. Polly does have some capability to mark specific words for emphasis, but it's all very manual.

There's no synthetic emotion. Do you feel the rage, desperation, sadness, hopelessness of the poem? While Polly has some SSML (Speech Synthesis Markup Language) support - the range of emotions it can express are severely limited. And, again, must be applied manually.

"I used to be an adventurer like you, but then i took an arrow in the knee!"

One of the reasons stock phrases pop up so often in video games is that it is expensive to write and record thousands of different lines of dialogue.

We're almost at a stage where a computer can procedurally generate lines for background characters to speak, and then "record" an audio version in an array of styles. No more expensive voice actors, no more memetic references for in-group homophily. Each player of a game will have a completely different dialogue experience.

But the bit that we're still missing is the automation of emphasis and emotion and comic timing and understatement and... all the things which trained actors spend years learning how to do successfully.

In 2011, the film critic Roger Ebert had surgery which eliminated his voice. He proposed the following "Ebert Test" for synthetic voices:

If the computer can successfully tell a joke, and do the timing and delivery, as well as Henny Youngman, then that’s the voice I want.

We're so close, I can taste it. The Turing Test for realistic voices is whether they can move the audience to tears with poetry.

Terence Eden’s Blog · Jul 21, 2021Synthetic Poetry

More from

Terence Eden

#AI #Amazon #tts

**Terence Eden’s Blog** @blog@shkspr.mobi · Jul 20

Jul 20

Terence Eden’s Blog @blog@shkspr.mobi

1KB JS Numbers Station

https://shkspr.mobi/blog/2025/07/1kb-js-numbers-station/

Code Golf is the art/science of creating wonderful little demos in an artificially constrained environment. This year the js1024 competition was looking for entries with the theme of "Creepy".

I am not a serious bit-twiddler. I can't create JS shaders which produce intricate 3D worlds in a scrap of code. But I can use slightly obscure JavaScript APIs!

There's something deliciously creepy about Numbers Stations - the weird radio frequencies which broadcast seemingly random numbers and words. Are they spies communicating? Commands for nuclear missiles? Long range radio propagation tests? Who knows!

So I decided to build one. Play with the demo.

Obviously, even the most extreme opus compression can't fit much audio into 1KB. Luckily, JavaScript has you covered! Most modern browsers have a built-in Text-To-Speech (TTS) API.

Here's the most basic example:

m = new SpeechSynthesisUtterance;m.text = "Hello";speechSynthesis.speak(m);

Run that JS and your computer will speak to you!

In order to make it creepy, I played about with the rate (how fast or slow it speaks) and the pitch (how high or low).

m.rate=Math.random();m.pitch=Math.random()*2;

It worked disturbingly well! High pitched drawls, rumbling gabbling, the languid cadence of a chattering friend. All rather creepy.

But what could I make it say? Getting it to read out numbers is pretty easy - this will generate a random integer:

s = Math.ceil( Math.random()*1000 );

But a list of words would be tricky. There's not much space in 1,024 bytes for anything complex. The rules say I can't use any external resources; so are there any internal sources of words? Yes!

Object.getOwnPropertyNames( globalThis );

That gets all the properties of the global object which are available to the browser! Depending on your browser, that's over 1,000 words!

But there's a slight problem. Many of them are quite "computery" words like "ReferenceError", "URIError", "Float16Array". I wanted all the single words - that is, anything which only has one capital letter and that's at the start.

const l = (n) => {    return ((n.match(/[A-Z]/g) || []).length === 1 && (n.charAt(0).match(/[A-Z]/g) || []).length === 1);};//   Get a random result from the filters = Object.getOwnPropertyNames( globalThis ).filter( l ).sort( ()=>.5-Math.random() )[0]

Rather pleasingly, that brings back creepy words like "Event", "Atomics", and "Geolocation".

Of course, Numbers Stations don't just broadcast in English. The TTS system can vocalise in multiple languages.

//   Set the language to Russianm.lang = "ru-RU";

OK, but where do we get all those language strings from? Again, they're built in and can be retrieved randomly.

var e = window.speechSynthesis.getVoices();m.lang = e[ (Math.random()*e.length) |0 ]

If you pass the TTS the number 555 and ask it to speak German, it will read out fünfhundertfünfundfünfzig.

And, if you tell the TTS to speak an English word like "Worker" in a foreign language, it will pronounce it with an accent.

Randomly altering the pitch, speed, and voice to read out numbers and dissociated words produces, I think, a rather creepy effect.

If you want to test it out, you can press this button. I find that it works best in browsers with a good TTS engine - let me know how it sounds on your machine.

🅝🅤🅜🅑🅔🅡🅢 🅢🅣🅐🅣🅘🅞🅝

With the remaining few bytes at my disposal, I produced a quick-and-dirty random pattern using Unicode drawing blocks. It isn't very sophisticated, but it does have a little random animation to it.

You can play with all the js1024 entries - I would be delighted if you voted for mine.

Random monochrome tiles with the word Numbers Station superimposed.

Terence Eden’s Blog · Jul 201KB JS Numbers Station

More from

Terence Eden

**Jenny** @TheJnxx@bonito.cafe · Jul 18 *

Jul 18 *

Jenny @TheJnxx@bonito.cafe

Hablemos de los TTS, antes era algo que ignoraba demasiado, pero recientemente lo veo como una utilidad para escribir mejor ciertas cosas, como el uso de las tildes

Hay una app de Google que viene prácticamente en cualquier teléfono (como casi todas) la cual cumple con esto, mi duda es, como se hara para quienes no tienen Google? Pues esta un poco complicada la cosa, al menos para mi que no me gusta tener dos apps si una sola puede hacerlo perfecto, aunque para evitar eso utilizo esta web pero para quienes lo quieran mas cómodo, aqui el dato

La cosa esta asi, la mayoría de alternativas no traen la funcion para que lea las palabras y eso que en teoria es un TTS, solo proporcionan la voz, asi que si quieren dicha funcion existe una app aparte...

La que proporciona la voz: RHVoice (Recomendada)

Para leer las palabras, con ayuda de la que proporciona la voz: TTS Util

Si, sería mas comodo que al menos RHVoice tuviera esa funcion ya implemetada, pero bueno, algo es algo...

www.text-to-speech.onlineFree Text to Speech Online Converter ToolsWe developed an online text-to-speech synthesis tool, which converts text into natural and smooth human voice, provides 100+ speakers for you to choose, supports multi-language, multi-dialect and Chinese-English mixing, and can configure audio flexibly parameter. It is widely used in news reading, travel navigation, intelligent hardware and notification broadcasting. And can convert the text content into MP3 files to download and save.

#TipsDeJenny #Android #TTS

**Andre Louis** @FreakyFwoof@universeodon.com · Jul 18 *

Jul 18 *

Andre Louis @FreakyFwoof@universeodon.com

Here's a quick demo on how to enable TTS on the Nintendo Switch 2 from the home screen. Hopefully these menus are the same across all devices, though I have no way to know that for certain.

Edit: For other blind Switch/Switch 2 owners, I started a WhatsApp group to discuss the accessibility of the console and it's games. DM if you'd like to join.

Download: https://onj.me/media/Switch2_Accessibility.mp3
#Nintendo #Switch2 #Accessibility #TTS #ScreenReader

**digituba** @digituba@mastodon.social · Jul 18

Jul 18

digituba @digituba@mastodon.social

Blue SAM by Gregfeel/Lepsi De
https://demozoo.org/productions/374494/

The C64 sings Blue (Da Ba Dee) by Eiffel 65 using SAM.

#RetroGaming #RetroComputing
#TTS #Commodore #C64

**Devin Prater :blind:** @pixelate@tweesecake.social · Jul 9

Jul 9

Devin Prater :blind: @pixelate@tweesecake.social

Awww, the Alexa feature where it would read aloud Kindle books isn't available for Alexa Plus. Ah well, I'm just glad Kindle works much better on Android now.
#accessibility #kindle #amazon #alexa #AlexaPlus #blind #TTS

**digituba** @digituba@mastodon.social · Jul 3

Jul 3

digituba @digituba@mastodon.social

I have been investigating some more, retro sounding, speech synthesiser TTS tools and recently discovered this modern build of DECtalk:
https://github.com/dectalk/dectalk

More details on DECtalk:
https://en.wikipedia.org/wiki/DECtalk

You may recognise the voice as Stephen Hawking; it was also used in the move: Back To The Future II

#TTS #Movies #Retro

Replied in thread

**Debby** @debby@hear-me.social · Jul 3 *

Jul 3 *

Debby @debby@hear-me.social

@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as flatpak and https://github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

**digituba** @digituba@mastodon.social · Jul 2

Jul 2

digituba @digituba@mastodon.social

SAM Software Automatic Mouth
https://discordier.github.io/sam/
This is a vanilla Javascript port of the Text-To-Speech (TTS) software SAM (Software Automatic Mouth) for the Commodore C64 published in the year 1982 by Don't Ask Software (now SoftVoice, Inc.).

It works in your web browser in the link above
#RetroGaming #Commodore #C64 #TTS

**LinuxNews.de** @linuxnews@social.anoxinon.de · Jul 2

Jul 2

LinuxNews.de @linuxnews@social.anoxinon.de

Mozilla Common Voice Corpus 22.0 veröffentlicht
https://linuxnews.de/mozilla-common-voice-corpus-22-0-veroeffentlicht/ #mozilla #tts #opensource

Replied in thread

**Kevin Karhan** @kkarhan@infosec.space · Jun 29

Jun 29

Kevin Karhan @kkarhan@infosec.space

@purplerabbit @nileane yeah, this really pisses me off too.

#YouTube deciding to #AutoDub shit for no valid reason is only.worsened by the fact that they have the most horrendous #TTS voices one can imagine.

Not even the funny "We are Anonymous!" kinda style but the most shitty output ever!

**moagee** @moagee@chaos.social · Jun 22

Jun 22

moagee @moagee@chaos.social

völlig underrated:

#SpeechNote ist eine datenschutzfreundliche Linux-App, die Sprache in Text umwandelt (#STT), Text vorliest (auch Dateien) (#TTS) und übersetzt – alles lokal ohne Internetverbindung.
Viele Sprachen und Open-Source-Modelle stehen zum einbinden zur Verfügung!

Continued thread

**chibi-[N]ah** @alex@social.nah.re · Jun 21 *

Jun 21 *

chibi-[N]ah @alex@social.nah.re

Piper-TTS :

https://github.com/rhasspy/piper

Voix style GlaDOS :

https://github.com/TazzerMAN/piper-voice-glados-fr

A fast, local neural text to speech system. Contribute to rhasspy/piper development by creating an account on GitHub.

GitHubGitHub - rhasspy/piper: A fast, local neural text to speech systemA fast, local neural text to speech system. Contribute to rhasspy/piper development by creating an account on GitHub.

#piperTTS #piper #tts

Continued thread

**chibi-[N]ah** @alex@social.nah.re · Jun 21

Jun 21

chibi-[N]ah @alex@social.nah.re

Test un peu plus sérieux.

Commande utilisée

echo "Enfin, que dis-je, enfin, finalement, une synthèse vocale avec une voix française qui prononce les mots de manière intelligible ! Ça change tellement des voix sans prosodie !" | ./piper --model voices/fr_FR-upmc-medium.onnx --output-wav synt.wav

#piperTTS #TTS

**chibi-[N]ah** @alex@social.nah.re · Jun 21 *

Jun 21 *

chibi-[N]ah @alex@social.nah.re

Quitte à utiliser une IA, autant utiliser la voix de GlaDOS.

Enfin une synthèse vocale (TTS) avec une voix en français qui fonctionne et est intelligible.

Autrement dit, juste un prétexte pour tester piper-tts

Commande utilisée

echo "Quitte à utiliser une IA ; autant utiliser la voix de Gla DOSSE." | ./piper --model voices/fr_FR-glados-medium.onnx --output-raw | aplay -r 22500 -f S16_LE -t raw -D pipewire -

#piperTTS #TTS

**Tarren (They/Them)** @Tarrenvane@dragonscave.space · Jun 20

Jun 20

Tarren (They/Them) @Tarrenvane@dragonscave.space

Things I never thought I'd say: "I can't understand myself; I talk too fast."
***Hashtags***
#iOS #PersonalVoice #TTS #Voiceover

Recent searches

Search options

Administered by:

Server stats:

#tts