mastodon.ie is one of the many independent Mastodon servers you can use to participate in the fediverse.
Irish Mastodon - run from Ireland, we welcome all who respect the community rules and members.


#texttospeech


Hi y’all!
v0.1.0 of my portable text to speech device software was released!
Information about it can be found at https://git.ka-so.me/kasanwa-solane/portable-tts

Why did I make this?
I made it as a tool for my personal accessibility toolbox. There are times when, for various reasons, I struggle with speech but am expected to speak aloud, or when speech is the most useful mode of communication in that context. This device lets me produce speech in a recognizable facsimile of my voice by typing on it.

Why a dedicated device?
Sometimes, I’m in situations where smartphones are disallowed, as well as any device with an integrated camera or microphone. A dedicated accessibility device that deliberately does not have those input modalities is much more acceptable in those spaces.

Everyone is encouraged to boost this post and/or share the link elsewhere. I want the people who might have a use case for this device to be able to find it.


@DigitaleEthik there are plenty of options for protecting people's anonymity without #WastefulComputing, which only helps big corporations (#Großkonzernen) in the #AiBubble and which, purely because of the enormous waste of resources, is ethically on the level of #Shitcoins like #Bitcoin:

  • #Anonymous already solved this with #TextToSpeech more than 15 years ago.

  • Most modern animation tools have lip syncing (the feature is over 10 years old in Source Filmmaker!).

  • Using "AI characters" (#KI) as presenters is more likely to damage credibility, because the #UncannyValley exists.

  • It normalizes the modus operandi (#ModiOperandi) of "AI content" beyond #Desinformation, which is inherently as ethically wrong as #Misgendering and #Deadnaming!

In other words: just because #Propaganda producers and disinformation outlets (#Desiformationsschleudern) such as #RIAnovostri / #RT / #sputnik / #redfish, #CCTV / #CGTN & #IRIB / #PressTV do this does not make it any less wrong!

  • It is naive to assume that regimes such as Venezuela's (#Venezulea) do not know who those editors (#Redakteur*innen) are.

  • Pseudonyms exist and work.

  • There are surely "photogenic" exiled opposition figures who would be suitable as newsreaders in front of a camera.

I therefore consider the use of "AI", apart from things that are not "AI" at all but effectively just pattern recognition (#Mustererkennung) on a #BigData basis, to be at best a highly questionable waste of resources (#Ressourcenverschwendung), whether of energy (#Energie) or #Hardware!

  • This deserves scrutiny in journalism (#Journalismus) in particular.

  • Especially since the results so far are not even usable as #Broll but are extremely #cringe!

There is no reason to assume that "AI in journalism" would have any positive effect. Rather, much like "press freedom (#Pressefreiheit) for media (#Medien) from states that systematically act against press freedom" (and that actively persecute, torture, disappear and in some cases murder journalists (#Journalist*innen) at home or even abroad!), it is more likely to achieve the opposite…

When large language models (LLMs) run by big corporations do something good for you, it's nice enough to get a chuckle. In this example I will show you the original photograph first, then the screenshot of what Lens did for me. You will see how handy it is that Lens superimposes an extra layer with the translated text, in any of the hundreds of languages that the model supports.


@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as a Flatpak and on GitHub: github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

Ah, Book 10 in the Outlander world by #DianaGabaldon has gotten a title:
"A Blessing For A Warrior Going Out".

Beautiful title, eh?
On her blog she says it doesn't mean Jamie is going to die (in fact, he won't die in the book); it just alludes to the St Michael's blessing for a man going to war, which she uses once in a while in the novels.
No publishing date, as the novel isn't finished.
(Any news on GameOfThrones, by the way?)

Am at my 4th read of the series. This time, what really gets on my nerves is how she changes back and forth which facts each character knows.
Mostly, the pattern is: character x suddenly doesn't know fact y.
Eg, wee Ian knew Will as Jamie's son since the incident with the snake in the loo.
In later books he doesn't know it and is surprised to recognize Jamie's features in Will's; a bit later still, Ian again recognizes Will as Jamie's son for the first time.

Or what the travellers assume about their ability changes back and forth, often contradicting their current train of thought on the matter.
Or a non-traveller who knows that character x is a traveller reverts to not knowing.

Annoying. Because it interrupts my immersive reading experience quite a lot when it gets me wondering: what if this character had remembered correctly what he already knew – how would the story, or this particular moment have played out?

Anyway. So Book 10 now has a title.

dianagabaldon.com/wordpress/20

And I found that the Australian iPhone voice Karen (premium) reads "Go Tell The Bees That I'm Gone" remarkably well via #textToSpeech in the Books app. It really works. On iOS 18.5.
I also tested Calibre's content server, where the book is presented in Safari on the iPhone. But Calibre doesn't support scrolling at all well in Safari.

I think the best German voice for novels is Yannik. But it appears to be a bit more glitchy around page turns than Karen in the Books app.
I'm surprised how well this works. Neither Karen nor Yannik is annoying or off-putting at all. It also helps that each voice has its own settings page, e.g. for pause between sentences, pitch and equalizer.
A professional reader like Davina Porter is of course wonderful. But I don't want to cough up € 40 for the audiobook, sorry.

Oh, and Davina Porter is not going to read #Book10 because she is retired now.


🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬

Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬

Follow him here: @thorstenvoice
or on YouTube: youtube.com/@ThorstenMueller


@silentLurker
I have found Cool Reader for Android useful; it is open source and highly customisable. Also the CereProc text-to-speech engines, about $2 for Android (in-app purchase).

There are lots of sources for .epub files, not least the Internet Archive at archive.org. The richest thoughts of humankind at the lowest possible cost!

github.com/buggins/coolreader

I'm sure others will be along with even better recommendations in due course.

:gnu: 💬 An example that opens your mind to a world of possibilities: 🤯

spd-say -y 'Portuguese (Brazil)+Storm' "$(date +'%A, %d de %B, %H e %M')"
Manual: :debian: https://manpages.debian.org/sid/spd-say

It is part of the speech-dispatcher package, which is often already installed.

🕰️ You could put something like this in the crontab to run every hour from 9 to 18, Monday to Friday, or at least to announce the end of the working day.
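
The hourly schedule mentioned here could be sketched roughly as follows. This is only a sketch under assumptions: the function names `say_time` and `cron_line` are hypothetical, the voice from the command in the post is assumed to be installed, and cron is assumed to be able to reach the user's audio session (which may need extra environment setup on some systems). The body of `say_time` is meant to live in its own script file, so the `%` characters of the date format never appear in the crontab itself, where an unescaped `%` is treated as a newline.

```shell
#!/bin/sh
# say_time: speak the current time, using the spd-say command from the post.
say_time() {
    spd-say -y 'Portuguese (Brazil)+Storm' "$(date +'%A, %d de %B, %H e %M')"
}

# cron_line SCRIPT: print a crontab entry that runs SCRIPT at minute 0 of
# every hour from 09:00 to 18:00, Monday (1) to Friday (5).
# The five fields are: minute, hour, day of month, month, day of week.
cron_line() {
    printf '0 9-18 * * 1-5 %s\n' "$1"
}
```

For example, after saving the `spd-say` line as an executable script (say `$HOME/bin/say-time.sh`, a made-up path), the output of `cron_line "$HOME/bin/say-time.sh"` can be pasted into `crontab -e`. To announce only the end of the working day, the hour field would be `18` instead of `9-18`.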

💡 I have scripts that already send notifications to the screen (notify-send). For an important event, they could check whether spd-say is available and have it speak the notification as well. That could help me when I'm not paying attention to the screen. 🤔
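
The "check whether spd-say is available" idea might look something like this. It is a minimal sketch, not the author's actual script: the function name `notify` is hypothetical, and the default voice configured in speech-dispatcher is assumed to be acceptable (no `-y` voice is passed).

```shell
#!/bin/sh
# notify MESSAGE: show a desktop notification and, when speech-dispatcher's
# spd-say client is installed, also speak the message aloud.
notify() {
    message=$1
    # Visual notification, only if notify-send is on PATH.
    if command -v notify-send >/dev/null 2>&1; then
        notify-send "$message"
    fi
    # Spoken notification, only if spd-say is on PATH.
    if command -v spd-say >/dev/null 2>&1; then
        spd-say "$message"
    fi
}
```

A script would then call, for instance, `notify "Backup finished"`. On a machine without speech-dispatcher, the call degrades to the on-screen notification alone.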

#GNU #TTS #TextToSpeech #shell #Xfce #GNOME #GTK #GNUlinux #SoftwareLivre

Favorite thing lately is finding an article I wish were in podcast form, saving the text to a .txt file, having TTS Util use RHVoice to convert the file into an audio reading, and then listening to my own little robotic FOSS nanny read me the stories I want to hear in my headphones as I do yardwork.
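
On a Linux desktop, the same text-file-to-audio workflow could be approximated as below. Note the substitution: this sketch uses espeak-ng rather than the TTS Util and RHVoice combination named in the post, and the function name `txt_to_wav` is hypothetical. It relies only on espeak-ng's `-f` (read text from a file) and `-w` (write a WAV file instead of playing) options.

```shell
#!/bin/sh
# txt_to_wav INPUT.txt OUTPUT.wav: render a plain-text file to a WAV file.
txt_to_wav() {
    # Refuse to run without a readable input file.
    if [ ! -r "$1" ]; then
        echo "cannot read input file: $1" >&2
        return 1
    fi
    # espeak-ng is a stand-in engine here, not the one used in the post.
    if ! command -v espeak-ng >/dev/null 2>&1; then
        echo "espeak-ng is not installed" >&2
        return 2
    fi
    espeak-ng -f "$1" -w "$2"
}
```

Usage would be along the lines of `txt_to_wav article.txt article.wav`, after which the WAV file can be copied to a phone or player.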