Note taking and reading with speech to text and text to speech.
Speech Note let you create and read notes using your voice. It converts speech to text and text to speech with only off-line processing. It supports many languages thanks to integration with following STT/TTS engines: Coqui STT (Mozilla DeepSpeech), Vosk, Whisper, Piper, RHVoice, eSpeak-NG, MBROLA.
All voice analysis is entirely done locally on the device. Internet connection is only required for models download during app initial configuration. Speech Note respects your privacy and does not send any data to the Internet.
Speech Note supports extensive number language models. Some of them give very good accuracy, but some are not perfect. All models can be downloaded directly from the app.
A detailed list of supported languages is here.
Limitations:
Any comments, ideas, translations, issue reports are highly appreciated.
Translations (both Speech Note and Speech Keyboard):
All translations are very welcome. There are three ways to contribute:
- [preferred] Transifex project
- Direct github pull request
- Translation file sent to me via e-mail: dsnote@mkiol.net
Source code: https://github.com/mkiol/dsnote
Bugs, Feature requests: https://github.com/mkiol/dsnote/issues or just email: dsnote@mkiol.net
Attachment | Size | Date |
---|---|---|
![]() | 1.27 MB | 17/11/2021 - 10:00 |
![]() | 1.34 MB | 17/11/2021 - 19:28 |
![]() | 1.39 MB | 09/12/2021 - 21:32 |
![]() | 1.31 MB | 09/12/2021 - 21:32 |
![]() | 1.31 MB | 10/12/2021 - 20:52 |
![]() | 1.39 MB | 10/12/2021 - 20:52 |
![]() | 1.44 MB | 02/04/2022 - 19:40 |
![]() | 1.36 MB | 02/04/2022 - 19:40 |
![]() | 6.19 MB | 07/04/2023 - 18:03 |
![]() | 7.31 MB | 07/04/2023 - 18:03 |
![]() | 6.7 MB | 15/04/2023 - 16:58 |
![]() | 7.86 MB | 15/04/2023 - 16:58 |
![]() | 92.81 MB | 22/05/2023 - 16:43 |
![]() | 22.16 MB | 22/05/2023 - 16:43 |
3.0.0
To read more details check About->Changes in the app.
2.0.1
2.0.0
1.8.0
=> I would be very grateful for any feedback how good speech transcription is for individual models.
1.6.1
1.6.0
1.5.1
1.5.0
1.4.0
1.3.0
1.2.0
1.0.1
Comments
PamNor
Sat, 2023/04/08 - 14:09
Permalink
Can't find Norwegian download in settings.
Jolla C.
mkiol
Sat, 2023/04/08 - 17:13
Permalink
Unfortunately Norwegian is provided only by Whisper model and all Whisper models are disabled on ARM7 devices (like Jolla C). Whisper requires a lot of computation power and this old CPU can't handle it. Sorry.
eson
Fri, 2023/04/07 - 18:53
Permalink
Great upgrade! Thanks for the Swedish speech models. Much appreciated.
articice
Mon, 2022/04/04 - 23:56
Permalink
Fatal error: the to be installed harbour-dsnote-1.8.0-1.armv7hl require
s 'qt5-qtmultimedia-plugin-mediaservice-gstaudiodecoder'
Looks like there's no gstaudiodecoder for qt5-qtmultimedia-5.6.2+git31-1.12.1 in Vanha Rauma
mkiol
Wed, 2022/04/06 - 14:17
Permalink
On which device you are installing? This package should be available on SFOS 4.4 as well.
At least it is available on Jolla C:
articice
Wed, 2022/05/04 - 00:48
Permalink
It's Xperia 10 Plus.
Perhaps this issue only applies to aarch64.
unsocialcortex
Sun, 2022/04/03 - 23:13
Permalink
Re 1.6.1 patchnotes:
just tested this wonderful app out for a while and "Deutsch (Aashish Agarwal)" seems very inferior to "Deutsch (Jaco)". tried some normal conversation aswell as nicely read out sentences using my xa2 for both and alot more words just got completly garbled or left out with "Aashish Agarwal".
mkiol
Tue, 2022/04/05 - 10:21
Permalink
Thank you so much for the feedback. Would you be able to evaluate "Deutsch (med)" as well? This model is available in version 1.8.0.
unsocialcortex
Tue, 2022/04/05 - 21:14
Permalink
so im no doctor or anything but i tested "med" a bit using some medical vocabulary and excerpts from german medical journals. "jaco" always gets more in general from sentences. for the medical terms they miss words or get them wrong regularly but "jaco" gets closer in my experience by doing *something* instead of nothing in some cases.
all in all german deepspeech is obviously nowhere near english but its not bad for normal people conversation
JayJay
Fri, 2022/01/21 - 22:14
Permalink
Real nice work! The app is really cool. Is there any option to customize the vocabulary (i would need german medical language with drug recognition and medical vocabulary... is there maybe a file i can download or buy? If not... That would be an awesome new feature if i could add new vocabulary myself :-)
rdomschk
Sun, 2021/12/12 - 08:27
Permalink
Perfect Work! A big Thank You from me...
inta
Wed, 2021/11/17 - 21:30
Permalink
Thanks for the great work, now it runs on arm64 and it works really well. :)
inta
Wed, 2021/11/17 - 18:55
Permalink
Languages still do not load here. Is there anything I have to clean up? I removed the settings folder from .config and the models dir inside Downloads.
mkiol
Wed, 2021/11/17 - 19:33
Permalink
Sorry, silly me. I forgot to upload 1.5.1 package for aarch64. It should be available in a moment.
inta
Tue, 2021/11/16 - 23:04
Permalink
The app does not "hang" anymore on startup and uninstall works, but the language list in the settings is empty (Xperia 10 II), so I can not choose a model to get started with.
mkiol
Wed, 2021/11/17 - 10:02
Permalink
Fixed in 1.5.1. Would be grateful for check if problem is resolved. Thanks.
mkiol
Wed, 2021/11/17 - 01:02
Permalink
Oh dear. I know what is wrong. I will fix it tomorrow.
inta
Tue, 2021/11/16 - 02:08
Permalink
@robthebold 10 II, so @mkiol could be right that this is an arm64 issue. Never mind, force uninstall worked and I'll try it again if you need someone to test it.
dubliner
Tue, 2021/11/16 - 01:03
Permalink
While version 1.3 worked flawlessly under SFOS 3.4, it seems the new version 1.4 runs into a problem. All I get is "Language is not configured". When I open the settings, there are "no languages", nothing is displayed.
Curiously, the old "Downloads/DeepSpeech models" directory was still there, populated with "de.scorer de.tflite en.scorer en.tflite". Pointing the "Location on language files" to that directory does not make any difference.
I also tried deleting "Downloads/DeepSpeech models" as well as ".config/harbour-dsnote" to get a fresh start. Unexpectedly, that ".config/harbour-dsnote" is not re-created after starting DeepSpeech Note.
Starting from the CLI I receive this output:
Any help would be appreciated, especially since I really love this application!
dubliner
Tue, 2021/11/16 - 01:22
Permalink
Update: When I copied ".config/harbour-dsnote" and ".local/share/harbour-dsnote" as well as "Downloads/DeepSpeech models" from another phone running SFOS 4.2 it works!!! Yay!
Not sure, though, why the ".local/share/harbour-dsnote" directory was not created and populated on the first try?!
P.S. Now Speech keyboard is not working on the SFOS 3.4 phone. I get the logo (three vertical lines) with strikethrough symbol.
robthebold
Mon, 2021/11/15 - 22:18
Permalink
I installed this on my Xperia 10 II, can't seem to make it work . . . When I start the app, I see an error "Unable to start service" pop up. As I'd expect for this error, speech recognition doesn't work, and when I go to Settings, there are no languages to choose from.
I was going to uninstall and reinstall the app, but Storeman can't uninstall it and when I try to uninstall from terminal a "scriptlet" fails, saying it can't stop the service because it isn't running and uninstalling fails.
I've also tried starting the service manually from the terminal but that didn't work. I'm not totally sure I did that right, though: as root I tried "systemctl start harbour-dsnote.service" and "systemctl start --user harbour-dsnote.service" and fails with message "Unit harbour-dsnote.service not found."
"rpm -rl harbour-dsnote" led me to check to make sure /usr/lib64/systemd/user/harbour-dsnote/ exists, and it does.
Any ideas on how I can fix this or debug further? If more details are needed I can find my glasses and copy/paste stuff from terminal
mkiol
Mon, 2021/11/15 - 23:45
Permalink
I'm sorry for this mess. Most likely something is wrong with arm64 package. To be honest, I did not test it because I don't have any arm64 device yet.
To force uninstall run following in a terminal:
I will investigate what went wrong tomorrow. Sorrrry.
inta
Mon, 2021/11/15 - 20:59
Permalink
I tried to install this app and the keyboard app, but the list of languages in the settings is empty. I cannot remove this app, it fails with the message that the service is not running. Any idea how to fix that?
robthebold
Mon, 2021/11/15 - 22:20
Permalink
I didn't realize you posted this issue before me -- I'm getting the same problem. What device are you using?
PamNor
Sun, 2021/11/14 - 22:33
Permalink
@mkoli. I'll continue search for Norwegian *.tflite file. Keep up your good work.
PamNor
Sun, 2021/11/14 - 19:19
Permalink
Is there a possibility to get speech model for Norwegian language?
https://www.google.com/url?sa=t&source=web&cd=&ved=2ahUKEwi53uqAnpj0AhVC...
mkiol
Sun, 2021/11/14 - 19:57
Permalink
I really would like to add such support but unfortunately I wasn't able to find any DeepSpeech model for Norwegian (usually file with *.tflite extension) :(
lispy
Sun, 2021/09/19 - 22:57
Permalink
A big Thankyou for the Transcribe Audio File feature. Made my day!!!
eson
Sun, 2021/09/19 - 22:29
Permalink
How about more language models, Swedish in perticular? ;)
mkiol
Fri, 2021/10/01 - 21:16
Permalink
I've tried but unfortunately I didn't find any available DeepSpeech model for Swedish. If you find one I will be pleased to add it.
Pages