Note taking, reading and translating with offline Speech to Text, Text to Speech and Machine Translation.
Speech Note lets you take, read and translate notes in multiple languages. It uses Speech to Text, Text to Speech and Machine Translation to do so. Text and voice processing take place entirely offline, locally on your phone, without using a network connection.
Your privacy is always respected. No data is sent to the Internet!
Speech Note uses many different processing engines to do its job. Currently these are used:
Speech Note supports an extensive number of language models. Some of them give very good accuracy, but some are not perfect. All models can be downloaded directly from the app.
A detailed list of supported languages is here.
If you are looking for a similar app for the Linux desktop, check out Speech Note, available on Flathub (video demo).
Limitations:
Any comments, ideas, translations and issue reports are highly appreciated.
Translations (both Speech Note and Speech Keyboard):
All translations are very welcome. There are three ways to contribute:
- [preferred] Transifex project
- Direct GitHub pull request or GitLab merge request
- Translation file sent to me via e-mail: dsnote@mkiol.net
Source code: https://github.com/mkiol/dsnote or https://gitlab.com/mkiol/dsnote
Bugs, Feature requests: https://github.com/mkiol/dsnote/issues or https://gitlab.com/mkiol/dsnote/-/issues or just email: dsnote@mkiol.net
4.7.0
4.6.1
4.6.0
4.5.0
4.4.0
4.3.0
4.2.0
4.1.0
4.0.0
For more details, check About->Changes in the app.
Comments
legar
Sat, 2024/11/16 - 15:09
Great software. I use it on MX Linux and the Flatpak version worked at once! I wonder if there is a way, a code or something, to make a sound indicating that a number in parentheses cites a book when doing text to speech. Thanks a lot.
Malakay
Thu, 2023/08/24 - 23:01
Now i use vosk small
Malakay
Mon, 2023/08/07 - 22:18
I upgraded to the latest version and it seems quite a bit faster than before, but there are still quite many typos and one new thing: sometimes [unk] appears in the written text.
mkiol
Tue, 2023/08/08 - 09:18
Which exact Speech to Text model are you testing?
Just a general remark from my observations. Sadly, STT works fine only for English right now. For any other language Whisper provides decent accuracy but it is also veeery slow :/
Malakay
Sun, 2023/09/03 - 10:26
So I solved it finally - I installed back version 1.8.0 and keyboard 1.3.0 and it works flawlessly again. Model Commodoro CS.
Malakay
Mon, 2023/08/07 - 22:08
So what should i do?
Malakay
Sat, 2023/07/08 - 23:18
And whisper small, but i tested all of them and it seemed +- the same on all of them. Maybe you could obtain that old one used in 1.6-1.8 as another one for testing purposes?
mkiol
Thu, 2023/07/13 - 16:48
Actually, I don't think that any Whisper model is usable on ARM32. The smallest is 'Tiny' and it might work but will be very slow. Whisper has value only on ARM64 :/
Malakay
Sat, 2023/07/08 - 23:16
I use Xperia X. Thanks, looking forward to the fix :)
Malakay
Thu, 2023/07/06 - 08:31
I can't help myself, but the older version (some 1.6 - 1.8 perhaps, on SFOS 3.4.0.24) worked much better for me. It recognized speech much faster and much better, almost without typos: what I said, it wrote. Nothing more, nothing less. This new version 3 seems slower to me and makes more typos.
mkiol
Thu, 2023/07/06 - 12:11
Which model are you testing, and on what device?
I have to admit I've also noticed a performance regression on ARM32 with DeepSpeech/Coqui models.
mkiol
Thu, 2023/07/06 - 16:56
I took measurements and you were perfectly right. Speech Note v2.x uses the new Coqui STT lib, which has much worse performance compared to the old one. On Xperia 10 (ARM32) it is 2x slower! I don't know why I missed it.
https://github.com/mkiol/dsnote/issues/11
Working on a fix...
TMavica
Sun, 2023/06/25 - 01:53
It works. Any way to add Cantonese?
mkiol
Mon, 2023/06/26 - 18:42
According to this, Cantonese should work decently on the Whisper 'Medium' model. Unfortunately, the 'Medium' model is disabled in the SFOS version because the phone's CPU is too weak to handle the processing (BTW, if you are a Linux user, I recommend checking out Speech Note for desktop).
I made a test with the 'Base' model and it looks like it can also transcribe to Cantonese, but honestly I can't say anything about accuracy. Did you try the Whisper 'Base' model with Cantonese speech? What was the result?
TMavica
Wed, 2023/07/05 - 12:48
Seems it is not working, unfortunately.
TMavica
Tue, 2023/06/20 - 04:40
One more question, my native language is traditional Chinese Cantonese, I am from HK, do u think it works??
mkiol
Tue, 2023/06/20 - 15:41
I think it works for Taiwanese Mandarin but probably not Cantonese. I may be mistaken. I just uploaded a new version, so you can verify it yourself. Please try the Whisper Base model.
TMavica
Tue, 2023/06/20 - 04:32
Ok thx
TMavica
Mon, 2023/06/19 - 11:04
Can u add traditional Chinese?
mkiol
Mon, 2023/06/19 - 21:13
Regarding STT, it looks like only the Whisper model produces text in traditional script, but... Whisper for Chinese doesn't work at all right now because of a bug in the code. I discovered this bug when I tried to answer your question, so a big Thank You :) I will fix it in the upcoming release (in 2 days).
The TTS Piper model reads Chinese regardless of the script. It accepts both traditional and simplified.
PamNor
Sat, 2023/04/08 - 14:09
Can't find Norwegian download in settings.
Jolla C.
mkiol
Sat, 2023/04/08 - 17:13
Unfortunately, Norwegian is provided only by the Whisper model, and all Whisper models are disabled on ARMv7 devices (like Jolla C). Whisper requires a lot of computation power and this old CPU can't handle it. Sorry.
eson
Fri, 2023/04/07 - 18:53
Great upgrade! Thanks for the Swedish speech models. Much appreciated.
articice
Mon, 2022/04/04 - 23:56
Fatal error: the to be installed harbour-dsnote-1.8.0-1.armv7hl requires 'qt5-qtmultimedia-plugin-mediaservice-gstaudiodecoder'
Looks like there's no gstaudiodecoder for qt5-qtmultimedia-5.6.2+git31-1.12.1 in Vanha Rauma
mkiol
Wed, 2022/04/06 - 14:17
On which device are you installing? This package should be available on SFOS 4.4 as well.
At least it is available on Jolla C:
articice
Wed, 2022/05/04 - 00:48
It's Xperia 10 Plus.
Perhaps this issue only applies to aarch64.
unsocialcortex
Sun, 2022/04/03 - 23:13
Re 1.6.1 patchnotes:
just tested this wonderful app out for a while and "Deutsch (Aashish Agarwal)" seems very inferior to "Deutsch (Jaco)". tried some normal conversation as well as nicely read out sentences using my xa2 for both and a lot more words just got completely garbled or left out with "Aashish Agarwal".
mkiol
Tue, 2022/04/05 - 10:21
Thank you so much for the feedback. Would you be able to evaluate "Deutsch (med)" as well? This model is available in version 1.8.0.
unsocialcortex
Tue, 2022/04/05 - 21:14
so im no doctor or anything but i tested "med" a bit using some medical vocabulary and excerpts from german medical journals. "jaco" always gets more in general from sentences. for the medical terms they miss words or get them wrong regularly but "jaco" gets closer in my experience by doing *something* instead of nothing in some cases.
all in all german deepspeech is obviously nowhere near english but its not bad for normal people conversation
JayJay
Fri, 2022/01/21 - 22:14
Real nice work! The app is really cool. Is there any option to customize the vocabulary? (I would need German medical language with drug recognition and medical vocabulary... is there maybe a file I can download or buy?) If not... that would be an awesome new feature if I could add new vocabulary myself :-)