Note taking and reading with speech to text and text to speech.
Speech Note let you create and read notes using your voice. It converts speech to text and text to speech with only off-line processing. It supports many languages thanks to integration with following STT/TTS engines: Coqui STT (Mozilla DeepSpeech), Vosk, Whisper, Piper, RHVoice, eSpeak-NG, MBROLA.
All voice analysis is entirely done locally on the device. Internet connection is only required for models download during app initial configuration. Speech Note respects your privacy and does not send any data to the Internet.
Speech Note supports extensive number language models. Some of them give very good accuracy, but some are not perfect. All models can be downloaded directly from the app.
A detailed list of supported languages is here.
Limitations:
Any comments, ideas, translations, issue reports are highly appreciated.
Translations (both Speech Note and Speech Keyboard):
All translations are very welcome. There are three ways to contribute:
- [preferred] Transifex project
- Direct github pull request
- Translation file sent to me via e-mail: dsnote@mkiol.net
Source code: https://github.com/mkiol/dsnote
Bugs, Feature requests: https://github.com/mkiol/dsnote/issues or just email: dsnote@mkiol.net
Attachment | Size | Date |
---|---|---|
![]() | 1.27 MB | 17/11/2021 - 10:00 |
![]() | 1.34 MB | 17/11/2021 - 19:28 |
![]() | 1.39 MB | 09/12/2021 - 21:32 |
![]() | 1.31 MB | 09/12/2021 - 21:32 |
![]() | 1.31 MB | 10/12/2021 - 20:52 |
![]() | 1.39 MB | 10/12/2021 - 20:52 |
![]() | 1.44 MB | 02/04/2022 - 19:40 |
![]() | 1.36 MB | 02/04/2022 - 19:40 |
![]() | 6.19 MB | 07/04/2023 - 18:03 |
![]() | 7.31 MB | 07/04/2023 - 18:03 |
![]() | 6.7 MB | 15/04/2023 - 16:58 |
![]() | 7.86 MB | 15/04/2023 - 16:58 |
![]() | 92.81 MB | 22/05/2023 - 16:43 |
![]() | 22.16 MB | 22/05/2023 - 16:43 |
3.0.0
To read more details check About->Changes in the app.
2.0.1
2.0.0
1.8.0
=> I would be very grateful for any feedback how good speech transcription is for individual models.
1.6.1
1.6.0
1.5.1
1.5.0
1.4.0
1.3.0
1.2.0
1.0.1
Comments
eson
Sat, 2021/10/02 - 23:00
Permalink
Well, knowing exactly nothing about the matter, I found these links on the net. Maybe you've already seen them or they are totally useless?
https://github.com/AlexandrosFerles/Swedish-Language-Automatic-Speech-Re...
https://github.com/se-asr/model
https://medium.com/@klintcho/creating-an-open-speech-recognition-dataset...
Thanks anyway for your good work as allways!
mkiol
Mon, 2021/11/15 - 18:47
Permalink
Sorry for the late reply. Indeed this project provides model for Swedish. Unfortunately it was trained for older version of DeepSpeech and therefore it is not compatible. Sadly, there is no simple way to convert it to new one. The only solution is to repeat the training, which is possible but requires access to source material (voice samples) and significant computing power.
defactofactotum
Sun, 2021/09/19 - 14:07
Permalink
Now working on pinephone with sfos4.2. But the microphone disconnects after every input.
Fuchur
Sat, 2021/09/04 - 21:15
Permalink
It really is working very well and a very nice app.
One thing I really would love to see is to be have a button on the keyboard or an own keyboard layout which would include it to the keyboard input.
That would just be great :).
lispy
Fri, 2021/06/04 - 20:12
Permalink
Really works. I like it. My wife has to convert a huge audiofile to text but pushing the button for an hour sadly doesn't cut it for her. Can you imagine an audiofile import of sorts? Or maybe make the button sticky?
mkiol
Sat, 2021/09/18 - 20:38
Permalink
There are to modes (Settings->"Speech detection mode"). In "Automatic" mode, you don't have to hold the button. App will (in most cases ;-) automaticaly detect that speaking begins.
defactofactotum
Mon, 2021/05/03 - 14:51
Permalink
Thanks for the keyboard fix! It still doesn't work on my pinephone - it worked briefly in Italian but with very bad recognition, then stopped again. Another suggestion: would it be possible to add words to the database? I imagine this is probably a huge and complicated task....
defactofactotum
Tue, 2021/04/27 - 11:22
Permalink
Also does not work on pinephone. Suggestion for keyboard behaviour: at the moment it's possible to edit text in the middle of a line but after typing one letter the cursor snaps back to end of line. When an entire word is wrong this is very laborious.
mkiol
Thu, 2021/04/29 - 11:33
Permalink
Thank for suggestion.
In the meantime, I've managed to fix Jolla 1, Jolla C and PinePhone issue. Moreover with alpha version of DeepSpeech accuracy of recognition is much improved. Stay tuned for next release :)
ichthyosaurus
Fri, 2021/04/23 - 21:20
Permalink
This looks very promising - I suggest that you ask for it to be included in the next community news :)!
mkiol
Fri, 2021/04/23 - 20:14
Permalink
Unfortunately app does not work on Jolla C (and most likely on other older devices). Sorry :(
sashikknox
Fri, 2021/04/23 - 13:42
Permalink
Cool, start testing... Too long time while download soeech model
mkiol
Fri, 2021/04/23 - 20:10
Permalink
Indeed, download time might be long. Model size for english is almost 1GB.
Pelzlurch
Fri, 2021/04/23 - 10:07
Permalink
For a first version indeed quite polished. Recognition quality is not brillant but quite OK - and very cool - offline!
The only thing I noticed negatively is that there is no automatic line break.
defactofactotum
Fri, 2021/04/23 - 13:29
Permalink
Thanks for this! I haven't tried it much except for simple phrases but seems to work well. Maybe in the future you could make it easier to copy lines of text to other apps.
oops just noticed now you can copy text on the pulley menu!
I dictated this with speech note....
mkiol
Fri, 2021/04/23 - 20:11
Permalink
nice :D
ziellos
Thu, 2021/04/22 - 23:31
Permalink
Thanks a lot! Had no chance to really test speech recognition, but your app looks already very polished.
Pages