On my samsung phone, the voice recorder app has two additional options:
- speech to text
Both of these rely on the app being able to convert voice to text. The latter of the two probably cant be done in this app as the phone app uses the multiple microphones to "locate" the people.
But the first is something that could be done with Mozilla's deep speech. There is a g-streamer plugin that may be useful for this: https://github.com/Elleo/gst-deepspeech
- How to enable captioning/transcription
- Where to save the text. Can it be saved as metadata in the audio file?
- Add transcription