Thanks for the reply!
Like the reference to the REST uploader, although I also use the Joplin REST API (with curl as a client, both uploading on creation and "syncing" by uploading the temp files) I can integrate the uploader into my workflow or borrow from it.

Concerning whisper, yes, I have been passively monitoring the state-of-the-art in the field for years (e.g. in the early 2000s using the MS Speech SDK to input formulae in Mathematica; I know, a real dinosaur here) and IMHO, with the transformer models it has now reached critical "offline" mass. I use an AMD A10 APU (circa 2012) with the 'tiny' or 'base' English only (I think whisper is very well versed in German too) models (~300 to 500 MB in memory) and I find it more than acceptable. Using an AMD Ryzen with 6 cores makes it essentially trivial, highly accurate task. Give it a try, looking at your Joppy, I think it would be a simple matter for you to quickly get all the parts up and running, plus I will get a knowledgeable and sympathetic beta tester:-)

1 Like