I tried enabling OCR a few months ago, but couldn't live with the initial issues. I think the process may have crashed, since I have thousands of images / PDFs.
I've been using this app for OCR and copying automatically into Joplin for years now:
It has worked well, but I feel it won't work forever and I may want to switch to the native implementation. To do that without wasting resources, and to reduce the initial OCR time, it would be fantastic to have an option to enable OCR selectively. I can imagine two possible enhancements:
- OCR only notes added after the feature is turned on.
- OCR only notes with a particular tag (the tag could then optionally be removed once the note has been processed).
This would solve a lot of problems. It would also enable better control for cases where one simply doesn't want to OCR something.
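To make the idea a bit more concrete, here is a rough sketch of the selection logic I have in mind. It is only an illustration in Python, not how Joplin actually works: `Resource`, `OcrSelection`, `should_ocr`, and all of their fields are hypothetical names I made up, and I am assuming each attachment knows its creation time and the tags of its parent note.

```python
from dataclasses import dataclass, field
from typing import Optional, Set


@dataclass
class Resource:
    """Hypothetical stand-in for an image/PDF attachment (not a Joplin type)."""
    id: str
    created_time: int  # milliseconds since epoch (assumed)
    note_tags: Set[str] = field(default_factory=set)  # tags of the parent note (assumed)


@dataclass
class OcrSelection:
    """Hypothetical settings for selective OCR; None means 'no restriction'."""
    enabled_since: Optional[int] = None  # only OCR items created at or after this time
    required_tag: Optional[str] = None   # only OCR items whose note carries this tag


def should_ocr(resource: Resource, selection: OcrSelection) -> bool:
    """Return True if the resource passes every filter that is switched on."""
    if selection.enabled_since is not None and resource.created_time < selection.enabled_since:
        return False
    if selection.required_tag is not None and selection.required_tag not in resource.note_tags:
        return False
    return True


# Example: one old attachment tagged "ocr", one new untagged attachment.
old_tagged = Resource("a", created_time=1000, note_tags={"ocr"})
new_untagged = Resource("b", created_time=5000)

only_new = OcrSelection(enabled_since=2000)
only_tagged = OcrSelection(required_tag="ocr")

queue_new = [r.id for r in (old_tagged, new_untagged) if should_ocr(r, only_new)]
queue_tagged = [r.id for r in (old_tagged, new_untagged) if should_ocr(r, only_tagged)]
```

With both options left unset, everything would be queued, which matches the current all-or-nothing behaviour as I understand it.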
Thoughts? Also, if anyone has successfully enabled native OCR on a very large database with tens of thousands of images and PDFs, like mine, it would be great to hear how!