File Uploader and OCR

bitbacchus · 12 November 2019 14:46

@kellerjustin thanks for publishing your uploader! I'm currently giving it a try, and so far, it is very cool.

I have some questions, mainly regarding OCR - which is not part of your code, but maybe you have some hints for me

OCR in PDF seems to work on the first page, only. Is this intentional or a bug?
OCR appears to be more reliable with English texts. I have installed tesseract-ocr-deu for German text recognition, but it seems not to improve OCR when used with the file uploader. Do you know whether tesseract needs to "know" the language before OCR?
OCR of handwritten text is not very good. You are not, by any chance, aware of a way to get better handwriting recognition?

I would appreciate it if you had any ideas on my OCR questions.

Cheers,
Sebastian

Topic		Replies	Views
Plugin: offline OCR (extract text from images, pdf, videos, etc) Plugins	48	8658	2 October 2023
OCR in Joplin (How to) Support	22	6446	23 March 2024
OCR for existing Joplin notes Apps	17	4656	12 April 2021
OCR selectively Features	0	110	14 November 2024
GSoC Idea - OCR Support Features gsoc-2020	18	2778	1 August 2024