OCR enabled yet text in PDF not found

Operating system

Windows

Joplin version

2.14.10

Desktop version info

Joplin 2.14.10 (prod, win32)

Client ID: 426797a29eb04d969676e6832c96ee4c
Sync Version: 3
Profile Version: 45
Keychain Supported: Yes

Revision: 3ed6ad5

Simple Backup: 1.3.6

What issue do you have?

I'm looking to use the new OCR feature in #8795.

I enabled the OCR checkbox in Options.

Text in PDFs are not found.

I have imported ENEX file as HTML and as Markdown and neither method seems to make the imported PDFs searchable. There are 19 notes of which 5 have PDFs and there are maybe 25 JPG images.

I can provide a sample PDF if you need it.

Yes if you could provide a sample PDF that would help. Also please note that the first time OCR runs it takes time to process all the documents, so it may be that it's running in the background but hasn't finished yet.

I waited 12 hours or more and now it is searchable. I had already waited 1 hour before I tested it. I'm surprised it took so long with so few attachments. I have a massive set of Evernote notebooks to import. I'm wondering how long that will take to index. Is there anyway to check progress, completion or how long it actually took?

1 Like

This topic was automatically closed 30 days after the last reply. New replies are no longer allowed.