OCR enabled yet text in PDF not found

robe070 · 23 January 2024 09:06

Operating system

Windows

Joplin version

2.14.10

Desktop version info

Joplin 2.14.10 (prod, win32)

Client ID: 426797a29eb04d969676e6832c96ee4c
Sync Version: 3
Profile Version: 45
Keychain Supported: Yes

Revision: 3ed6ad5

Simple Backup: 1.3.6

What issue do you have?

I'm looking to use the new OCR feature in #8795.

I enabled the OCR checkbox in Options.

Text in PDFs are not found.

I have imported ENEX file as HTML and as Markdown and neither method seems to make the imported PDFs searchable. There are 19 notes of which 5 have PDFs and there are maybe 25 JPG images.

I can provide a sample PDF if you need it.

laurent · 23 January 2024 10:22

Yes if you could provide a sample PDF that would help. Also please note that the first time OCR runs it takes time to process all the documents, so it may be that it's running in the background but hasn't finished yet.

robe070 · 23 January 2024 23:26

I waited 12 hours or more and now it is searchable. I had already waited 1 hour before I tested it. I'm surprised it took so long with so few attachments. I have a massive set of Evernote notebooks to import. I'm wondering how long that will take to index. Is there anyway to check progress, completion or how long it actually took?

Topic		Replies	Views
Search not finding OCR text Support	0	65	10 January 2026
[Beta Test] Joplin 2.14.12 - Cannot OCR without a sync target set? Beta Testing	4	333	9 February 2024
Search inside some Text-only PDFs does not work? Support	0	28	22 April 2026
Search for attachments via OCR text doesn't work on any platform Support	1	98	14 July 2025
Fixing OCR search issues after migrating notes from Evernote Support	3	61	13 June 2026

OCR enabled yet text in PDF not found

Operating system

Joplin version

Desktop version info

What issue do you have?

Related topics