Searching attached files

101lols · 15 February 2020 11:36

The search feature would be greatly enhanced if attached files like PDFs were also searched.

dae · 31 August 2020 16:10

This seems like a critical feature.

Gnopps · 30 October 2020 20:56

This is a feature I'm missing a lot as well. I'm not talking about OCR, just about Joplin searching whatever text is already present in an attached PDF document.

Nick1 · 1 January 2021 13:43

Searching just the PDF index would already be a great improvement. This is probably simpler than a full PDF search?

ajay · 1 January 2021 14:02

As far as I can tell, Joplin does not even search inside an attached plain text file. (I tried simple search and "goto anything" on desktop v1.5.11, no luck.)
That would be the first step, don't you think?

JackGruber · 2 January 2021 10:03

For such an action, the file content (extracted text) would have to be included in the database / metadata. Only then would such a search be possible.
Laurent would have to comment on something like this, whether such a function would be implemented or not.

ajay · 2 January 2021 13:18

It is my understanding that Joplin (by default, with simple drag-n-drop) adds attached files to its DB, not just a link to an external file. In this case it's a matter of search scope, not changes to the DB.
PS: You can hold opt/alt while dragging a file into your note; only those files wouldn't be searchable.

JackGruber · 2 January 2021 13:27

No, only the metadata. The file itself is in the resource folder. But for a performant search, the text must be pre-extracted.
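
To illustrate the pre-extraction idea, here is a minimal sketch using SQLite from Python. The `resource_text` table and the function names are hypothetical and are not Joplin's actual schema; the sketch only shows that once the extracted text sits in the database next to the resource ID, a search becomes a plain query.

```python
import sqlite3


def open_db():
    """Create an in-memory database with a hypothetical table for extracted text."""
    conn = sqlite3.connect(":memory:")
    # Assumption: this table is illustrative only, not part of Joplin's real schema.
    conn.execute(
        "CREATE TABLE resource_text ("
        "resource_id TEXT PRIMARY KEY, content TEXT NOT NULL)"
    )
    return conn


def store_extracted_text(conn, resource_id, content):
    """Insert or update the text that was extracted from one attachment."""
    conn.execute(
        "INSERT OR REPLACE INTO resource_text (resource_id, content) VALUES (?, ?)",
        (resource_id, content),
    )


def search_resources(conn, term):
    """Return the IDs of attachments whose extracted text contains the term."""
    # Escape LIKE wildcards so the term is matched literally.
    escaped = term.replace("\\", "\\\\").replace("%", "\\%").replace("_", "\\_")
    rows = conn.execute(
        "SELECT resource_id FROM resource_text "
        "WHERE content LIKE ? ESCAPE '\\' ORDER BY resource_id",
        (f"%{escaped}%",),
    )
    return [row[0] for row in rows]
```

A substring match like this scans every row, so a real implementation would more likely use a full-text index; the point here is only where the extracted text has to live.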

Nick1 · 2 January 2021 15:07

There are many open-source document search engines, for instance http://docfetcher.sourceforge.net/en/index.html.
Maybe part of this code could be used to build an index for all attachments, updated whenever an attachment changes. The Joplin search engine could then check the index for a specific GUID against the search expression.
The indexer could be an independent piece of code, either fully standalone or triggered by a Joplin API AttachmentChangeEvent. It could be written by an independent developer and extended whenever a new doc type is required, without affecting Joplin.
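
A rough sketch of such a standalone indexer, in Python. Everything here is an assumption for illustration: the `AttachmentIndex` class, the `on_attachment_changed` handler (standing in for whatever a change event would call), and the idea that text has already been extracted from the attachment by some other tool.

```python
import re
from collections import defaultdict


class AttachmentIndex:
    """Toy inverted index mapping each word to the attachment IDs containing it."""

    def __init__(self):
        self._postings = defaultdict(set)  # word -> set of attachment IDs
        self._words_by_id = {}             # attachment ID -> words indexed for it

    @staticmethod
    def _tokenize(text):
        # Lowercase and split on non-word characters.
        return set(re.findall(r"\w+", text.lower()))

    def remove(self, attachment_id):
        """Drop every index entry for one attachment."""
        for word in self._words_by_id.pop(attachment_id, set()):
            self._postings[word].discard(attachment_id)
            if not self._postings[word]:
                del self._postings[word]

    def on_attachment_changed(self, attachment_id, extracted_text):
        """Hypothetical change handler: re-index the attachment from scratch."""
        self.remove(attachment_id)
        words = self._tokenize(extracted_text)
        self._words_by_id[attachment_id] = words
        for word in words:
            self._postings[word].add(attachment_id)

    def search(self, query):
        """Return the IDs of attachments that contain every word in the query."""
        words = self._tokenize(query)
        if not words:
            return set()
        result = None
        for word in words:
            ids = self._postings.get(word, set())
            result = set(ids) if result is None else result & ids
        return result
```

Support for a new document type would then only mean adding another text extractor in front of `on_attachment_changed`; the index itself stays unchanged.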

MWmC · 19 March 2021 17:02

I would like to vote for this feature as well. I create and capture a large number of notes, but I have many others that are lightly noted and tagged but which are used to store important documents as PDFs. If I could search in those, as I do in Evernote, it would be very helpful.

Still getting used to Joplin, but so far I'm finding it to be amazing. Congrats to everyone contributing to this FOSS project.

tannenzaepfle · 7 November 2021 11:02

It's been a long time now, but is there any news?
Hartmut

mzguy · 14 November 2021 15:22

There's this option:

copypaper · 16 December 2021 18:46

This would be huge, and would allow me to fully replace other tools with Joplin.

For me, only PDF text search is needed. OCR or image search is nice to have, but I can do that ahead of time if needed.

One workaround that could be implemented is a tool or human process that extracts the text from a PDF and dumps it into a note with the PDF attached. I have seen this option work well for paperless orgs pre-Evernote. This could even potentially be automated via the API.

This doesn't help people who already have a ton of PDFs in Joplin, of course.
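
The workaround above could be automated roughly like this. It is a sketch under several assumptions: Joplin's local Data API (the Web Clipper service) is enabled on its default port 41184, `token` is the API token from the Web Clipper settings, the PDF has already been uploaded as a resource so that its ID is known, and the text has already been extracted with an external tool. The function names are made up for the example.

```python
import json
import urllib.parse
import urllib.request

# Assumption: the Web Clipper service is running locally on its default port.
JOPLIN_URL = "http://localhost:41184"


def build_note(title, resource_id, file_name, extracted_text):
    """Build a note that links the attached PDF and carries its text in the body."""
    # Joplin links resources in Markdown as [name](:/resource_id).
    body = f"[{file_name}](:/{resource_id})\n\n{extracted_text.strip()}\n"
    return {"title": title, "body": body}


def build_request(note, token, base_url=JOPLIN_URL):
    """Prepare the POST /notes request without sending it."""
    query = urllib.parse.urlencode({"token": token})
    return urllib.request.Request(
        f"{base_url}/notes?{query}",
        data=json.dumps(note).encode("utf-8"),
        method="POST",
        headers={"Content-Type": "application/json"},
    )


def create_note(note, token):
    """Send the request to a running Joplin instance and return its JSON reply."""
    with urllib.request.urlopen(build_request(note, token)) as response:
        return json.load(response)
```

Because the extracted text ends up in the note body, the normal note search finds it without any change to Joplin itself.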

wexsoft · 7 June 2023 09:34

Searching PDFs would be a must-have requirement for me to be able to replace Evernote completely.

johano · 7 June 2023 14:22

@wexsoft, have a look at the Resource Search Plugin. When installed, it is available under the Tools > Resource Search menu.

It can search through all text-based PDFs (it does not OCR images) in your notes. It is very nicely implemented by @roman_r_m (edited: I had attributed it to the wrong developer)

wexsoft · 7 June 2023 14:36

Thanks very much for your reply @johano. I had seen a link to that plugin earlier in the conversation, but wasn't sure if it was functional. I've just installed it now and given it a try and it does seem to work very well.

rxliuli · 7 June 2023 15:05

What do you mean? I haven't implemented anything like that. Maybe there will be some attempts later, but not now.

johano · 7 June 2023 16:13

Apologies, @rxliuli, the author of the plugin was in fact @roman_r_m, my mistake ¯_(ツ)_/¯
