Ability to Tag Text within PDF & Utilization

I have been looking for the answer to this for a year. I would pay a thousand dollars, happily, for the perfect system that allowed me:

  1. to use an open source encrypted program
  2. data saved on my server/ personal cloud or even just able to sync with USB
  3. Web clipper
  4. HIGHLIGHT AND TAG text within a PDF that is embedded within the app/program (this is the piece I am not able to get)
  5. able to then use tags and/or highlights to mind map
  6. search function which searches all notes made and the complete text of pdf.

Basically a blend of MarginNote3/Remnote and Joplin. I JUST want to tag text in PDFs to build a personal data library and it seems the world does not want this to happen. lol I'm hoping and praying for you someday!

hey, welcome to the forum

What's your opinion on using OCR for the PDFs and highlight the text of significance by creating a new linked note?

This way you could build a robust linked searchable knowledge base.

Plugins to achieve that

Plugin: offline OCR (extract text from images, pdf, videos, etc)

Create note from highlighted text

New: Joplin Plugin Graph (knowledge graph visualization)

Other than that, there may be developed a plugin that would retrive "named destinations" links from a PDF. Those destination links could be used to navigate inside a PDF (url-to-file/file.pdf#nameddest=dest3) and additionally linked within the note structure of Joplin.

More on named destinations


[Offtopic] you have to tell me, how you built your linklist like that in your posting :star_struck:. What is the syntax/code like?[\Offtopic]

I'm not sure what's the linklist implied but in original post I meant that a user potentially can extract the text from a pdf, highlight it and link a separate note for each highlight.
Here's the example:

A note with pdf (let's call it NoteTOC) keeps text extract from the pdf. In mind map view NoteTOC represents a TOC of pdf highlights. Each highlight of text in NoteTOC has a child note (backlinked to NoteTOC).


mermaid src
graph TD
 subgraph NotebookPDF
 Highlights --> Child1(Child note1)
 Highlights --> Child2(Child note2)
 Highlights --> Child3(Child note3)
    subgraph NoteTOC
Pdf --> Text(Text from Pdf) --- Highlights


This way each highlight in pdf can be linked anywhere within the note structure (well, sharing externally is also possible). Also, you get the opportunity to comment each highlight in its own note.

**I meant this cool feature with the triangle:

Folded open:


In discourse

This text will be hidden

In Joplin

<details><summary>name of summary</summary>
This text will be hidden