File Uploader and OCR

Great to hear!
I hadn't seen that thread, very cool that @myfta put that how-to together, but also reinforces that I should do better on documentation :crazy_face:...
Thanks for the update and the kind words!

Please do make use of my text and notes if it helps with the documentation. Getting the OCR working was critical for me too.

1 Like

I released a new version of this yesterday. I wanted to get the API token file out of the python packaging space and into userspace, so please be advised that the new version of this will do just that, and may ask you to re-input the API token. The API token file will now be located in the following locations based on platform:
Windows: AppData/Roaming/rest_uploader
Linux: .local/share/rest_uploader
MacOS: Library/Application Support/rest_uploader

1 Like

I've been using this for quite a while, and wonder whether the new (-ish) Joplin OCR functionality can replace it. Can anyone who used REST Uploader but switched to Joplin OCR chime in? Some things I am curious about:

  • Did you use REST Uploader a lot before switching to the now native Joplin functionality? I have thousands of PDFs, plus a few images, that I imported using this tool.
  • Is the Joplin functionality better or worse?
  • How is OCR recognition accuracy?
  • How was the transition? Were there errors during OCR or an unreasonable wait before everything was indexed?
  • How does the native OCR functionality handle PDFs with text already included? Does it re-OCR and add another text layer that duplicates what REST Uploader already did?

Any feedback would be greatly appreciated.

1 Like