Idea 3 discussion - Using Python subprocess for UMAP and HDBSCAN instead of JavaScript

During my GSoC, I tried to use Python libraries that run scientific packages in JS/TS, but it was quite difficult to get them working in Joplin’s desktop environment. You can try it, but I wouldn’t recommend it, since I think it would be out of scope.

I like the idea of using density-based clustering! Not only do you not have to choose k up front, but it also handles noise.
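
To make the "no k, handles noise" point concrete, here is a minimal sketch of plain DBSCAN in pure Python. Note that this is DBSCAN, a simpler relative of HDBSCAN, not HDBSCAN itself, and the toy 2D points, `eps`, and `min_pts` values are made up for illustration (real note embeddings would be higher-dimensional and need tuning).

```python
import math

NOISE = -1  # label for points that belong to no cluster


def dbscan(points, eps, min_pts):
    """Label each point with a cluster id (0, 1, ...) or NOISE."""
    labels = [None] * len(points)  # None means "not visited yet"

    def neighbors(i):
        # Indices of all points within eps of point i (including i itself).
        return [j for j, q in enumerate(points) if math.dist(points[i], q) <= eps]

    cluster = 0
    for i in range(len(points)):
        if labels[i] is not None:
            continue
        seeds = neighbors(i)
        if len(seeds) < min_pts:
            labels[i] = NOISE  # may be reclaimed later as a border point
            continue
        labels[i] = cluster
        queue = [j for j in seeds if j != i]
        while queue:
            j = queue.pop()
            if labels[j] == NOISE:
                labels[j] = cluster  # border point: joins but does not expand
            if labels[j] is not None:
                continue
            labels[j] = cluster
            nbrs = neighbors(j)
            if len(nbrs) >= min_pts:  # core point: keep growing the cluster
                queue.extend(nbrs)
        cluster += 1
    return labels


# Hypothetical toy data: two tight groups and one outlier.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11),
          (50, 50)]
labels = dbscan(points, eps=1.5, min_pts=3)
print(labels)  # [0, 0, 0, 0, 1, 1, 1, 1, -1]
```

The number of clusters falls out of the data, and the outlier gets the label -1 instead of being forced into a cluster.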

But if you cannot find any packages that properly implement those algorithms, then you can either:

  • Use KMeans, estimate the optimal k, and think about the UI/UX for cases where the chosen k turns out not to be optimal.
  • Or prompt-engineer your agent to run a density-based clustering algorithm with Claude's code execution tool (see "Code execution tool" in the Claude API Docs).
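
For the KMeans option, one common way to estimate k is to try a few values and keep the one with the best mean silhouette score. Below is a rough pure-Python sketch of that idea. The function names, the farthest-first initialization (chosen here only to keep the run deterministic), and the toy 2D points are all my own assumptions for illustration, not something taken from Joplin or a specific library.

```python
import math


def init_centers(points, k):
    # Farthest-first initialization: deterministic, spreads centers out.
    centers = [points[0]]
    while len(centers) < k:
        centers.append(max(points, key=lambda p: min(math.dist(p, c) for c in centers)))
    return centers


def kmeans(points, k, iters=50):
    """Plain Lloyd's algorithm; returns one cluster label per point."""
    centers = init_centers(points, k)
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: nearest center for every point.
        labels = [min(range(k), key=lambda c: math.dist(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        new_centers = []
        for c in range(k):
            members = [p for p, l in zip(points, labels) if l == c]
            if members:
                new_centers.append(tuple(sum(x) / len(members) for x in zip(*members)))
            else:
                new_centers.append(centers[c])  # keep an empty cluster's center
        if new_centers == centers:
            break
        centers = new_centers
    return labels


def silhouette(points, labels):
    """Mean silhouette coefficient; higher means tighter, better separated clusters."""
    clusters = {}
    for p, l in zip(points, labels):
        clusters.setdefault(l, []).append(p)
    if len(clusters) < 2:
        return -1.0
    scores = []
    for p, l in zip(points, labels):
        own = clusters[l]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for m, other in clusters.items() if m != l)
        scores.append((b - a) / max(a, b) if max(a, b) > 0 else 0.0)
    return sum(scores) / len(scores)


def best_k(points, k_values):
    # Score every candidate k and return the winner plus all scores.
    scored = {k: silhouette(points, kmeans(points, k)) for k in k_values}
    return max(scored, key=scored.get), scored


# Hypothetical toy data: three well-separated groups.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 0), (10, 1), (11, 0), (11, 1),
          (5, 9), (5, 10), (6, 9), (6, 10)]
best, scores = best_k(points, range(2, 6))
print(best)  # 3 for this toy data
```

Keeping the full `scores` dict around could also help with the UI/UX side: if two values of k score almost the same, the interface could offer both rather than silently picking one.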