Hey @Divya-A10, on the embedding model - Xenova/bge-small-en-v1.5 is worth considering over all-MiniLM (it scores better on retrieval benchmarks at a similar size). The AI summarisation plugin by @HahaBill is a good reference for how Transformers.js runs inside a Joplin plugin.
On chunking - Joplin collections vary a lot: the same user might have short fleeting notes alongside long structured documents or web clippings. What similarity threshold are you thinking of for the merge step, and how does the approach behave across that range?
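To make the threshold question concrete, here is a rough sketch of what a similarity-based merge step could look like. It is not taken from your proposal or from any existing plugin: the `Chunk` type, `mergeSimilarNeighbours`, and the `threshold` and `maxChars` parameters are all hypothetical names, and it assumes each chunk already has an embedding.

```typescript
// Hypothetical sketch: merge neighbouring chunks whose embeddings are similar.
// All names here are illustrative assumptions, not part of Joplin or Transformers.js.
type Chunk = { text: string; embedding: number[] };

// Plain cosine similarity; returns 0 if either vector has zero length.
function cosineSimilarity(a: number[], b: number[]): number {
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  if (normA === 0 || normB === 0) return 0;
  return dot / (Math.sqrt(normA) * Math.sqrt(normB));
}

// Walks the chunks in order and appends a chunk to the previous merged text
// when it is similar enough to its immediate predecessor and the result
// stays within maxChars. Otherwise it starts a new merged chunk.
function mergeSimilarNeighbours(
  chunks: Chunk[],
  threshold: number,
  maxChars: number,
): string[] {
  const merged: string[] = [];
  for (let i = 0; i < chunks.length; i++) {
    const current = chunks[i];
    const canMerge =
      i > 0 &&
      cosineSimilarity(chunks[i - 1].embedding, current.embedding) >= threshold &&
      merged[merged.length - 1].length + 1 + current.text.length <= maxChars;
    if (canMerge) {
      merged[merged.length - 1] += '\n' + current.text;
    } else {
      merged.push(current.text);
    }
  }
  return merged;
}
```

With something like this, a single fixed `threshold` can behave quite differently on a two-line note and on a long web clipping, which is why the size cap (`maxChars` here) and the behaviour at both extremes are worth spelling out.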
For the broader question on what direction mentors have in mind, the AI scoping discussion is worth reading before finalising a proposal.
Feel free to open your own thread for proposal discussion - the submission template has the structure to follow.