@shikuz Thanks for the chunking tip. This actually makes sense. Is there a way to add not only nearby chunks, but perhaps all chunks in the same note?
Regarding chunking, I would like to share a few thoughts from my experiments.
- Depending on your use case, you may face the problem that you need certain notes as a whole (e.g. a checklist or other content where integrity is very critical), for this use case it might actually be practical to have large chunks and then manually ensure that your notes are not longer than the chunk size. (That's why I'm grateful you're considering the approximate token count feature in notes)
EDIT: Ok I think this above is not accurate as I realised that even with a large chunk size I get small chunks due to frequent use of subheadings. To have everything in one chunk would require me to avoid further subheadings, I guess...
- Another solution to the lack of context caused by chunking might be if Jarvis supports RAPTOR RAG in the future.
- Another workaround I experimented with is providing metadata through subheadings. As I remember you once mentioned that the embedding model stores the subheadings that lead to the chunks, I use H1 subheading to include a keyword that is important for the context of each chunk and that the chunks themselves may not contain. I have yet to try how tags affect this behaviour, as I am a bit worried that they might not be as "noticeable" to the LLM as H1 subheadings. I think the use of metadata will be very important to further sort out notes that are semantically similar but do not fulfil a specific requirement. My question is whether I can achieve this selection of notes containing only some tags by using the "Search" command in the Jarvis block. (Once again, thanks for the decision to have a preview window, as this will help me with further refinement.)
- If you are like me, your notes are not the same in all your notebooks. So it makes sense to have different chunk settings for each notebook. Would it be possible in the future to change the chunks settings on a notebook by notebook basis? By the way, this might be a bit similar to my earlier request to have different profiles in Jarvis. I just wanted to let you know that I have found a workable workaround for the profiles by simply using a different Joplin profile and importing the notes I want to experiment with. For testing this is great, but for day to day use it would be nice to be able to change the chunk settings on a notebook basis as I often change these depending on my use case and it means rebuilding the database each time.