Hello everyone I’m currently trying to build a public chat bot for construction . I’m trying to do this efficient without having that much cost in expenses as it’s a completely public chat bot.
I am currently using RAQ for this. Haystack + datastax
I’ve converted all the docxs files into embeddings and it works like this. Query -> embeddings -> semantic search -> retrieval -> LLM
The problem is if the semantic search finds a bunch of nonsense, it’s a bit hard to feed the correct data to the LM. And since I have 50k files (docxs files), it’s hard to pinpoint the exact answer. This is because in some files, part of the document might contain the same information which is normal.
Any tip or advice will be appreciated!
submitted by /u/ullere
[link] [comments]