The next challenge for building production RAG pipelines is scaling performance/compute to large numbers of documents.
We’re using our own documentation (400+ pages) as a testbed for advanced “chat with docs” architectures.
To do performant retrieval across a wide range of queries at this l...
Jerry Liu (x.com)