r/LLMDevs - Reddit
Advances in data processing techniques. You can increase context length in two ways. First, you can train the model with longer context lengths. That’s difficult because it’s much more computationally expensive, and it’s hard to find datasets with long context lengths (most documents in CommonCrawl have fewer than 2,000 tokens).
The second, more... See more
The second, more... See more
Shortwave — rajhesh.panchanadhan@gmail.com [Gmail alternative]
First of all, I'd say you have a bigger problem where your company is trying to find nails with a hammer. That is where your sentiment comes from, and could be an obstacle for both you and the company. It's the same deal when I see people keep on talking about RAG, and nowadays "modular RAG", when really, you could treat everything as a software... See more
r/MachineLearning - Reddit
t on LLMs and SQL highlighting why they don’t consistently work :
- “LLMs can write SQL, but they are often prone to making up tables, making up field”
- “LLMs have some context window which limits the amount of text they can operate over”
- “The SQL it writes may be incorrect for whatever reason, or it could be correct but just return an unexpected