My top issues with the foundations of the "modern data stack":
Snowflake - you may only pay for what you use, but you pay through the nose for it. The cost is so high that in many cases it can exceed the cost of binders full of dbas.
Fivetran - convenient, but they tried to triple my licensing cost last year, they have almost constant 15 minute
All you have to do is tell the chatbot what you need. Blaze learns about your Database, writes your SQL code, runs the code to get the data, and... See more
“I look for features from data scientists, [who have ideas of] things that are correlated with what I’m trying to predict.” We found that organizations explicitly prioritized cross-team collaboration as part of their ML culture. Md3 said: We really think it’s important to bridge that gap between what’s often, you know, a [subject matter expert] in... See more
The value of data is the value of what you can do with it. Therefore, the more you’re allowed to do — the more usage rights the seller grants you — the more you should be willing to pay! This is a pricing effect that’s completely independent of quality, quantity, dataset internals, or table stakes status.
GPTScript is a new scripting language to automate your interaction with a Large Language Model (LLM), namely OpenAI. The ultimate goal is to create a fully natural language based programming experience. The syntax of GPTScript is largely natural language, making it very easy to learn and use. Natural language prompts can be mixed with... See more
For example, if you ask a model to “return all active users in the last 7 days” it might hallucinate a `is_active` column, join to an `activity` table that doesn’t exist, or potentially get the wrong date (especially in leap years!).
We previously talked to Shreya Rajpal at Guardrails AI, which also supports Text2SQL enforcement. Their approach was... See more
Trying to get a better understanding of how prompts work in relation to fine-tunes, and trying to see if any of them are actually reliable enough to be used in a "production" type environment.
My end goals are basically
A reliable AI assistant that I know is safe, secure and private. Any information about myself, my household or my proprietary ideas
Where possible, we try to match the Hugging Face implementation. We are open to adjusting the API, so please reach out with feedback regarding these details.