Our 5 favourite open-source customer data platforms
Traditional ETL solutions are still quite powerful when it comes to:
- Common connectors with small-medium data volumes : we still have a lot of respect for companies like Fivetran, who have really nailed the user experience for the most common ETL use cases, like syncing Zendesk tickets or a production Postgres read replica into Snowflake. The only
Why you should move your ETL stack to Modal
Nicolay Gerold added
The last core data stack tool is the orchestrator. It’s used quickly as a data orchestrator to model dependencies between tasks in complex heterogeneous cloud environments end-to-end. It is integrated with above-mentioned open data stack tools. They are especially effective if you have some glue code that needs to be run on a certain cadence, trigg... See more
Data Engineering • The Open Data Stack Distilled into Four Core Tools
Nicolay Gerold added
Programmable platform for data in motion
An open-source data streaming platform with in-line computation capabilities. Apply your custom programs to aggregate, correlate, and transform data records in real-time as they move over the network.
An open-source data streaming platform with in-line computation capabilities. Apply your custom programs to aggregate, correlate, and transform data records in real-time as they move over the network.
The programmable data streaming platform
Nicolay Gerold added
In today’s data world, there are so many options for an EL tool to avoid you developing your own extracting script and to help you gain a LOT of time.
Fivetran, Mage, and Airbyte to mention a few.
You don’t have to maintain custom scripts, these tools come with +300 connectors, basic scheduling, and error handling.
Jeremy • What I Learned After One Year of Building a Data Platform From Scratch
- Khoros (fka Lithium) - Samsung, Sephora, Microsoft, Airbnb, Powerschool, eBay, Etsy - Higher Logic (Acq. Vanilla Forums) - Tesla, Electronic Arts, and a lot of nonprofits. They also have good onboarding - Salesforce Experience Cloud - Farmers Insurance (Agents), UC BerkeleyAnd there are a few new entrants that might be inter... See more
Li Jin • Community leaders deserve better: An open letter about community software
Jacob Borgeson added
Navigating the terrain of vector databases in 2023 reveals a diverse array of options each catering to different needs. The comparison table paints a clear picture, but here's a succinct summary to aid your decision:
- Open-Source and hosted cloud : If you lean towards open-source solutions, Weviate, Milvus, and Chroma emerge as top contenders. Pinec
Picking a vector database: a comparison and guide for 2023
Nicolay Gerold added
Hadoop as the last word in big data platform infrastructure. It was one of the first tools to serve this purpose, but there are already multiple alternatives, some new and some already well understood, and
Thomas H. Davenport • Big Data at Work: Dispelling the Myths, Uncovering the Opportunities
- Bob Muglia has the best definition. Others simply miss essential parts. The MDS isn’t just about open source or dbt; it is about SaaS, Cloud, Snowflake, and more. It is the wrapper around the progress in analytics over the last years.
- You should try to go for a 100% SaaS MDS . But try not to build up too many dependencies (yes, that’s possible; yo
Sven Balnojan • Breaking Down the Modern Data Stack: Practical Insights for Leveraging Analytics Progress
Nicolay Gerold added