GitHub - Nike-Inc/koheesio: Python framework for building efficient data pipelines. It promotes modularity and collaboration, enabling the creation of complex pipelines from simple, reusable components.
Towhee is a cutting-edge framework designed to streamline the processing of unstructured data through the use of Large Language Model (LLM) based pipeline orchestration. It is uniquely positioned to extract invaluable insights from diverse unstructured data types, including lengthy text, images, audio and video files. Leveraging the capabilities of... See more
towhee-io • GitHub - towhee-io/towhee: Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
Nicolay Gerold added
The last core data stack tool is the orchestrator. It’s used quickly as a data orchestrator to model dependencies between tasks in complex heterogeneous cloud environments end-to-end. It is integrated with above-mentioned open data stack tools. They are especially effective if you have some glue code that needs to be run on a certain cadence, trigg... See more
Data Engineering • The Open Data Stack Distilled into Four Core Tools
Nicolay Gerold added
GitHub - FlowiseAI/Flowise: Drag & drop UI to build your customized LLM flow
github.comAndrés added
Mem0: The Memory Layer for Personalized AI
Mem0 provides a smart, self-improving memory layer for Large Language Models, enabling personalized AI experiences across applications.
Mem0 provides a smart, self-improving memory layer for Large Language Models, enabling personalized AI experiences across applications.
Note: The Mem0 repository now also includes the Embedchain project. We continue to maintain and support Embedchain ❤️. You can find the Embedchain codebase in the embedchai... See more
GitHub - mem0ai/mem0: The memory layer for Personalized AI
Nicolay Gerold added
Sugarcane AI is an open-source framework designed to simplify and accelerate LLM app development. With a focus on fine-tuned Language Models (LLMs), prompt management, and Workflow Plugins, Sugarcane AI empowers developers to build, train, and manage complex LLM applications effortlessly. 🎉
Why Sugarcane AI? : Key Features of Microservices Framewor... See more
Why Sugarcane AI? : Key Features of Microservices Framewor... See more
sugarcane-ai • GitHub - sugarcane-ai/sugarcane-ai.github.io
Nicolay Gerold added
So what abstractions do we have as of today? For example, let’s take the resource abstraction (Dagster, Prefect, referred to as an operator in Airflow). You abstract complex environments and connections away with a simple construct like that. You have the immediate benefits of defining that once and using it in every task or pipeline with context.r... See more
Data Engineering • Data Orchestration Trends: The Shift From Data Pipelines to Data Products
Nicolay Gerold added
The ideal solution for AI-native vectorDB would be something that would would be easy to set up and should integrate with existing APIs for rapid prototyping but should be able to scale without additional changes.
LanceDB is designed with this approach. Being server-less, it requires no setup — just import and start using. Persisted in HDD, allowing... See more
LanceDB is designed with this approach. Being server-less, it requires no setup — just import and start using. Persisted in HDD, allowing... See more
Ayush Chaurasia • LLMs, RAG, & the missing storage layer for AI
Nicolay Gerold added
Projects like Ceramic allow users to take their data with them across the internet. Two contributors highlighted this open-source protocol, describing how it could change how we build and interact with on-chain applications.