GitHub - databonsai/databonsai: clean & curate your data with LLMs.

Firecrawl

firecrawl.dev

GitHub - elicit/machine-learning-list: A curriculum for learning about foundation models, from scratch to the frontier

github.com

GitHub - mit-han-lab/streaming-llm: Efficient Streaming Language Models with Attention Sinks

github.com

GitHub - virattt/ai-hedge-fund: An AI Hedge Fund Team

github.com

GitHub - alibaba/data-juicer: A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷