GitHub - KomputeProject/kompute: General purpose GPU compute framework built on Vulkan to support 1000s of cross vendor graphics cards (AMD, Qualcomm, NVIDIA & friends). Blazing fast, mobile-enabled, asynchronous and optimized...

A new blog on GPUs is out now
I had a bunch of notion pages in which I had written some notes while reading and watching videos on GPUs for CUDA purposes so I thought of doing vibe blogging by giving Claude my notes and telling make them in the form of a well-structured single blog
I don't d... See more
Lots of hot takes on whether it's possible that DeepSeek made training 45x more efficient, but @doodlestein wrote a very clear explanation of how they did it. Once someone breaks it down, it's not hard to understand. Rough summary:
* Use 8 bit instead of 32 bit floating point numbers, which gives massive memory saving... See more
Jared Friedmanx.com