328
DeepSeek's AI breakthrough bypasses industry-standard CUDA, uses assembly-like PTX programming instead
(www.tomshardware.com)
This is a most excellent place for technology news and articles.
The big win I see here is the amount of optimisation they achieved by moving from the high-level CUDA to lower-level PTX. This suggests that developing these models going forward can be made a lot more energy-efficient, something I hope can be extended to their execution as well. As it stands currently, "AI" (read: LLMs and image generation models) consumes way too many resources to be sustainable.
PTX also removes NVIDIA lock-in.
Wtf, this is literally the opposite of true. PTX is nvidia only.
Google was giving me bad search results about PTX so I just posted am opinion and hoped Cunningham's Law would work.
How cunning.