Google's TurboQuant saves memory, but won't save us from DRAM-pricing hell
Chocolate Factory’s compression tech clears the way to cheaper AI inference, not more affordable memory When Google unveiled TurboQuant , an AI data compression technology that promises to slash the amount of memory required to serve models, many hoped it would help with a memory shortage that has seen prices triple since last year.
Not so much.…
原文链接: The Register
