Discussion: How DeepSeek Turned Storage Into the Scheduler


DeepSeek showed that in sparse models, the real delay isnt compute but waiting on the right weights. In the age of expert routing, storage became the scheduler.

Read more at: How DeepSeek Turned Storage Into the Scheduler | Shared Everything From VAST

What are your thoughts? Did you learn something new? Do you agree with this take?