DeepSeek has announced Flash MLA, an efficient MLA decoding kernel for Hopper GPUs.Flash MLA is optimized for variable-length sequences and is now in production.DeepSeek has also introduced other open-source projects such as DeepEP, DeepGEMM, and Optimized Parallelism Strategies.