February 2025 Archive
Summary: DeepSeek open-source work, Day 5. 🚀 Day 5 of #OpenSourceWeek: 3FS, Thruster for All DeepSeek Data Access. Fire-Flyer File System (3FS) - a parallel file system that ut
Summary: DeepSeek open-source work, Day 4. 🚀 Day 4 of #OpenSourceWeek: Optimized Parallelism Strategies. ✅ DualPipe - a bidirectional pipeline parallelism algorithm for computa
Summary: DeepSeek open-source work, Day 3. 🚀 Day 3 of #OpenSourceWeek: DeepGEMM. Introducing DeepGEMM - an FP8 GEMM library that supports both dense and MoE GEMMs, powering V3/
Summary: DeepSeek open-source work, Day 2. 🚀 Day 2 of #OpenSourceWeek: DeepEP. Excited to introduce DeepEP - the first open-source EP communication library for MoE model traini
Summary: DeepSeek open-source work, Day 1. 🚀 Day 1 of #OpenSourceWeek: FlashMLA. Honored to share FlashMLA - our efficient MLA decoding kernel for Hopper GPUs, optimized for va
Summary: DeepSeek announces its open-source plan on X. 🚀 Day 0: Warming up for #OpenSourceWeek! We're a tiny team @deepseek_ai exploring AGI. Starting next week, we'll be open-sourcin