答え公開済み19 時間前Last edited 17 時間前26 ソース

小米が1兆パラメーターモデルで1000トークン/秒を突破、汎用GPUのみで～ UltraSpeedモード発表

小米とTileRTが2026年6月に「MiMo V2.5 Pro UltraSpeed」を発表。1兆パラメーターモデルとして初めて、標準的な8GPUサーバーで1000トークン/秒のデコード速度を突破し、カスタムチップ不要の高速推論を実現した [7][9][12]。速度達成の鍵は3つの技術の組み合わせ。すなわち、MoEエキスパート層のみに適用するFP4混合精度量子化、ブロックレベルのマスク並列予測によるDFlash投機的デコード、そしてGPUに計算パイプラインを常駐させるTileRTの常駐カーネルエンジンとワープ特殊化である [2][4][37]。

Studio Global AIで検索して事実確認さらにトレンドページを見る

29K0

Conceptual visualization of Xiaomi MiMo-V2.5-Pro-UltraSpeed achieving over 1,000 tokens per second on a trillion-parameter model using standard GPUs. — What did Xiaomi announce on June 6, 2026 regarding MiMo-V2.5-Pro-UltraSpeed, including the specific tokens-per-second milestone achieved onA conceptual representation of high-speed AI inference on standard GPU hardware.
AI プロンプト
Create a landscape editorial hero image for this Studio Global article: What did Xiaomi announce on June 6, 2026 regarding MiMo-V2.5-Pro-UltraSpeed, including the specific tokens-per-second milestone achieved on. Article summary: On **June 8, 2026** (with major reports appearing on June 9), Xiaomi's MiMo team, in collaboration with TileRT, announced **MiMo-V2.5-Pro-UltraSpeed** — a new high-speed inference mode for its trillion-parameter flagship. Topic tags: general, general web, user generated, documentation. Reference image context from search candidates: Reference image 1: visual subject "# Xiaomi rolls out MiMo V2.5 with multimodal AI and improved efficiency. Xiaomi has introduced its MiMo-V2.5 model family, adding multimodal capabilities and advancing its push int" source context "Xiaomi rolls out MiMo V2.5 with multimodal AI and improved efficiency" Reference image 2: visual subje
openai.com

2026年6月8日、小米（Xiaomi）の大規模言語モデルチーム「MiMo」と推論パートナー「TileRT」は、「MiMo-V2.5-Pro-UltraSpeed」を発表しました。これは、1兆（1トリリオン）パラメーターモデルにおいて、業界で初めて1000トークン/秒を超える出力速度を、カスタムチップではなく標準的な8GPUノード上で達成した高速推論モードです。

速度マイルストーン

発表内容の核心は、1.02兆パラメーター、アクティブパラメーター420億、100万トークンのコンテキストウィンドウを持つ「MiMo-V2.5-Pro」上で、1000トークン/秒以上の持続的なスループットを実現した点にあります。デモでは瞬間的に約1200トークン/秒のピーク速度も確認されました。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

Studio Global AIで検索して事実確認

人々も尋ねます