答案已發布7 天前Last edited 5 天前21 個來源

拆解小鵬的500億豪賭：用「無語言」架構挑戰特斯拉FSD

小鵬證實每年花費高達5億美元於自駕AI模型訓練，僅為運算成本；其更廣泛的AI研發預算預計在2026年將躍升至近9.7億美元。在CVPR 2026上，小鵬推出第二代視覺語言行動模型 (VLA 2.0)，徹底移除了駕駛流程中的語言標記。AI主管劉顯明直言：「語言是毒藥」，是為了達到即時反應效能的關鍵決策。

使用 Studio Global AI 搜尋並查證事實瀏覽更多熱門頁面

384K0

A conceptual visualization of XPeng's VLA 2.0 AI architecture transforming raw visual data directly into driving actions, bypassing language tokens. — How much does Xpeng spend annually on AI model training for autonomous driving, what new VLA 2.0 architecture did it introduce at CVPR 2026XPeng's VLA 2.0 model adopts a 'Vision-Implicit Token-Action' path, eliminating language as an intermediate step to improve real-time driving performance.
AI 提示詞
Create a landscape editorial hero image for this Studio Global article: How much does Xpeng spend annually on AI model training for autonomous driving, what new VLA 2.0 architecture did it introduce at CVPR 2026. Article summary: Here are the answers to your three questions, based on recent reporting by Electrek and XPeng's official announcements.. Topic tags: general, general web, user generated. Reference image context from search candidates: Reference image 1: visual subject "Xpeng’s head of autonomous driving told Electrek that the company is spending roughly 300 million RMB (~$41 million) per month on AI training alone and believes it has already reac" source context "Xpeng spends $500M/year on AI training to beat Tesla FSD | Electrek" Reference image 2: visual subject "Xpeng’s head of autonomous driving told Electrek that the company is spending roughly 300 million RMB (~$41 millio
openai.com

在一場日漸白熱化的技術與言論交鋒中，中國汽車製造商小鵬（XPeng）公開了其挑戰特斯拉自動駕駛主導地位的藍圖。在2026年6月的電腦視覺與模式辨識大會（CVPR）上，該公司詳細闡述了一種激進的AI架構，其核心在於拋棄多數機器人系統的關鍵組件——語言。伴隨著驚人的年度訓練支出曝光，小鵬正試圖證明，他們的次世代方案不僅與眾不同，在根本上更快速、更可靠。

小鵬通用智慧中心負責人、AI主管劉顯明博士在接受《Electrek》專訪時透露，公司每月投入約人民幣3億元，折合每年約5億美元（約新台幣160億元），僅用於AI模型的訓練。這筆數字僅涵蓋了訓練自駕模型的原始運算成本，並不包括公司總體的AI研發預算。小鵬在2025年用於AI相關的研發支出約為6.52億美元，而隨著其野心的擴張，此數字預計在2026年將攀升至約9.7億美元。

VLA 2.0：移除「語言」這個效能瓶頸

小鵬在CVPR 2026上的重頭戲，是正式發表其第二代視覺語言行動模型（Vision-Language-Action model），即VLA 2.0。此架構從根本上偏離了許多AI系統——包含小鵬自家的第一代模型——處理駕駛任務的方式。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查證事實

大家也會問