答案已發布2 週前Last edited 2 週前27 來源

拆解阿里Qwen-Robot套件：AI唔再齋噏，直接落場幫機械人「揸筷子」同「認路」

阿里Qwen Robot套件係三合一AI模型組合：Qwen RobotManip用80維度統一動作語言，令唔同牌子嘅機械臂都識得「執螺絲」；Qwen RobotNav整合五種導航任務，等於畀咗個「人肉GPS」機械人；Qwen RobotWorld仲識得預演物理世界變化，好似幫機械人裝咗個「水晶球」[1][7][8]。成個系統嘅最煞食之處係「模組化」：你可以單獨用導航模型俾倉庫送貨車，亦可以三件一齊上，砌出一個行得、睇得、諗得嘅全棧機械人，而且背後仲有阿里嘅開源生態同雲端基建撐腰 [9][10][21]。

使用 Studio Global AI 搜尋並查核事實瀏覽更多熱門頁面

252K0

An abstract visualization of three AI-powered robots representing the Qwen-Robot Suite's capabilities in manipulation, navigation, and world modeling. — What is Alibaba’s new Qwen-Robot AI model suite for robotics, what are the roles of Qwen-RobotNav, Qwen-RobotManip, and Qwen-RobotWorld, howAlibaba's Qwen-Robot Suite introduces three foundation models designed to power manipulation, navigation, and world prediction for physical AI agents.
AI 提示
Create a landscape editorial hero image for this Studio Global article: What is Alibaba’s new Qwen-Robot AI model suite for robotics, what are the roles of Qwen-RobotNav, Qwen-RobotManip, and Qwen-RobotWorld, how. Article summary: In June 2026, Alibaba launched the **Qwen-Robot Suite**, its first suite of AI models for robots, positioning it as a move beyond chatbot-style “digital AI” into embodied intelligence for the physical world [6][7]. The s. Topic tags: general, academic, general web, news, user generated. Reference image context from search candidates: Reference image 1: visual subject "BABA-W (09988.HK) -2.300 (-2.104%)) Short selling $836.00M; Ratio 11.269%) rolled out the Qwen-Robot embodied AI foundation model series, comprising three core models: the VLA man" source context "BABA-W Rolls out Qwen-Robot Embodied AI Foundation Model Series" Reference image 2: visual subject "B
openai.com

由電商巨頭轉型做科技霸主，阿里巴巴近年喺數碼AI嘅步伐快到冇朋友。但係2026年6月，佢哋旗下嘅通義千問實驗室（Qwen）玩咗舖更大嘅：正式推出Qwen-Robot套件。呢個唔係普通嘅語言模型升級，而係佢哋第一個專為「具身智能」（Embodied AI）而設嘅AI模型系列。簡單講，即係AI由電腦Mon跳咗出嚟，直接指揮機械人喺現實世界做嘢，標誌住阿里正式由「口水戰」轉去「實體戰」。

成個套件嘅設計理念好聰明，唔係盲頭烏蠅咁整一個萬能模型，而係將機械人需要嘅能力拆成三嚿：一對靈巧嘅手、一對識路嘅腳，同埋一個曉思考嘅大腦。呢種模組化嘅玩法，令到唔同形態嘅機械人都可以好似砌積木咁，揀自己啱用嘅「器官」裝上去。

三嚿「器官」點分工？

Qwen-RobotManip：機械人終於唔使「雞手鴨腳」

呢個係一個視覺-語言-動作（VLA）模型，底層架構係Qwen3.5-4B，專門負責操作控制。佢最大嘅突破，在於用咗一套80維度嘅統一動作表徵。你可以當佢係一種畀機械人睇嘅「萬國肢體語言」，無論係邊個廠出嘅機械臂，只要識睇呢套語言，就識得聽人話做嘢。

Studio Global AI

Search, cite, and publish your own answer

Use this topic as a starting point for a fresh source-backed answer, then compare citations before you share it.

使用 Studio Global AI 搜尋並查核事實

人們還問