Answer published 2 months ago · Last edited last month · 15 sources

Alibaba's Qwen3.7-Max ranks fourth worldwide in coding, ahead of OpenAI and Google

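Alibaba's Qwen3.7-Max took fourth place on the Code Arena global coding leaderboard with 1,541 points, placing it directly ahead of OpenAI's GPT-5.5 and Google's Gemini 3.5 Flash [1][2]. The top five is otherwise held almost entirely by Anthropic's Claude series (Opus 4.7 and 4.6 variants). Qwen3.7-Max is the only model to break into that group, and it also outscored the non-thinking version of Claude Opus 4.6, a sign that the AI coding race has entered a new phase [6][7].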


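Alibaba's latest flagship model, Tongyi Qianwen Qwen3.7-Max, recently reached a milestone in the global coding field. On Code Arena, a third-party coding benchmark widely regarded in the industry as highly authoritative, Qwen3.7-Max scored 1,541 points and ranked fourth worldwide, making the Chinese tech giant the only developer other than the US company Anthropic to place in the top five.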

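It is also the first time a Chinese-developed AI model has reached this tier of global coding performance. The other four places in the top five are held by Anthropic's Claude models, including Claude Opus 4.7 (thinking and non-thinking versions) and the thinking version of Claude Opus 4.6. Qwen3.7-Max finished ahead of strong rivals such as OpenAI's GPT-5.5 and Google's Gemini 3.5 Flash, becoming the only non-US, non-Anthropic frontier model in the top five and ending the dominance that US giants had held there.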

A third force emerges, and the result carries weight

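The Code Arena leaderboard was updated on May 25, 2026. It focuses on rigorous, independent evaluation of the coding ability of large language models. Several industry reports describe it as one of the most credible and authoritative benchmarks in AI coding today.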

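The 1,541-point score shows that Alibaba's Qwen model family can compete at the highest level of global competition. It also puts the model's coding ability directly ahead of the non-thinking version of Claude Opus 4.6.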

A reshuffled competitive landscape

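The result upends a field in which top-tier coding AI had been dominated by two US companies. It sends a clear signal: Chinese AI labs can now produce models that hold their own on real software development tasks. The rapid rise of Qwen3.7-Max is part of a broader trend in the AI coding race, with models such as Moonshot's Kimi K2.5 also recently entering the global top ten.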

Beyond coding

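Although the Code Arena result is the most eye-catching, Qwen3.7-Max also performs respectably in other areas. On the Design Arena leaderboard it placed tenth, which suggests solid capability in multimodal evaluation and shows that its strengths are not limited to coding. The model is also described as combining strong reasoning with support for long-running autonomous tasks: it can work continuously for 35 hours and make more than 1,000 tool calls.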

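For developers and enterprises, the signal is clear: the choice of next-generation AI coding assistants is no longer limited to a single region or company. Alibaba's Qwen3.7-Max has placed itself on the shortlist of frontier models worth benchmarking against real software engineering workflows.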
