Claude Opus 4.7, GPT-5.5, DeepSeek V4, Kimi K2.6: How Should You Read the 2026 Benchmarks? | Deep Research