GPT-5.5, Claude Opus 4.7, DeepSeek V4, and Kimi K2.6 Benchmark Comparison | Answers