Answer · Published 3 days ago · Last edited 3 days ago · 19 sources

Why AI Agents Fail at Basic Biology: The Data Infrastructure Urgently Needs Plumbing Work


[Image: abstract illustration of a DNA helix intersecting with digital circuitry and database nodes, symbolizing the infrastructure gap between AI and biological data.] The gap between AI and biology is not a failure of intelligence but of infrastructure, a lesson made clear by new research from Anthropic and leading scientific institutions.

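A large-scale collaboration among Anthropic, NCBI, the Broad Institute, and the Chan Zuckerberg Initiative (CZI) has exposed an uncomfortable truth about AI-driven science: even today's most powerful AI agents cannot reliably perform the simple task of fetching viral DNA sequences from a public database. According to the study, published in June 2026, the accuracy of frontier models such as Claude Sonnet 4 fell as low as 16.9% on this routine task. The cause, however, was not the AI's intelligence but the "plumbing." Bioinformatics infrastructure was designed for people clicking through web forms, not for autonomous agents. The team therefore built gget virus, a deterministic retrieval layer, and immediately reached nearly 100% accuracy, demonstrating that the fastest route to trustworthy AI biology is to fix the data pipes.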

Why AI agents run aground in biological databases

Laura Luebbert and colleagues describe the problem with a vivid analogy: using an AI agent to navigate biological data is like driving a modern sports car into a medieval city. The car is technically advanced, but the roads were never designed for it.

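The joint team tested several leading AI systems, including Claude, GPT-based models, Biomni Open Source, and Edison Analysis, on a seemingly simple task: retrieving viral sequence data from the NCBI Virus database, a resource virologists rely on for outbreak tracking and diagnostics development. The results were striking.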

Human-centered design, worst-case performance for agents

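NCBI Virus and many other public biological databases were built around workflows in which a person interacts through a browser. Scientists click filters, inspect results by eye, and rely on visual cues. That interface logic is fundamentally mismatched with autonomous agents, which expect structured, programmatic commands.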
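To make the contrast concrete, here is a minimal Python sketch of what a structured, deterministic retrieval call could look like from an agent's side. It is not the gget virus interface, which this article does not describe. `VirusQuery`, `SequenceRecord`, `retrieve`, and the toy accessions are all hypothetical names invented for illustration, and the "database" is a small in-memory list.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class VirusQuery:
    """Hypothetical structured query; field names are illustrative only."""
    virus: str
    host: Optional[str] = None
    min_length: int = 0


@dataclass(frozen=True)
class SequenceRecord:
    """Hypothetical record shape, not a real database schema."""
    accession: str
    virus: str
    host: str
    length: int


def retrieve(query: VirusQuery, records: list[SequenceRecord]) -> list[str]:
    """Return matching accessions, sorted so repeated calls always agree."""
    hits = [
        r.accession
        for r in records
        if r.virus.lower() == query.virus.lower()
        and (query.host is None or r.host.lower() == query.host.lower())
        and r.length >= query.min_length
    ]
    return sorted(hits)


# Toy in-memory stand-in for a database; the accessions are made up.
TOY_RECORDS = [
    SequenceRecord("TOY0003", "Zika virus", "Homo sapiens", 10_800),
    SequenceRecord("TOY0001", "Zika virus", "Homo sapiens", 10_200),
    SequenceRecord("TOY0002", "Zika virus", "Aedes aegypti", 10_700),
    SequenceRecord("TOY0004", "Dengue virus", "Homo sapiens", 10_700),
]

if __name__ == "__main__":
    q = VirusQuery(virus="zika virus", host="homo sapiens", min_length=10_500)
    print(retrieve(q, TOY_RECORDS))
```

The point of the sketch is the shape of the call: every filter is an explicit, typed field rather than a click in a web form, and the same query over the same records yields the same ordered result regardless of how the records happen to be stored.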
