The tool searches across a corpus of 138+ million academic papers, supplemented by PubMed and more than 545,000 clinical trials from ClinicalTrials.gov . Users can also import papers from other databases, allowing a single workflow to combine multiple search sources
.
Screening papers for relevance is one of the most labor-intensive steps in any review. Elicit automates this at two levels: abstract screening and full-text screening.
According to the tool's internal validation benchmarks, Elicit achieves 97% accuracy on abstract screening and 99% accuracy on full-text screening . These benchmarks were drawn from tests across 994 Cochrane reviews, a standard-setter in evidence-based medicine
.
The screening process also supports reproducibility: every screening decision can be reviewed and audited .
Once relevant papers are identified, Elicit extracts key information into customizable table columns. Users can ask for PICO elements (Population, Intervention, Comparison, Outcome), study methods, sample sizes, statistical findings, and other study characteristics .
Elicit reports 96% extraction accuracy in benchmark tests . Reviews of the tool describe this extraction capability as one of Elicit's "superpowers," noting that researchers can define custom columns for almost any data point
.
The extracted data can be exported for further analysis, making it useful for systematic reviews and meta-analyses .
Elicit offers a dedicated Systematic Review module that guides users step-by-step through the full process: protocol refinement, comprehensive source gathering, title and abstract screening, full-text screening, and data extraction .
The workflow supports PRISMA 2020 guidelines and is designed to be reproducible, traceable, and auditable at every step . Key features include:
Elicit Reports generates fully automated research overviews inspired by systematic reviews . These reports synthesize a body of literature with sentence-level citations drawn from the underlying papers
. Each AI-generated claim is directly linked to the source paper, enabling researchers to verify claims without leaving the report
.
The reports conduct a fully-automated rapid review — including search, screening, and data extraction — using the same engine that powers the Systematic Review workflow . Researchers can edit every intermediate step, ensuring trust in the final output
. Reports currently support up to 80 papers per summary
.
A study published in 2025 reports that Elicit can reduce literature review time by up to 80%, saving an estimated 16 hours per review compared to traditional methods . This aligns with the tool's own reported figures
.
The time savings come from automating the most repetitive steps: finding papers, screening them, manually extracting data, and synthesizing findings into a coherent summary.
Unlike a static protocol locked in at the start, Elicit allows researchers to iteratively refine their search criteria, add or remove screening columns, and narrow focus as they explore the literature . This flexibility mirrors the natural research process, where new questions emerge as the literature is read.
While Elicit's accuracy benchmarks are strong, they are based on the tool's own internal validation and have not been independently replicated at scale. The key reported figures include:
These benchmarks were run against Cochrane reviews, which are considered among the highest-quality evidence syntheses. But the exact methodology and potential limitations of the validation are not fully detailed in publicly available material. Researchers should verify Elicit's outputs in their own domain, particularly for specialized or niche fields where the training data may be thinner.
Elicit is best suited for:
Elicit has some important limitations:
Elicit provides a genuinely useful set of automation tools for literature reviews, from natural-language search and automated screening to structured extraction and report generation. Its accuracy benchmarks — 97% abstract screening and 96% data extraction against Cochrane reviews — are encouraging, though they come from the tool's own validation . For researchers looking to save time on the mechanical parts of a literature review, Elicit is one of the most capable AI assistants available today.
As with any AI research tool, the best results come from combining Elicit's automation with human judgment: use the tool to handle the tedious parts, but always verify the outputs that matter most to your research question.
Comments
0 comments