BTC

ETH

HTX

SOL

BNB

简中

繁中

English

日本語

한국어

ภาษาไทย

Tiếng Việt

Settings

More

Login

OpenAI Releases LifeSciBench: Measuring AI Systems' Capabilities in Real-World Scientific Research Scenarios

2026-06-19 15:29

Odaily Planet Daily News OpenAI has released a new evaluation benchmark, LifeSciBench, designed to measure the capabilities of AI systems in real-world scientific research scenarios. Reportedly, LifeSciBench is based on 750 expert-crafted tasks, covering 7 types of scientific research workflows and 7 biological domains. The tasks were contributed by 173 researchers with PhD backgrounds and experience in the biotech or pharmaceutical industries. This benchmark emphasizes the assessment of complex scientific research capabilities, including evidence integration, experimental design, data analysis, scientific reasoning, and scientific communication, rather than single factual questions. Over 79% of the tasks involve multi-step reasoning, requiring an average of approximately 4 reasoning steps per question, and include 1,062 real-world research-related data attachments (such as papers, charts, sequence data, and structural files).

Recommended Articles

「NVIDIA proxy stock」CoreWeave co-founder interview: AI demand seems to be intensifying every day

STRC de-pegs by 11%, can Strategy's perpetual motion machine still keep running?

When the World Cup Meets Agent: From Web2 to Web3, How Are Wallets Evolving Toward Agentic Wallets?

Gate Research: Market Cap of the Big Three Storage Giants Surpasses $1 Trillion Collectively

Search

24-Hour Flash News

2026-06-19 16:58

伊朗外交部副部长：60天后将采用新的机制来管理霍尔木兹海峡

2026-06-19 16:11

美国情报部门警告：以色列可能破坏美伊协议

2026-06-19 15:59

Donald Trump frames Iran deal amid pushback against 'America loses' narrative

2026-06-19 15:53

A ceasefire was announced, yet Israeli military drones continue to conduct airstrikes on southern Lebanon

2026-06-19 15:45

伊朗外长：美国应对任何违反谅解备忘录条款的行为负责

2026-06-19 15:34

Analysis: The market cap share of the semiconductor sector in the S&P 500 rises to 18.8%, hitting a new high

Download Odaily App

Let Some People Understand Web3.0 First

Android

Advertising Cooperation

Odaily Brand Media Kit | Official Logo & Visual Guidelines

Beijing Ruike Culture Media Co., Ltd.

京ICP备 2026027382号

京公网安备11010502060861号