BTC

ETH

HTX

SOL

BNB

简中

繁中

English

日本語

한국어

ภาษาไทย

Tiếng Việt

設置

更多

登錄

OpenAI Releases LifeSciBench: Measuring the Capabilities of AI Systems in Real-World Scientific Research Scenarios

2026-06-19 15:29

Odaily Planet Daily News OpenAI has released a new evaluation benchmark, LifeSciBench, designed to measure the capabilities of AI systems in real-world scientific research scenarios. It is reported that LifeSciBench is based on 750 expert-written tasks, covering 7 types of scientific research workflows and 7 biology fields. The tasks were contributed by 173 researchers with Ph.D. backgrounds and experience in the biotechnology or pharmaceutical industries. This benchmark emphasizes the assessment of complex scientific research capabilities, including evidence synthesis, experimental design, data analysis, scientific reasoning, and scientific communication, rather than simple fact-based questions. Over 79% of the tasks involve multi-step reasoning, with each task requiring an average of about 4 reasoning steps, and includes 1,062 data appendices related to real scientific research (such as papers, charts, sequence data, and structural files).

推薦文章

「英伟达概念股」CoreWeave聯創訪談：AI需求似乎每天都在加劇

STRC 脫錨 11%，Strategy 的永動機還轉得動嗎？

When World Cup Meets Agent: From Web2 to Web3, How Will Wallets Evolve Toward Agentic Wallets?

Gate 研究院：存储三巨頭市值集體破萬億

搜索

24小時快訊

2026-06-19 16:58

伊朗外交部副部长：60天后将采用新的机制来管理霍尔木兹海峡

2026-06-19 16:11

美国情报部门警告：以色列可能破坏美伊协议

2026-06-19 15:59

特朗普對伊朗協議定調旨在駁斥美國失敗論

2026-06-19 15:53

停火消息傳出，以軍無人機仍持續空襲黎巴嫩南部

2026-06-19 15:45

Iranian Foreign Minister: The US is responsible for any violation of the terms of the Memorandum of Understanding

2026-06-19 15:34

分析：標普500的半導體板塊市值佔比升至18.8%創新高

下載Odaily星球日報app

讓一部分人先讀懂 Web3.0

Android

Odaily星球日報品牌媒體資料包 | 官方Logo與視覺規範下載

北京瑞克文化傳媒有限公司

京ICP备 2026027382号

京公网安备11010502060861号