OpenAI Releases LifeSciBench: Measuring AI System Capabilities in Real-World Scientific Research Scenarios

2026-06-19 15:29

Odaily reports that OpenAI has released LifeSciBench, a new evaluation benchmark designed to measure how well AI systems perform in real-world scientific research scenarios. The benchmark reportedly comprises 750 expert-crafted tasks spanning 7 types of scientific research workflows and 7 fields of biology, contributed by 173 researchers with PhD backgrounds and experience in the biotech or pharmaceutical industry. Rather than testing isolated factual questions, it assesses complex research capabilities: evidence integration, experimental design, data analysis, scientific reasoning, and scientific communication. Over 79% of the tasks involve multi-step reasoning, with each task requiring about 4 reasoning steps on average, and the benchmark includes 1,062 real-world research data attachments (such as papers, charts, sequence data, and structure files).
