English
全部
搜索
图片
视频
地图
资讯
Copilot
更多
购物
航班
旅游
笔记本
Top stories
Sports
U.S.
Local
World
Science
Technology
Entertainment
Business
More
Politics
时间不限
过去 1 小时
过去 24 小时
过去 7 天
过去 30 天
最佳匹配
最新
36氪
1 年
AI科学家太多,谁靠谱一试便知,普林斯顿新基准CORE-Bench:最强模型 ...
普林斯顿大学发布CORE-Bench评测AI复现科研。 普林斯顿大学新发布的CORE-Bench基准测试,通过270个基于90篇跨学科科学论文的任务,可评估AI智能体在计算可重复性方面的表现,最简单任务的准确率可以达到60%,最难任务准确率仅有21% 大模型的能力越来越强,用户在 ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果
今日热点
‘Ketamine Queen’ sentenced
Added to endangered list
Husband arrested in Bahamas
Speaks out after crash
Man pleads in NY terror plot
Plane crash at AZ airport
Mountaineering legend dies
Won't testify in Epstein probe
Announces retirement at 31
Prosecutors seek drug records
Invests in second data center
Iran closes Strait of Hormuz
Reds place Trevino on IL
Madeleines recalled
TN court blocks media access
British pastor charged
Vance on US-Iran ceasefire
'Game of Thrones' actor dies
Philly parking garage collapse
Dodgers great Lopes dies
Meta debuts new AI model
Disney plans to cut 1K jobs
TPS termination postponed
Loses appeals court bid
To change eligibility rules?
'Cop & 1/2' screenwriter dies
UW system president fired
Receive 7-game suspension
Army veteran charged
Rex Heuermann pleads guilty
Paramount pres to depart
Doctor found guilty
Indiana suspends gas tax
Hikes checked bag fees
反馈