{"id":3753,"date":"2025-05-12T23:02:43","date_gmt":"2025-05-12T23:02:43","guid":{"rendered":"https:\/\/mailitics.com\/index.php\/2025\/05\/12\/openai-leaps-into-health-care-with-ai-benchmark-to-evaluate-models\/"},"modified":"2025-05-12T23:02:43","modified_gmt":"2025-05-12T23:02:43","slug":"openai-leaps-into-health-care-with-ai-benchmark-to-evaluate-models","status":"publish","type":"post","link":"https:\/\/mailitics.com\/index.php\/2025\/05\/12\/openai-leaps-into-health-care-with-ai-benchmark-to-evaluate-models\/","title":{"rendered":"STAT+: OpenAI leaps into health care with AI benchmark to evaluate models"},"content":{"rendered":"<p>    STAT+: OpenAI leaps into health care with AI benchmark to evaluate models<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n    <!-- no image --><br \/>\n \t<BR><br \/>\n<BR><\/BR><\/p>\n<div>\n<p>OpenAI on Monday released a large dataset for evaluating how well large language models answer questions related to health care. Experts lauded the open-source data and detailed evaluation rubrics, calling them \u201cunprecedented\u201d in scale and breadth. <\/p>\n<p>The project, <a href=\"https:\/\/openai.com\/index\/healthbench\/\">HealthBench<\/a>, marks OpenAI\u2019s first foray into health care applications of AI, outside of <a href=\"https:\/\/www.statnews.com\/2024\/11\/12\/openai-chatgpt-health-care-adoption-hospitals-pharma-cancer-care\/\">external partnerships<\/a>.<\/p>\n<p>\u201cOur mission as OpenAI is to ensure AGI is beneficial to humanity,\u201d said Karan Singhal, who leads OpenAI\u2019s health AI team, referring to OpenAI\u2019s goal of developing artificial general intelligence. \u201cOne part of that is building and deploying technology. Another part of it is ensuring that positive applications like health care have a place to flourish and that we do the right work to ensure that the models are safe and reliable in these settings,\u201d he said.<\/p>\n<p><a href=\"https:\/\/www.statnews.com\/2025\/05\/12\/openai-leaps-into-health-care-with-ai-benchmark-to-evaluate-models\/?utm_campaign=rss\">Continue to STAT+ to read the full story\u2026<\/a><\/p>\n<\/div>\n<p> \t<BR><br \/>\n <BR><\/BR><br \/>\n    Brittany Trang<br \/>\n \t<BR><br \/>\n<BR><\/BR><br \/>\n<a href=\"https:\/\/www.statnews.com\/2025\/05\/12\/openai-leaps-into-health-care-with-ai-benchmark-to-evaluate-models\/?utm_campaign=rss\">Go to statnews<\/a><br \/>\n \t<BR><br \/>\n <BR><\/BR><\/p>\n","protected":false},"excerpt":{"rendered":"<p>STAT+: OpenAI leaps into health care with AI benchmark to evaluate models OpenAI on Monday released a large dataset for evaluating how well large language models answer questions related to health care. Experts lauded the open-source data and detailed evaluation rubrics, calling them \u201cunprecedented\u201d in scale and breadth. The project, HealthBench, marks OpenAI\u2019s first foray [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[69,2217,2152],"tags":[2150],"class_list":["post-3753","post","type-post","status-publish","format-standard","hentry","category-artificial-intelligence","category-health-tech","category-stat","tag-statnews"],"_links":{"self":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/3753"}],"collection":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/comments?post=3753"}],"version-history":[{"count":0,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/posts\/3753\/revisions"}],"wp:attachment":[{"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/media?parent=3753"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/categories?post=3753"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mailitics.com\/index.php\/wp-json\/wp\/v2\/tags?post=3753"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}