
Evaluating Human-AI Safety: A Framework for Measuring Harmful Capability Uplift

arXiv, March 31, 2026

arXiv:2603.26676v1 Announce Type: cross

Authors: Michelle Vaccaro, Jaeyoon Song, Abdullah Almaatouq, Michiel A. Bakker

Abstract: Current frontier AI safety evaluations emphasize static benchmarks, third-party annotations, and red-teaming. In this position paper, we argue that AI safety research should focus on human-centered evaluations that measure harmful capability uplift: the marginal increase in a user's ability to cause harm with a frontier model beyond what conventional tools already enable. We frame harmful capability uplift as a core AI safety metric, ground it in prior social science research, and provide concrete methodological guidance for systematic measurement. We conclude with actionable steps for developers, researchers, funders, and regulators to make harmful capability uplift evaluation a standard practice.
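The abstract defines uplift as a marginal increase over what conventional tools already enable. One way to read that definition, offered here only as an illustrative sketch and not as the paper's method, is as a difference in mean task scores between a group using a frontier model and a baseline group using conventional tools. The function name `estimate_uplift`, the two-arm design, and the placeholder scores below are all assumptions made for illustration.

```python
from math import sqrt
from statistics import mean, variance


def estimate_uplift(with_model, baseline):
    """Return (difference in mean scores, standard error of that difference).

    with_model: task scores from participants who used the frontier model.
    baseline:   task scores from participants who used conventional tools only.
    Both arms need at least two scores so a sample variance can be computed.
    """
    diff = mean(with_model) - mean(baseline)
    # Standard error for a difference of two independent sample means.
    se = sqrt(variance(with_model) / len(with_model)
              + variance(baseline) / len(baseline))
    return diff, se


# Hypothetical placeholder scores (1 = task completed, 0 = not); not real data.
with_model = [1, 0, 1, 1, 0, 1]
baseline = [0, 0, 1, 0, 1, 0]

uplift, se = estimate_uplift(with_model, baseline)
print(f"estimated uplift: {uplift:.3f} (standard error {se:.3f})")
```

A positive value would suggest the model arm outperformed the baseline arm. Whether that counts as meaningful uplift depends on the study design and the measurement choices the paper discusses, none of which this sketch captures.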

Subjects: Computers and Society (cs.CY); Artificial Intelligence (cs.AI); Human-Computer Interaction (cs.HC)

Cite as: arXiv:2603.26676 [cs.CY]

(or arXiv:2603.26676v1 [cs.CY] for this version)

https://doi.org/10.48550/arXiv.2603.26676

arXiv-issued DOI via DataCite

Submission history

From: Michelle Vaccaro
[v1] Fri, 6 Mar 2026 20:29:20 UTC (4,918 KB)
