Evaluating Artificial Intelligence for National Security and Public Safety

Insights from Frontier Model Evaluation Science Day

Anton Shenk

ResearchPublished Sep 16, 2024

Frontier Model Evaluation Science Day assembled more than 100 leading experts in artificial intelligence (AI), national security, and policy to address the emerging challenges of evaluating threats from advanced AI systems. Participants discussed the intersection of AI with chemical and biological risks; scenarios in which AI systems could operate beyond the intended boundaries set by their developers or users, including AI systems deceiving humans or acting autonomously; risk-agnostic methodological frameworks to strengthen the robustness of model evaluations; and connecting stakeholders in government, industry, and civil society to develop a shared understanding of the objectives of evaluation science. 

These proceedings synthesize insights from these sessions, outline the complexities of evaluating AI for dangerous capabilities, and highlight the collaborative effort required to formulate effective policy. 

Topics

Document Details

Citation

RAND Style Manual
Shenk, Anton, Evaluating Artificial Intelligence for National Security and Public Safety: Insights from Frontier Model Evaluation Science Day, RAND Corporation, CF-A3429-1, 2024. As of September 23, 2024: https://www.rand.org/pubs/conf_proceedings/CFA3429-1.html
Chicago Manual of Style
Shenk, Anton, Evaluating Artificial Intelligence for National Security and Public Safety: Insights from Frontier Model Evaluation Science Day. Santa Monica, CA: RAND Corporation, 2024. https://www.rand.org/pubs/conf_proceedings/CFA3429-1.html.
BibTeX RIS

Research conducted by

Funding for this research was provided by gifts from RAND supporters and income from operations. The research was conducted by the Technology and Security Policy Center within RAND Global and Emerging Risks.

This publication is part of the RAND conference proceeding series. Conference proceedings present a collection of papers delivered at a conference or a summary of the conference.

This document and trademark(s) contained herein are protected by law. This representation of RAND intellectual property is provided for noncommercial use only. Unauthorized posting of this publication online is prohibited; linking directly to this product page is encouraged. Permission is required from RAND to reproduce, or reuse in another form, any of its research documents for commercial purposes. For information on reprint and reuse permissions, please visit www.rand.org/pubs/permissions.

RAND is a nonprofit institution that helps improve policy and decisionmaking through research and analysis. RAND's publications do not necessarily reflect the opinions of its research clients and sponsors.