We're excited to announce that the U.S. Al Safety Institute (US AISI) has officially joined the UK AISI Agent Red-Teaming Challenge as a co-judge.
We're excited to announce that the U.S. Al Safety Institute (US AISI) has officially joined the UK AISI Agent Red-Teaming Challenge as a co-judge. Alongside the UK AISI, US AISI will help evaluate submissions focused on Al agent failures, instruction bypass, misuse risk, and over-refusals-helping ensure the challenge maintains the highest standards of fairness and transparency.
A Global, Multi-Stakeholder Effort
This challenge is now supported by some of the most influential organizations in Al:
The prize pool has grown to $170,000, making this the largest Al red-teaming challenge of its kind.
What Is the UK AISI Agent Red-Teaming Challenge?
The challenge tasks participants with identifying vulnerabilities in anonymous Al agents-testing their ability to:
Participants use both direct and indirect exploit techniques, simulating the kinds of threats real-world agents may face in production.
We're currently in Wave 4, the final phase of the month-long challenge. New behaviors have been introduced, and submissions remain open through April 6.