UAI 2026 Workshop  ·  Amsterdam

2nd Workshop on Safe AI

Towards Trustworthy Agentic AI

Friday, 21 August 2026
Amsterdam, Netherlands
Full-day · In person
We are pleased to announce the 2nd Workshop on Safe AI (SafeAI), held as part of UAI 2026 in Amsterdam. This year's edition focuses on safety challenges in agentic AI — systems that autonomously act, adapt, and interact over extended horizons. The workshop features invited talks, a panel discussion, and contributed papers spanning foundations, robustness, interpretability, and deployment of safe AI systems.

Important dates

All deadlines are end of day Anywhere on Earth (AoE).

Apr 9, 2026 Call for papers released
May 28, 2026 Paper submission deadline
Jun 1–16, 2026 Reviewing period
Jun 18, 2026 Author notification
Jul 2, 2026 Camera-ready deadline
Jul 9, 2026 Program & papers online
Aug 21, 2026 Workshop day — Amsterdam

Call for papers

We invite two categories of submissions:

Review process: Single-blind peer review by the program committee. At least one author must attend in person. There will be no proceedings, so authors are free to submit their work elsewhere. A journal special issue may be considered.

Submit on OpenReview ↗

Topics of interest

We solicit submissions presenting original research, theoretical results, and applied work within the following topics (list not exhaustive).

Foundations for safe and agentic AI

  • Foundations of Safe AI across learning paradigms, including reinforcement and continual learning
  • Sequential decision-making under uncertainty, including Bayesian decision theory and POMDPs
  • Objective specification, reward design, and alignment for agentic systems
  • Test-time scaling, adaptive inference, and safety implications of increasingly capable agents

Uncertainty, robustness, and control

  • Uncertainty quantification, calibration, and propagation in decision-making
  • Risk-sensitive planning and long-horizon safety under uncertainty
  • Robustness to distribution shift, adversarial conditions, and non-stationary environments
  • Safety and control in RL-based agents, including safe exploration and learning-based control

Interpretability, auditing, and evaluation

  • Interpretability and explainability of agent policies, representations, and behaviors
  • Methods to audit, measure, monitor, and evaluate agentic AI systems
  • Benchmarks, metrics, and case studies for safe and aligned agents
  • Adversarial evaluation and stress testing of agentic systems, including red teaming

Engineering, deployment, and applications

  • Agent failure modes: reward hacking, goal misgeneralization, agent hijacking, and emergent behaviors
  • Human-in-the-loop and oversight mechanisms for agentic systems
  • Deployment-time safety of AI in safety-critical and autonomous systems

Invited speakers

Tom Everitt
Tom Everitt Google DeepMind
Niek Tax
Niek Tax Meta

More speakers to be announced.

Panel discussion

In-person speakers and leading researchers from academia and industry will discuss:

Motivation

Ensuring the safety of AI systems has become a foundational challenge as they are increasingly deployed in high-stakes domains and subject to regulatory and societal expectations. Beyond accuracy and performance, Safe AI concerns reliability, robustness, interpretability, and alignment across the full system lifecycle.

This workshop focuses on technical and socio-technical challenges in Safe AI, with particular attention to agentic AI systems — learning-based systems that autonomously select actions, interact with an environment, and pursue objectives over time. In such systems, actions influence data, feedback, and behavior over extended horizons, making safety tightly linked to sequential decision-making, adaptation, and interaction. These challenges place uncertainty, robustness, and interpretability at the center of agentic safety, directly connecting them to foundational questions studied by the UAI community.

Organizers

Mykola Pechenizkiy
Workshop Chair Mykola Pechenizkiy Eindhoven University of Technology, NL
Christos Louizos
Workshop Chair Christos Louizos Qualcomm AI Research, NL
Yali Du
Workshop Chair Yali Du King's College London, UK
Eric Nalisnick
Workshop Chair Eric Nalisnick Johns Hopkins University, US
Emtiyaz Khan
Workshop Chair Emtiyaz Khan RIKEN AIP, Japan
Alvaro H.C. Correia
Discussion Chair Alvaro H.C. Correia Qualcomm AI Research, NL
Dharmesh Tailor
Discussion Chair Dharmesh Tailor Qualcomm AI Research, NL

Program committee

Program committee members will be announced soon.