
AI Safety Evaluations for Human Flourishing - Designing Real-World Governance
AI Safety Evaluations for Human Flourishing - Designing Real-World Governance
AI Safety Evaluations for Human Flourishing - Designing Real-World Governance
AI Safety Evaluations for Human Flourishing - Designing Real-World Governance
February 20, 2026 | 8:00AM to 10:00AM The Imperial Hotel, New Delhi
Co-hosted with Humane Intelligence PBC
PAST EVENT
AI Safety Connect and Humane Intelligence PBC convened a private breakfast dialogue bringing together researchers, policymakers, and practitioners to explore practical, equitable methodologies for evaluating AI systems. The conversation built on three complementary initiatives emerging from the Summit's Safe and Trusted AI Working Group: the Expert Engagement Group on Frontier AI Model Usage Data and Voluntary Commitments; the announced Trust and Safety AI Commons; and the newly launched Global South AI Safety Network, led by Digital Futures Lab and the Centre for Responsible AI at IIT Madras.
The dialogue centred on three questions: what role evaluations play in AI governance, how voluntary safety commitments by frontier developers can be sustained, and how governance frameworks can better reflect Global South priorities and institutional realities.

Welcome & Opening Remarks


Mr. Nicolas Miailhe
Mr. Nicolas Miailhe
Mr. Nicolas Miailhe
Co-Founder, AI Safety Connect

Mr. Cyrus Hodes
Mr. Cyrus Hodes
Mr. Cyrus Hodes
Co-Founder, AI Safety Connect


Dr. Rumman Chowdhury
Dr. Rumman Chowdhury
Dr. Rumman Chowdhury
Founder, Humane Intelligence PBC
Moderator

H.E. Mr. Philip Thigo
Digital Ambassador, Government of Kenya
Framing Provocation 1


Mr. Stephen Clare
Lead Writer, International AI Safety Report
Lead Writer, International AI Safety Report
The first provocation and discussion examined evaluations as a governance mechanism. Participants recognised that evaluations are increasingly foundational to AI governance but face significant challenges: inconsistent methods among developers, high costs, and limited transparency around internal company testing. A key concern was that AI models may increasingly recognise evaluation settings and adapt their behaviour — underperforming during tests or masking capabilities — raising questions about whether current frameworks can keep pace with advancing systems.
The first provocation and discussion examined evaluations as a governance mechanism. Participants recognised that evaluations are increasingly foundational to AI governance but face significant challenges: inconsistent methods among developers, high costs, and limited transparency around internal company testing. A key concern was that AI models may increasingly recognise evaluation settings and adapt their behaviour — underperforming during tests or masking capabilities — raising questions about whether current frameworks can keep pace with advancing systems.
The first provocation and discussion examined evaluations as a governance mechanism. Participants recognised that evaluations are increasingly foundational to AI governance but face significant challenges: inconsistent methods among developers, high costs, and limited transparency around internal company testing. A key concern was that AI models may increasingly recognise evaluation settings and adapt their behaviour — underperforming during tests or masking capabilities — raising questions about whether current frameworks can keep pace with advancing systems.
The first provocation and discussion examined evaluations as a governance mechanism. Participants recognised that evaluations are increasingly foundational to AI governance but face significant challenges: inconsistent methods among developers, high costs, and limited transparency around internal company testing. A key concern was that AI models may increasingly recognise evaluation settings and adapt their behaviour — underperforming during tests or masking capabilities — raising questions about whether current frameworks can keep pace with advancing systems.
Framing Provocation 2


Ms. Kalika Bali
Senior Principal Researcher, Microsoft Research India; Chair, Expert Engagement Group on Frontier AI Model Usage / Voluntary Commitments
Senior Principal Researcher, Microsoft Research India; Chair, Expert Engagement Group on Frontier AI Model Usage / Voluntary Commitments
The second provocation addressed voluntary commitments and institutional sustainability. While commitments made at international summits signal shared norms, participants noted that companies are often hesitant to make public pledges, summits lack continuity mechanisms, and tracking commitments over time remains difficult. The discussion surfaced a tension between responsible AI and commercial incentives, and emphasised that the real audience for safety information is developers adapting models for specific use cases, not end users.
The second provocation addressed voluntary commitments and institutional sustainability. While commitments made at international summits signal shared norms, participants noted that companies are often hesitant to make public pledges, summits lack continuity mechanisms, and tracking commitments over time remains difficult. The discussion surfaced a tension between responsible AI and commercial incentives, and emphasised that the real audience for safety information is developers adapting models for specific use cases, not end users.
The second provocation addressed voluntary commitments and institutional sustainability. While commitments made at international summits signal shared norms, participants noted that companies are often hesitant to make public pledges, summits lack continuity mechanisms, and tracking commitments over time remains difficult. The discussion surfaced a tension between responsible AI and commercial incentives, and emphasised that the real audience for safety information is developers adapting models for specific use cases, not end users.
Framing Provocation 3



Dr. Urvashi Aneja
Founder and Director, Digital Futures Lab
Founder and Director, Digital Futures Lab
The third provocation examined Global South participation in AI safety governance. Participants observed that AI safety debates are often perceived as a Global North agenda, even as Global South countries actively adopt AI technologies. Key gaps include limited local evaluation infrastructure, insufficient government technical capacity, and a lack of resources for safety research. Regional cooperation, civil society participation, and locally contextualised evaluations were proposed as ways forward.
The third provocation examined Global South participation in AI safety governance. Participants observed that AI safety debates are often perceived as a Global North agenda, even as Global South countries actively adopt AI technologies. Key gaps include limited local evaluation infrastructure, insufficient government technical capacity, and a lack of resources for safety research. Regional cooperation, civil society participation, and locally contextualised evaluations were proposed as ways forward.
The third provocation examined Global South participation in AI safety governance. Participants observed that AI safety debates are often perceived as a Global North agenda, even as Global South countries actively adopt AI technologies. Key gaps include limited local evaluation infrastructure, insufficient government technical capacity, and a lack of resources for safety research. Regional cooperation, civil society participation, and locally contextualised evaluations were proposed as ways forward.
The third provocation examined Global South participation in AI safety governance. Participants observed that AI safety debates are often perceived as a Global North agenda, even as Global South countries actively adopt AI technologies. Key gaps include limited local evaluation infrastructure, insufficient government technical capacity, and a lack of resources for safety research. Regional cooperation, civil society participation, and locally contextualised evaluations were proposed as ways forward.
Key Findings

Participants agreed that AI evaluations are rapidly becoming a central governance mechanism, but the infrastructure required to support them — credible third-party evaluators, shared risk taxonomies, scalable benchmarks, and certification mechanisms for evaluation professionals — remains underdeveloped. The lack of shared definitions of AI risks across countries, companies, and evaluators was identified as a major barrier to coordinated governance. Strengthening this ecosystem will require coordinated efforts across governments, industry, research institutions, and civil society.
Participants agreed that AI evaluations are rapidly becoming a central governance mechanism, but the infrastructure required to support them — credible third-party evaluators, shared risk taxonomies, scalable benchmarks, and certification mechanisms for evaluation professionals — remains underdeveloped. The lack of shared definitions of AI risks across countries, companies, and evaluators was identified as a major barrier to coordinated governance. Strengthening this ecosystem will require coordinated efforts across governments, industry, research institutions, and civil society.
Participants agreed that AI evaluations are rapidly becoming a central governance mechanism, but the infrastructure required to support them — credible third-party evaluators, shared risk taxonomies, scalable benchmarks, and certification mechanisms for evaluation professionals — remains underdeveloped. The lack of shared definitions of AI risks across countries, companies, and evaluators was identified as a major barrier to coordinated governance. Strengthening this ecosystem will require coordinated efforts across governments, industry, research institutions, and civil society.

Closing Remarks



H.E. Ambassador Egriselda López
Permanent Representative of El Salvador to the UN


H.E. Ambassador Rein Tammsaar
Permanent Representative of Estonia to the UN
Both ambassadors, co-chairs of the UN Global Dialogue on AI Governance, signaled that the session's findings would inform multilateral governance processes through the rest of 2026. Ambassador López positioned evaluation as one of the most important bridges between governance commitments and implementation. Ambassador Tammsaar encouraged participants to carry their practical insights directly into the UN Global Dialogue in Geneva.
Both ambassadors, co-chairs of the UN Global Dialogue on AI Governance, signaled that the session's findings would inform multilateral governance processes through the rest of 2026. Ambassador López positioned evaluation as one of the most important bridges between governance commitments and implementation. Ambassador Tammsaar encouraged participants to carry their practical insights directly into the UN Global Dialogue in Geneva.
Both ambassadors, co-chairs of the UN Global Dialogue on AI Governance, signaled that the session's findings would inform multilateral governance processes through the rest of 2026. Ambassador López positioned evaluation as one of the most important bridges between governance commitments and implementation. Ambassador Tammsaar encouraged participants to carry their practical insights directly into the UN Global Dialogue in Geneva.
Both ambassadors, co-chairs of the UN Global Dialogue on AI Governance, signaled that the session's findings would inform multilateral governance processes through the rest of 2026. Ambassador López positioned evaluation as one of the most important bridges between governance commitments and implementation. Ambassador Tammsaar encouraged participants to carry their practical insights directly into the UN Global Dialogue in Geneva.

Photo Gallery

Attendees can access the gallery with the link and password provided in your follow-up email.

