Verified Explanations for Neural Networks

Abstract

Verified explanations offer a principled way to explain neural network decisions by grounding explainability in formally verified robustness, thereby addressing the inherently black-box nature of these models. We propose two complementary notions of verified explanations. The first is defined over the input space of neural networks, where we introduce hierarchical explanations, termed verifier-optimal robust explanations. These explanations explicitly account for the incompleteness of neural network verifiers, ensuring optimality with respect to the guarantees that can be formally established. The second notion is defined over the network’s internal (latent) representations, capturing explanations in terms of the model’s inner computations. We present novel efficient algorithms to compute both forms of explanations, and showcase their scalability and meaningfulness in practice.

Date

Saturday, July 25, 2026

9:00 AM — 9:30 AM

Event

1st Workshop on Automated Reasoning for Future Space Logistics (AR4Space 2026)

Location

🇵🇹 Lisbon, Portugal

Links

Project Project Project

Whoopie