Prover-Verifier Games (PVGs) are effective for verifiability in nonlinear classification models, but have not been applied to high-dimensional images.
Concept Bottleneck Models (CBMs) can interpret complex data but rely on low-capacity linear predictors.
Neural Concept Verifier (NCV) combines PVGs with concept encodings for interpretable, nonlinear classification in high-dimensional settings.
NCV utilizes concept encodings extracted from raw inputs, outperforming CBMs and pixel-based PVG classifier baselines, showing promise for verifiable AI.