Explainable AI for Image Classification using Twin System and Grad-CAM
Introduction
Explainable Artificial Intelligence (XAI) is essential for making machine learning models more interpretable and trustworthy, particularly in opaque or high-stakes domains. This project applied two complementary post-hoc explanation methods to a binary classification task: distinguishing real cat images from AI-generated (fake) ones.
The goals:
- Train a high-performing ResNet-18-based image classifier.
- Explain its predictions using visual and example-based techniques.
XAI Techniques Used
Grad-CAM (Gradient-weighted Class Activation Mapping)
Grad-CAM generates heatmaps showing which regions of an image influence the model’s prediction. It backpropagates the gradient of the target class score to the final convolutional layer, averages those gradients into per-channel weights, and uses the weighted sum of the activation maps as a coarse localization heatmap.
Purpose: Visual explanation of what the model is attending to.
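Below is a minimal sketch of how Grad-CAM can be computed for this setup with PyTorch hooks on ResNet-18’s `layer4` block. The `model` object and variable names are illustrative assumptions, not the project’s actual code.

```python
import torch
import torch.nn.functional as F
from torchvision import models

# Assumed setup: a fine-tuned 2-class ResNet-18 (stand-in for the project's checkpoint).
model = models.resnet18(weights=None)
model.fc = torch.nn.Linear(model.fc.in_features, 2)
model.eval()

activations, gradients = {}, {}

def save_activation(module, inputs, output):
    activations["value"] = output.detach()

def save_gradient(module, grad_input, grad_output):
    gradients["value"] = grad_output[0].detach()

# Hook the final convolutional block of ResNet-18.
model.layer4.register_forward_hook(save_activation)
model.layer4.register_full_backward_hook(save_gradient)

def grad_cam(image):                                   # image: (1, 3, 224, 224)
    logits = model(image)
    target = logits.argmax(dim=1)
    model.zero_grad()
    logits[0, target].backward()

    acts = activations["value"]                        # (1, 512, 7, 7)
    grads = gradients["value"]                         # (1, 512, 7, 7)
    weights = grads.mean(dim=(2, 3), keepdim=True)     # global-average-pooled gradients
    cam = F.relu((weights * acts).sum(dim=1, keepdim=True))   # (1, 1, 7, 7)
    cam = F.interpolate(cam, size=image.shape[-2:], mode="bilinear", align_corners=False)
    cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)
    return cam.squeeze()                               # heatmap in [0, 1], same size as input
```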
Twin System (Embedding Similarity via Case-Based Reasoning)
This method explains predictions by retrieving visually similar images from the training set. Embeddings are extracted from the penultimate layer of ResNet-18, and cosine similarity is used to find the top matches.
Purpose: Intuitive justification by referencing known cases.
Inspired by: “This Looks Like That: Deep Learning for Interpretable Image Recognition” (Chen et al., 2018)
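A minimal sketch of the retrieval step, assuming 512-d penultimate-layer embeddings have already been computed for the training set; `train_embeddings` and the helper names are placeholders, not the project’s code.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

# Penultimate-layer feature extractor: ResNet-18 with the classification head removed.
backbone = models.resnet18(weights=None)
backbone.fc = nn.Identity()            # forward() now returns 512-d embeddings
backbone.eval()

@torch.no_grad()
def embed(images):                     # images: (N, 3, 224, 224)
    return backbone(images)            # (N, 512)

def top_k_twins(query_image, train_embeddings, k=3):
    """Indices of the k most similar training images by cosine similarity."""
    # In the Twin System, candidates are first restricted to the same predicted class.
    q = embed(query_image.unsqueeze(0))                     # (1, 512)
    sims = F.cosine_similarity(q, train_embeddings, dim=1)  # (N_train,)
    return sims.topk(k).indices
```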
Dataset
- Total Images: 300
  - 150 real cat images from public datasets
  - 150 fake cat images generated with the google/ddpm-cat-256 diffusion model
- Preprocessing: Resized to 224x224, normalized (mean=0.5, std=0.5)
- Split:
  - Train: 100 real + 100 fake
  - Validation: 50 real + 50 fake
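A sketch of this preprocessing and split with torchvision; the `data/real` and `data/fake` folder layout is an assumption.

```python
from torchvision import datasets, transforms
from torch.utils.data import DataLoader, random_split

# Resize to 224x224 and normalize each channel with mean=0.5, std=0.5.
transform = transforms.Compose([
    transforms.Resize((224, 224)),
    transforms.ToTensor(),
    transforms.Normalize(mean=[0.5, 0.5, 0.5], std=[0.5, 0.5, 0.5]),
])

# Assumed layout: data/fake/... and data/real/... (ImageFolder assigns fake=0, real=1).
dataset = datasets.ImageFolder("data", transform=transform)

# 200 training / 100 validation images; note random_split does not stratify,
# so the exact 100+100 / 50+50 per-class split may have been done manually.
train_set, val_set = random_split(dataset, [200, 100])
train_loader = DataLoader(train_set, batch_size=32, shuffle=True)
val_loader = DataLoader(val_set, batch_size=32)
```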
Model Architecture
- Base Model: Pretrained ResNet-18
- Final Layer: Modified for 2-class output
- Training Setup:
  - Optimizer: Adam (lr=1e-4)
  - Loss: CrossEntropyLoss
  - Epochs: 10
  - Batch Size: 32
Final Validation Accuracy: 91%
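A sketch of the training setup above; `train_loader` and `val_loader` are assumed from the dataset sketch.

```python
import torch
import torch.nn as nn
from torchvision import models

device = "cuda" if torch.cuda.is_available() else "cpu"

# Pretrained ResNet-18 with the final layer replaced for 2-class output.
model = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
model.fc = nn.Linear(model.fc.in_features, 2)
model = model.to(device)

criterion = nn.CrossEntropyLoss()
optimizer = torch.optim.Adam(model.parameters(), lr=1e-4)

for epoch in range(10):
    model.train()
    for images, labels in train_loader:
        images, labels = images.to(device), labels.to(device)
        optimizer.zero_grad()
        loss = criterion(model(images), labels)
        loss.backward()
        optimizer.step()

    # Track validation accuracy each epoch.
    model.eval()
    correct = total = 0
    with torch.no_grad():
        for images, labels in val_loader:
            preds = model(images.to(device)).argmax(dim=1)
            correct += (preds == labels.to(device)).sum().item()
            total += labels.size(0)
    print(f"epoch {epoch + 1}: val acc = {correct / total:.2%}")
```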
Evaluation Metrics
Metric | Value |
---|---|
Accuracy | 91% |
Precision (Real) | 0.94 |
Recall (Real) | 0.88 |
Precision (Fake) | 0.89 |
Recall (Fake) | 0.94 |
F1 Score (Overall) | 0.91 |
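Metrics of this kind can be reproduced with scikit-learn’s `classification_report`; the snippet below assumes the `model`, `val_loader`, and `device` objects from the earlier sketches and the ImageFolder label convention (0 = fake, 1 = real).

```python
import torch
from sklearn.metrics import classification_report

# Collect validation labels and predictions (assumed convention: 0 = fake, 1 = real).
y_true, y_pred = [], []
model.eval()
with torch.no_grad():
    for images, labels in val_loader:
        preds = model(images.to(device)).argmax(dim=1).cpu()
        y_true.extend(labels.tolist())
        y_pred.extend(preds.tolist())

# Per-class precision/recall and overall accuracy/F1, as in the table above.
print(classification_report(y_true, y_pred, target_names=["fake", "real"], digits=2))
```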
Grad-CAM Results
Sample saliency visualizations show which parts of the input image the model focused on:
Key insight: the model focuses on fur texture, eyes, and facial shape when classifying.
Twin System Results
The Twin System retrieves the most similar training samples (with the same predicted class) based on ResNet-18 embeddings:
Misclassification Analysis
Error Type | Sample IDs |
---|---|
Real → Fake (FN) | 13, 18, 22, 34, 40, 44 |
Fake → Real (FP) | 57, 77, 80 |
Grad-CAM and Twin visualizations revealed blur and atypical poses as key contributors to misclassification.
Conclusion
This project combined two explainability approaches to enhance understanding of model behavior.
Method | Explanation Type | Contribution |
---|---|---|
Grad-CAM | Visual (pixel) | Shows where the model looks |
Twin System | Example-based | Shows why via similar cases |
Multi-view XAI builds both trust in and insight into deep learning models.
Future Work
- Add counterfactual examples (nearest neighbour from the opposite class; see the sketch after this list)
- Use CLIP embeddings for better semantic similarity
- Improve Twin system via ProtoPNet architecture
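A possible starting point for the counterfactual idea, reusing the Twin System embeddings to retrieve the nearest training example of the opposite class; all names follow the earlier sketches and are assumptions.

```python
import torch
import torch.nn.functional as F

def nearest_counterfactual(query_image, train_embeddings, train_labels, predicted_class):
    """Index of the closest training image belonging to the *other* class."""
    q = embed(query_image.unsqueeze(0))                     # (1, 512), from the Twin sketch
    sims = F.cosine_similarity(q, train_embeddings, dim=1)  # (N_train,)
    sims[train_labels == predicted_class] = float("-inf")   # mask out same-class images
    return sims.argmax().item()
```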
ProtoPNet Attempt
- Backbone: ResNet-18
- Added 10 learnable prototypes per class
- Goal: Learn and match local image regions
Validation Accuracy: 50%
Problem: Overfit to “real” class due to prototype imbalance
Learned Prototypes:
Although accuracy was low, the model successfully:
- Learned and projected prototypes
- Visualized most activating examples
- Demonstrated potential for local-region interpretability
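For reference, a simplified sketch of the prototype-layer idea behind this attempt (ResNet-18 feature map, 10 prototypes per class, ProtoPNet-style log-distance similarity); it omits prototype projection and the cluster/separation losses of the full architecture, so it is an illustration rather than the model trained here.

```python
import torch
import torch.nn as nn
from torchvision import models

class ProtoLayer(nn.Module):
    """10 prototypes per class, compared against every spatial position of the feature map."""
    def __init__(self, n_classes=2, protos_per_class=10, dim=512):
        super().__init__()
        self.prototypes = nn.Parameter(torch.randn(n_classes * protos_per_class, dim))
        self.fc = nn.Linear(n_classes * protos_per_class, n_classes, bias=False)

    def forward(self, features):                            # features: (B, 512, 7, 7)
        B, C, H, W = features.shape
        patches = features.permute(0, 2, 3, 1).reshape(B, H * W, C)     # (B, 49, 512)
        dists = torch.cdist(patches, self.prototypes.unsqueeze(0)) ** 2 # squared L2 distances
        sims = torch.log((dists + 1) / (dists + 1e-4))                  # ProtoPNet similarity
        scores = sims.max(dim=1).values                     # best-matching patch per prototype
        return self.fc(scores)                              # class logits

# ResNet-18 convolutional backbone (avgpool and fc removed) feeding the prototype layer.
backbone = nn.Sequential(*list(models.resnet18(weights=None).children())[:-2])
logits = ProtoLayer()(backbone(torch.randn(1, 3, 224, 224)))            # (1, 2)
```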