Explainable Deep Learning Framework for Automated Concrete Crack Detection Using ResNet-18

doi:10.17577/IJERTV15IS050984

Volume 15, Issue 05 (May 2026)

Explainable Deep Learning Framework for Automated Concrete Crack Detection Using ResNet-18

DOI : 10.17577/IJERTV15IS050984

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 0
Authors : Janice Benita F
Paper ID : IJERTV15IS050984
Volume & Issue : Volume 15, Issue 05 , May – 2026
Published (First Online): 11-05-2026
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Explainable Deep Learning Framework for Automated Concrete Crack Detection Using ResNet-18

Janice Benita F

Undergraduate Student, Department of Information Technology

St. Josephs College of Engineering (Autonomous)

Chennai, Tamilnadu, India

AbstractConcrete crack inspection is an essential component of structural health monitoring and infrastructure maintenance. Conventional manual inspection methods are time-consuming, subjective, labor-intensive, and difficult to scale for large infrastructure systems. This paper presents an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 convolutional neural network integrated with Grad-CAM explainable artificial intelligence techniques. The proposed framework performs binary classification of cracked and non-cracked concrete surface images while simultaneously generating visual explanations highlighting crack-sensitive regions influencing prediction outcomes. Transfer learning techniques are employed to improve feature extraction efficiency and computational performance. Experimental evaluation demonstrated strong classification performance with an overall accuracy of 95.7%, precision of 95.1%, recall of 96.2%, and F1-score of 95.6%. A deployment-oriented web-based inspection interface was also developed to support automated infrastructure inspection workflows. The proposed framework demonstrates practical applicability for scalable, interpretable, and efficient structural health monitoring systems.

KeywordsConcrete Crack Detection, Deep Learning, Explainable AI, ResNet-18, Grad-CAM, Structural Health Monitoring, Infrastructure Inspection, Transfer Learning.

INTRODUCTION

Infrastructure monitoring plays an important role in ensuring the safety, reliability, and longevity of civil engineering structures such as bridges, tunnels, pavements, dams, and buildings. Concrete structures are continuously exposed to environmental stress, fatigue, aging, and mechanical loading conditions that may result in cracks and structural deterioration over time. Early crack detection is essential for preventing severe structural failures and reducing maintenance costs.

Traditional concrete inspection methods mainly rely on manual visual assessment performed by trained inspectors. Although widely practiced, manual inspection procedures are time-consuming, subjective, labor-intensive, and difficult to scale for large infrastructure systems. Variations in human judgment and environmental conditions may also affect inspection reliability and consistency.

Recent advancements in Artificial Intelligence (AI) and computer vision technologies have significantly improved automated crack detection systems [1], [3]. Deep learning models, particularly Convolutional Neural Networks (CNNs), have demonstrated strong image classification and feature extraction capabilities [1]. However, many CNN-based systems operate as black-box models, limiting interpretability in safety-critical engineering applications.

Explainable Artificial Intelligence (XAI) techniques improve prediction transparency by visually identifying image regions contributing to model decisions [2] . Grad-CAM is one of the most widely adopted explainability techniques because it generates the localization heatmaps highlighting crack-sensitive regions influencing CNN predictions [2].

This paper presents an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 model integrated with Grad-CAM explainability visualization. The proposed framework combines crack classification, visual interpretability, and deployment-oriented infrastructure inspection support within a unified system.
LITERATURE REVIEW
1. Traditional Crack Detection Methods
  
  Traditional concrete crack detection methods mainly relied on image processing techniques such as threshold segmentation, edge detection, and handcrafted feature extraction approaches [11], [14]. Although computationally simple, these methods often struggled under varying illumination conditions, complex surface textures, shadows, and environmental noise. Manual inspection processes also introduced subjectivity and inconsistency in structural assessment workflows, particularly for large-scale infrastructure systems.
2. Deep Learning and Explainable AI Approaches
Recent advancements in deep learning have significantly improved automated crack detection performance. Convolutional Neural Networks (CNNs) such as AlexNet, VGG16, ResNet, DenseNet, and EfficientNet demonstrated superior feature extraction and classification capability compared to conventional image processing methods [1], [3], [14]. Transfer learning techniques further improved

computational efficiency by utilizing pretrained ImageNet weights for infrastructure inspection tasks [1]. Explainable Artificial Intelligence (XAI) techniques such as Grad-CAM have recently gained attention because they generate visual heatmaps identifying crack-sensitive regions influencing prediction outcomes [2], [6], [15]. Existing studies primarily focus on classification accuracy while providing limited deployment-oriented implementation support. The proposed framework addresses these limitations by integrating explainable AI, deployment-oriented system design, and web-based inspection support within a computationally efficient deep learning framework.
RESEARCH CONTRIBUTIONS
1. Major Contributions of the Proposed Framework
  
  The major contributions of this research work are summarized as follows:
  - Development of an automated concrete crack detection framework using a fine-tuned ResNet-18 deep learning architecture.
  - Integration of Grad-CAM explainable artificial intelligence techniques for visual interpretation of crack-sensitive image regions influencing prediction outcomes.
  - Implementation of a web-based infrastructure inspection interface to support deployment-oriented structural health monitoring applications.
  - Application of transfer learning techniques to improve feature extraction capability, classification efficiency, and computational performance.
  - Experimental evaluation using multiple performance metrics including accuracy, precision, recall, and F1-score for reliable model assessment.
  - Demonstration of a scalable, interpretable, and computationally efficient AI-assisted infrastructure inspection framework suitable for real-world deployment environments.
2. Advantages of the Proposed System
  - The proposed explainable deep learning framework improves automated infrastructure inspection reliability by combining crack classification capability with visual interpretability support. Unlike conventional black-box deep learning systems, the integration of Grad-CAM explainability visualization enables engineers to identify image regions contributing to prediction decisions. This improves prediction transparency, inspection validation capability, and engineering confidence in automated structural assessment workflows.The framework also demonstrates strong deployment capability through the implementation of a web-based inspection interface. The proposed system supports automated crack prediction, heatmap visualization,and scalable inspection workflows for practical structural health monitoring applications.
PROPOSED METHODOLOGY
1. Framework Overview
  
  The proposed explainable deep learning framework consists of five major stages including image acquisition, image preprocessing, crack classification, explainability visualization, and deployment-oriented output generation. The framework integrates a fine-tuned ResNet-18 convolutional neural network with Grad-CAM explainable artificial intelligence techniques to improve automated crack detection reliability and interpretability for structural health monitoring applications.
2. Image Acquisition and Preprocessing
  
  Concrete surface images containing cracked and non-cracked regions were collected from infrastructure inspection environments. The images were resized to dimensions compatible with the ResNet-18 architecture. Several preprocessing techniques were applied before training, including image resizing, normalization, tensor conversion, horizontal flipping, rotation, brightness adjustment, and scaling operations. These preprocessing methods improved feature extraction capability, classification robustness, and model generalization performance while reducing overfitting during training.
3. Crack Classification Using ResNet-18
  
  A pretrained ResNet-18 convolutional neural network was fine-tuned using transfer learning techniques [1]. The final fully connected layer was modified to perform binary classification between crack and non-crack concrete surface images. The Adam optimizer and cross-entropy loss function were utilized during training to improve classification performance and optimization efficiency.
  
  Cross-Entropy Loss Function
  
  (1)
  
  Adam Optimization Update
  
  (2)
4. Explainable AI Using Grad-CAM
  
  Grad-CAM explainability was integrated to improve model interpretability by generating heatmaps identifying crack-sensitive image regions contributing to prediction outcomes [2].
  
  Grad-CAM Formulation
  
  (3)
  
  The generated visual explanations assist engineers in validating model predictions and improving inspection reliability.
5. System Workflow
Input Image

Image Preprocessing

Data Augmentation

ResNet-18 Classification

Grad-CAM Heatmap Generation

Prediction Visualization

Web-Based Output Interface.

EXPERIMENTAL SETUP

Development Environment

TABLE I. DEVELOPMENT ENVIRONMENT

Parameter	Specification
Programming Language	Python
Deep Learning Framework	PyTorch
Computer Vision Library	OpenCV
Web Framework	Flask
Development Environment	Jupyter Notebook / VS Code
Operating System	Windows/Linux

Dataset Description

TABLE II. DATA SET DESCRIPTION

Dataset Property	Value
Total Images	10,034
Crack Images	6,498
Non-Crack Images	3,536
Training Images	7,732
Validation Images	689
Testing Images	1,154
Input Resolution	224 × 224

The dataset included diverse concrete surface conditions containing varying crack patterns, textures, illumination conditions, and environmental characteristics to improve classification robustness [4].

Training Configuration

TABLE III. TRAINING CONFIGURATION

Parameter	Value
Model Architecture	ResNet-18
Optimizer	Adam
Learning Rate	0.001
Batch Size	32
Epochs	25
Loss Function	Cross-Entropy Loss
Explainability Method	Grad-CAM

RESULTS AND DISCUSSION

Experimental evaluation demonstrated strong crack classification performance across diverse concrete surface conditions.
1. Performance Metrics
  
  TABLE IV. PERFORMANCE METRICS
  
  Metric
  
  Value
  
  Accuracy
  
  95.70%
  
  Precision
  
  95.10%
  
  Recall
  
  96.20%
  
  F1-Score
  
  95.60%
  
  The ResNet-18 architecture successfully extracted discriminative crack features while maintaining computational efficiency suitable for deployment-oriented applications.
  
  Fig. 1. Proposed Explainable AI-Based Concrete Crack Detection System.
  
  Proposed explainable deep learning framework demonstrating (a) web-based inspection interface, (b) crack detection prediction example, (c) Grad-CAM explainability visualization, and (d) non-crack prediction example.
2. Confusion Matrix Analysis
  
  The confusion matrix analysis demonstrated reliable classification performance for both cracked and non-cracked concrete surfaces within the testing dataset.
  
  Fig. 2. Confusion Matrix of the Proposed ResNet-18 Crack Detection Model.
  
  Confusion matrix representing classification performance of the proposed ResNet-18 crack detection framework on the testing dataset.
3. ROC Curve and AUC Analysis
  
  The Receiver Operating Characteristic (ROC) curve was used to evaluate the discriminative capability of the proposed ResNet-18 crack detection framework. The ROC analysis demonstrated strong classification performance with an Area Under the Curve (AUC) score of 0.974, indicating excellent capability in distinguishing between cracked and non-cracked concrete surface images.
  
  The high AUC value confirms that the proposed framework achieved reliable sensitivity and specificity across multiple classification thresholds. The ROC analysis further validates the robustness and generalization capability of the proposed deep learning model for automated infrastructure inspection applications.
  
  Fig. 3. Receiver Operating Characteristic (ROC) Curve of the Proposed ResNet-18 Framework.
  
  ROC curve demonstrating classification performance of the proposed ResNet-18 crack detection model with an AUC score of 0.974.
4. Explainability Visualization
  
  Grad-CAM explainability heatmaps confirmed that the proposed framework focused primarily on crack-sensitive regions instead of irrelevant background textures [2], [15]. Crack-positive predictions generated strong activation responses around structural crack areas, while non-crack images produced minimal activation behavior.
  
  The explainability module improved:
  - prediction transparency,
  - engineering interpretability,
  - inspection validation support,
  - infrastructure monitoring reliability.
COMPARATIVE DISCUSSION WITH EXISTING STUDIES

Previous studies have demonstrated the effectiveness of deep learning architectures such as VGG16, U-Ne, and conventional Convolutional Neural Networks (CNNs) for automated crack detection applications [3], [11], [14]. Existing literature generally reports classification accuracies ranging from 89% to 93% depending on dataset characteristics and model complexity [3], [14]. In comparison, the proposed ResNet-18 framework achieved 95.7% classification accuracy while additionally incorporating Grad-CAM explainability visualization and deployment-oriented web implementation support. The integration of explainable artificial intelligence techniques improves interpretability, prediction transparency, and practical applicability in infrastructure inspection environments.

TABLE V. COMPARATIVE ANALYSIS WITH EXISTING STUDIES

Study

Model

Reported Accuracy

Explainability

Cha et al. (2017)

CNN

89%

No

Liu et al. (2019)

U-Net

93%

No

Proposed Work

ResNet-18 + Grad-CAM

95.70%

Yes
DEPLOYMENT IMPLEMENTATION

A deployment-oriented web-based inspection interface was developed using the Flask framework to support automated infrastructure monitoring workflows. The proposed deployment system enables efficient crack inspection and prediction visualization for real-world civil engineering applications.

The developed framework supports multiple functionalities including image upload, automated crack prediction, Grad-CAM heatmap generation, crack localization, and prediction visualization. The deployment interface improves inspection accessibility and assists engineers in performing scalable structural health assessment procedures.

The proposed deployment framework demonstrates practical applicability for infrastructure inspection environments and may support future integration with advanced technologies such as drone-assisted monitoring systems, edge artificial intelligence platforms, IoT-enabled infrastructure monitoring systems, and mobile-based deployment applications [9], [15].
OPERATIONAL IMPACT ANALYSIS

The proposed explainable deep learning framework demonstrates potential operational advantages for infrastructure inspection and structural health monitoring workflows. Automated crack detection systems may reduce dependency on manual inspection procedures while improving inspection consistency, scalability, and computational efficiency across large-scale infrastructure environments [3], [15].

Preliminary deployment-oriented evaluation scenarios indicate potential improvements in inspection duration, workforce requirements, operational efficiency, and inspection consistency. The integration of explainable artificial intelligence techniques further enhances engineering interpretability and inspection validation capability during automated assessment processes.

However, deployment performance may vary depending on infrastructure scale, environmental conditions, image quality variations, and operational workflow configurations.
LIMITATIONS AND FUTURE WORK

Although the proposed framework demonstrated strong classification performance and explainability capability, several limitations remain in real-world deployment environments. Variations in illumination conditions, surface texture complexity, environmental noise, and large-scale infrastructure diversity may affect model generalization performance under certain operational scenarios

Future work will focus on extending the framework toward real-time video crack detection, drone-assisted infrastructure inspection, lightweight edge-AI optimization, crack segmentation, crack width estimation, and IoT-enabled monitoring systems [8], [9], [15]. Future explainability studies may also incorporate quantitative explainability evaluation metrics, advanced visualization techniques, and real-time deployment optimization for scalable structural health monitoring applications.
CONCLUSION

This paper presented an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 architecture integrated with Grad-CAM explainable artificial intelligence techniques. The proposed framework combined deep learning-based crack classification with visual interpretability support to improve infrastructure inspection reliability and transparency.

Experimental evaluation demonstrated strong classification performance with an overall accuracy of 95.7% along with reliable explainability visualization capability. The developed web-based deployment interface further demonstrated practical applicability for scalable infrastructure inspection and structural health monitoring workflows.

The proposed framework provides an efficient, interpretable, and deployment-oriented solution for AI-assisted infrastructure monitoring applications and demonstrates the potential of explainable deep learning techniques in civil engineering inspection systems.

ACKNOWLEDGMENT

The author sincerely thanks Larsen & Toubro Construction for providing the internship opportunity and valuable industrial exposure related to infrastructure inspection and deep learning applications. Special gratitude is extended to Mr. Jayaprakash Vattikundala, Deputy Head Data Science, L&T Divisional Corporate, for his guidance, mentorship, and technical support throughout the internship and research work.

The author also expresses sincere appreciation to Mr. M. Francis Dhanaraj, DGM QA/QC, L&T Construction (WET IC), for his valuable technical insights and domain expertise in identifying concrete cracks and structural defects, which significantly contributed to the practical understanding of infrastructure inspection workflows.

The author further acknowledges the support and encouragement provided by the faculty mentors of St. Josephs College of Engineering, Chennai, during the completion of this research study.

REFERENCES

K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 770778.
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, Grad-CAM: Visual explanations from deep networks via gradient-based localization, in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), 2017, pp. 618626.
Y. J. Cha, W. Choi, and O. Büyüköztürk, Deep learning-based crack damage detection using convolutional neural networks, Comput.-Aided Civil Infrastruct. Eng., vol. 32, no. 5, pp. 361378, 2017.
S. Dorafshan, R. J. Thomas, and M. Maguire, SDNET2018: An annotated image dataset for non-contact concrete crack detection using deep convolutional neural networks, Data Brief, vol. 21, pp. 16641668, 2018.
C. Shorten and T. M. Khoshgoftaar, A survey on image data augmentation for deep learning, J. Big Data, vol. 6, no. 60, pp. 148, 2019.
R. A. Swarna, et al., Concrete crack detection and segregation: Explainable AI-based approach, J. Imag., vol. 10, no. 2, pp. 118, 2024.
J. Y. Luo and Y. C. Liu, Adaptive and explainable deep learning-based rapid identification of architectural cracks, IEEE Access, vol. 12, pp. 115, 2024.
F. Wei, et al., Weakly-supervised segmentation with ensemble

explainable AI for crack detection, Rev. Sci. Instrum., vol. 96, no. 1,

pp. 112, 2025.
K. M. S. Asif, et al., UAV-based building surface defect detection with Grad-CAM explainability, in Proc. IEEE Int. Conf., 2025, pp. 16.

S. Fallahy and N. Rezazadeh, Explainable domain adaptation for masonry crack detection, J. Build. Eng., vol. 92, pp. 114, 2025.

L. Zhang, F. Yang, Y. D. Zhang, and Y. J. Zhu, Road crack detection using deep convolutional neural network, in Proc. IEEE Int. Conf. Image Process. (ICIP), 2016, pp. 37083712.
S. Li, X. Zhao, and G. Zhou, Automatic pixel-level multiple damage detection of concrete structure using fully convolutional network, Comput.-Aided Civil Infrastruct. Eng., vol. 35, no. 12, pp. 115, 2020.
P. J. Chun, S. Izumi, and T. Yamane, Automatic detection method of cracks from concrete surface imagery using deep learning-based image processing, Constr. Build. Mater., vol. 264, pp. 113, 2021.
Z. Liu, Y. Cao, Y. Wang, and W. Wang, Computer vision-based concrete crack detection using U-Net fully convolutional networks, Autom. Constr., vol. 104, pp. 129139, 2019.
F. Jiang, W. Liu, L. Wang, and Y. Wang, Explainable deep learning framework for structural damage detection and infrastructure monitoring, Eng. Appl. Artif. Intell., vol. 124, pp. 114,

Metric	Value
Accuracy	95.70%
Precision	95.10%
Recall	96.20%
F1-Score	95.60%

Study	Model	Reported Accuracy	Explainability
Cha et al. (2017)	CNN	89%	No
Liu et al. (2019)	U-Net	93%	No
Proposed Work	ResNet-18 + Grad-CAM	95.70%	Yes