DOI : 10.17577/IJERTV15IS050984
- Open Access
- Authors : Janice Benita F
- Paper ID : IJERTV15IS050984
- Volume & Issue : Volume 15, Issue 05 , May – 2026
- Published (First Online): 11-05-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
Explainable Deep Learning Framework for Automated Concrete Crack Detection Using ResNet-18
Janice Benita F
Undergraduate Student, Department of Information Technology
St. Josephs College of Engineering (Autonomous)
Chennai, Tamilnadu, India
AbstractConcrete crack inspection is an essential component of structural health monitoring and infrastructure maintenance. Conventional manual inspection methods are time-consuming, subjective, labor-intensive, and difficult to scale for large infrastructure systems. This paper presents an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 convolutional neural network integrated with Grad-CAM explainable artificial intelligence techniques. The proposed framework performs binary classification of cracked and non-cracked concrete surface images while simultaneously generating visual explanations highlighting crack-sensitive regions influencing prediction outcomes. Transfer learning techniques are employed to improve feature extraction efficiency and computational performance. Experimental evaluation demonstrated strong classification performance with an overall accuracy of 95.7%, precision of 95.1%, recall of 96.2%, and F1-score of 95.6%. A deployment-oriented web-based inspection interface was also developed to support automated infrastructure inspection workflows. The proposed framework demonstrates practical applicability for scalable, interpretable, and efficient structural health monitoring systems.
KeywordsConcrete Crack Detection, Deep Learning, Explainable AI, ResNet-18, Grad-CAM, Structural Health Monitoring, Infrastructure Inspection, Transfer Learning.
-
INTRODUCTION
Infrastructure monitoring plays an important role in ensuring the safety, reliability, and longevity of civil engineering structures such as bridges, tunnels, pavements, dams, and buildings. Concrete structures are continuously exposed to environmental stress, fatigue, aging, and mechanical loading conditions that may result in cracks and structural deterioration over time. Early crack detection is essential for preventing severe structural failures and reducing maintenance costs.
Traditional concrete inspection methods mainly rely on manual visual assessment performed by trained inspectors. Although widely practiced, manual inspection procedures are time-consuming, subjective, labor-intensive, and difficult to scale for large infrastructure systems. Variations in human judgment and environmental conditions may also affect inspection reliability and consistency.
Recent advancements in Artificial Intelligence (AI) and computer vision technologies have significantly improved automated crack detection systems [1], [3]. Deep learning models, particularly Convolutional Neural Networks (CNNs), have demonstrated strong image classification and feature extraction capabilities [1]. However, many CNN-based systems operate as black-box models, limiting interpretability in safety-critical engineering applications.
Explainable Artificial Intelligence (XAI) techniques improve prediction transparency by visually identifying image regions contributing to model decisions [2] . Grad-CAM is one of the most widely adopted explainability techniques because it generates the localization heatmaps highlighting crack-sensitive regions influencing CNN predictions [2].
This paper presents an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 model integrated with Grad-CAM explainability visualization. The proposed framework combines crack classification, visual interpretability, and deployment-oriented infrastructure inspection support within a unified system.
-
LITERATURE REVIEW
-
Traditional Crack Detection Methods
Traditional concrete crack detection methods mainly relied on image processing techniques such as threshold segmentation, edge detection, and handcrafted feature extraction approaches [11], [14]. Although computationally simple, these methods often struggled under varying illumination conditions, complex surface textures, shadows, and environmental noise. Manual inspection processes also introduced subjectivity and inconsistency in structural assessment workflows, particularly for large-scale infrastructure systems.
-
Deep Learning and Explainable AI Approaches
Recent advancements in deep learning have significantly improved automated crack detection performance. Convolutional Neural Networks (CNNs) such as AlexNet, VGG16, ResNet, DenseNet, and EfficientNet demonstrated superior feature extraction and classification capability compared to conventional image processing methods [1], [3], [14]. Transfer learning techniques further improved
computational efficiency by utilizing pretrained ImageNet weights for infrastructure inspection tasks [1]. Explainable Artificial Intelligence (XAI) techniques such as Grad-CAM have recently gained attention because they generate visual heatmaps identifying crack-sensitive regions influencing prediction outcomes [2], [6], [15]. Existing studies primarily focus on classification accuracy while providing limited deployment-oriented implementation support. The proposed framework addresses these limitations by integrating explainable AI, deployment-oriented system design, and web-based inspection support within a computationally efficient deep learning framework.
-
-
RESEARCH CONTRIBUTIONS
-
Major Contributions of the Proposed Framework
The major contributions of this research work are summarized as follows:
-
Development of an automated concrete crack detection framework using a fine-tuned ResNet-18 deep learning architecture.
-
Integration of Grad-CAM explainable artificial intelligence techniques for visual interpretation of crack-sensitive image regions influencing prediction outcomes.
-
Implementation of a web-based infrastructure inspection interface to support deployment-oriented structural health monitoring applications.
-
Application of transfer learning techniques to improve feature extraction capability, classification efficiency, and computational performance.
-
Experimental evaluation using multiple performance metrics including accuracy, precision, recall, and F1-score for reliable model assessment.
-
Demonstration of a scalable, interpretable, and computationally efficient AI-assisted infrastructure inspection framework suitable for real-world deployment environments.
-
-
Advantages of the Proposed System
-
The proposed explainable deep learning framework improves automated infrastructure inspection reliability by combining crack classification capability with visual interpretability support. Unlike conventional black-box deep learning systems, the integration of Grad-CAM explainability visualization enables engineers to identify image regions contributing to prediction decisions. This improves prediction transparency, inspection validation capability, and engineering confidence in automated structural assessment workflows.The framework also demonstrates strong deployment capability through the implementation of a web-based inspection interface. The proposed system supports automated crack prediction, heatmap visualization,and scalable inspection workflows for practical structural health monitoring applications.
-
-
-
PROPOSED METHODOLOGY
-
Framework Overview
The proposed explainable deep learning framework consists of five major stages including image acquisition, image preprocessing, crack classification, explainability visualization, and deployment-oriented output generation. The framework integrates a fine-tuned ResNet-18 convolutional neural network with Grad-CAM explainable artificial intelligence techniques to improve automated crack detection reliability and interpretability for structural health monitoring applications.
-
Image Acquisition and Preprocessing
Concrete surface images containing cracked and non-cracked regions were collected from infrastructure inspection environments. The images were resized to dimensions compatible with the ResNet-18 architecture. Several preprocessing techniques were applied before training, including image resizing, normalization, tensor conversion, horizontal flipping, rotation, brightness adjustment, and scaling operations. These preprocessing methods improved feature extraction capability, classification robustness, and model generalization performance while reducing overfitting during training.
-
Crack Classification Using ResNet-18
A pretrained ResNet-18 convolutional neural network was fine-tuned using transfer learning techniques [1]. The final fully connected layer was modified to perform binary classification between crack and non-crack concrete surface images. The Adam optimizer and cross-entropy loss function were utilized during training to improve classification performance and optimization efficiency.
Cross-Entropy Loss Function
(1)
Adam Optimization Update
(2)
-
Explainable AI Using Grad-CAM
Grad-CAM explainability was integrated to improve model interpretability by generating heatmaps identifying crack-sensitive image regions contributing to prediction outcomes [2].
Grad-CAM Formulation
(3)
The generated visual explanations assist engineers in validating model predictions and improving inspection reliability.
-
System Workflow
Input Image
Image Preprocessing
Data Augmentation
ResNet-18 Classification
Grad-CAM Heatmap Generation
Prediction Visualization
Web-Based Output Interface.
-
-
EXPERIMENTAL SETUP
-
Development Environment
TABLE I. DEVELOPMENT ENVIRONMENT
Parameter
Specification
Programming Language
Python
Deep Learning Framework
PyTorch
Computer Vision Library
OpenCV
Web Framework
Flask
Development Environment
Jupyter Notebook / VS Code
Operating System
Windows/Linux
-
Dataset Description
TABLE II. DATA SET DESCRIPTION
Dataset Property
Value
Total Images
10,034
Crack Images
6,498
Non-Crack Images
3,536
Training Images
7,732
Validation Images
689
Testing Images
1,154
Input Resolution
224 × 224
The dataset included diverse concrete surface conditions containing varying crack patterns, textures, illumination conditions, and environmental characteristics to improve classification robustness [4].
-
Training Configuration
TABLE III. TRAINING CONFIGURATION
Parameter
Value
Model Architecture
ResNet-18
Optimizer
Adam
Learning Rate
0.001
Batch Size
32
Epochs
25
Loss Function
Cross-Entropy Loss
Explainability Method
Grad-CAM
-
-
RESULTS AND DISCUSSION
Experimental evaluation demonstrated strong crack classification performance across diverse concrete surface conditions.
-
Performance Metrics
TABLE IV. PERFORMANCE METRICS
Metric
Value
Accuracy
95.70%
Precision
95.10%
Recall
96.20%
F1-Score
95.60%
The ResNet-18 architecture successfully extracted discriminative crack features while maintaining computational efficiency suitable for deployment-oriented applications.
Fig. 1. Proposed Explainable AI-Based Concrete Crack Detection System.
Proposed explainable deep learning framework demonstrating (a) web-based inspection interface, (b) crack detection prediction example, (c) Grad-CAM explainability visualization, and (d) non-crack prediction example.
-
Confusion Matrix Analysis
The confusion matrix analysis demonstrated reliable classification performance for both cracked and non-cracked concrete surfaces within the testing dataset.
Fig. 2. Confusion Matrix of the Proposed ResNet-18 Crack Detection Model.
Confusion matrix representing classification performance of the proposed ResNet-18 crack detection framework on the testing dataset.
-
ROC Curve and AUC Analysis
The Receiver Operating Characteristic (ROC) curve was used to evaluate the discriminative capability of the proposed ResNet-18 crack detection framework. The ROC analysis demonstrated strong classification performance with an Area Under the Curve (AUC) score of 0.974, indicating excellent capability in distinguishing between cracked and non-cracked concrete surface images.
The high AUC value confirms that the proposed framework achieved reliable sensitivity and specificity across multiple classification thresholds. The ROC analysis further validates the robustness and generalization capability of the proposed deep learning model for automated infrastructure inspection applications.
Fig. 3. Receiver Operating Characteristic (ROC) Curve of the Proposed ResNet-18 Framework.
ROC curve demonstrating classification performance of the proposed ResNet-18 crack detection model with an AUC score of 0.974.
-
Explainability Visualization
Grad-CAM explainability heatmaps confirmed that the proposed framework focused primarily on crack-sensitive regions instead of irrelevant background textures [2], [15]. Crack-positive predictions generated strong activation responses around structural crack areas, while non-crack images produced minimal activation behavior.
The explainability module improved:
-
prediction transparency,
-
engineering interpretability,
-
inspection validation support,
-
infrastructure monitoring reliability.
-
-
-
COMPARATIVE DISCUSSION WITH EXISTING STUDIES
Previous studies have demonstrated the effectiveness of deep learning architectures such as VGG16, U-Ne, and conventional Convolutional Neural Networks (CNNs) for automated crack detection applications [3], [11], [14]. Existing literature generally reports classification accuracies ranging from 89% to 93% depending on dataset characteristics and model complexity [3], [14]. In comparison, the proposed ResNet-18 framework achieved 95.7% classification accuracy while additionally incorporating Grad-CAM explainability visualization and deployment-oriented web implementation support. The integration of explainable artificial intelligence techniques improves interpretability, prediction transparency, and practical applicability in infrastructure inspection environments.
TABLE V. COMPARATIVE ANALYSIS WITH EXISTING STUDIES
Study
Model
Reported Accuracy
Explainability
Cha et al. (2017)
CNN
89%
No
Liu et al. (2019)
U-Net
93%
No
Proposed Work
ResNet-18 + Grad-CAM
95.70%
Yes
-
DEPLOYMENT IMPLEMENTATION
A deployment-oriented web-based inspection interface was developed using the Flask framework to support automated infrastructure monitoring workflows. The proposed deployment system enables efficient crack inspection and prediction visualization for real-world civil engineering applications.
The developed framework supports multiple functionalities including image upload, automated crack prediction, Grad-CAM heatmap generation, crack localization, and prediction visualization. The deployment interface improves inspection accessibility and assists engineers in performing scalable structural health assessment procedures.
The proposed deployment framework demonstrates practical applicability for infrastructure inspection environments and may support future integration with advanced technologies such as drone-assisted monitoring systems, edge artificial intelligence platforms, IoT-enabled infrastructure monitoring systems, and mobile-based deployment applications [9], [15].
-
OPERATIONAL IMPACT ANALYSIS
The proposed explainable deep learning framework demonstrates potential operational advantages for infrastructure inspection and structural health monitoring workflows. Automated crack detection systems may reduce dependency on manual inspection procedures while improving inspection consistency, scalability, and computational efficiency across large-scale infrastructure environments [3], [15].
Preliminary deployment-oriented evaluation scenarios indicate potential improvements in inspection duration, workforce requirements, operational efficiency, and inspection consistency. The integration of explainable artificial intelligence techniques further enhances engineering interpretability and inspection validation capability during automated assessment processes.
However, deployment performance may vary depending on infrastructure scale, environmental conditions, image quality variations, and operational workflow configurations.
-
LIMITATIONS AND FUTURE WORK
Although the proposed framework demonstrated strong classification performance and explainability capability, several limitations remain in real-world deployment environments. Variations in illumination conditions, surface texture complexity, environmental noise, and large-scale infrastructure diversity may affect model generalization performance under certain operational scenarios
Future work will focus on extending the framework toward real-time video crack detection, drone-assisted infrastructure inspection, lightweight edge-AI optimization, crack segmentation, crack width estimation, and IoT-enabled monitoring systems [8], [9], [15]. Future explainability studies may also incorporate quantitative explainability evaluation metrics, advanced visualization techniques, and real-time deployment optimization for scalable structural health monitoring applications.
-
CONCLUSION
This paper presented an explainable deep learning framework for automated concrete crack detection using a fine-tuned ResNet-18 architecture integrated with Grad-CAM explainable artificial intelligence techniques. The proposed framework combined deep learning-based crack classification with visual interpretability support to improve infrastructure inspection reliability and transparency.
Experimental evaluation demonstrated strong classification performance with an overall accuracy of 95.7% along with reliable explainability visualization capability. The developed web-based deployment interface further demonstrated practical applicability for scalable infrastructure inspection and structural health monitoring workflows.
The proposed framework provides an efficient, interpretable, and deployment-oriented solution for AI-assisted infrastructure monitoring applications and demonstrates the potential of explainable deep learning techniques in civil engineering inspection systems.
ACKNOWLEDGMENT
The author sincerely thanks Larsen & Toubro Construction for providing the internship opportunity and valuable industrial exposure related to infrastructure inspection and deep learning applications. Special gratitude is extended to Mr. Jayaprakash Vattikundala, Deputy Head Data Science, L&T Divisional Corporate, for his guidance, mentorship, and technical support throughout the internship and research work.
The author also expresses sincere appreciation to Mr. M. Francis Dhanaraj, DGM QA/QC, L&T Construction (WET IC), for his valuable technical insights and domain expertise in identifying concrete cracks and structural defects, which significantly contributed to the practical understanding of infrastructure inspection workflows.
The author further acknowledges the support and encouragement provided by the faculty mentors of St. Josephs College of Engineering, Chennai, during the completion of this research study.
REFERENCES
-
K. He, X. Zhang, S. Ren, and J. Sun, Deep residual learning for image recognition, in Proc. IEEE Conf. Comput. Vis. Pattern Recognit. (CVPR), 2016, pp. 770778.
-
R. R. Selvaraju, M. Cogswell, A. Das, R. Vedantam, D. Parikh, and D. Batra, Grad-CAM: Visual explanations from deep networks via gradient-based localization, in Proc. IEEE Int. Conf. Comput. Vis. (ICCV), 2017, pp. 618626.
-
Y. J. Cha, W. Choi, and O. Büyüköztürk, Deep learning-based crack damage detection using convolutional neural networks, Comput.-Aided Civil Infrastruct. Eng., vol. 32, no. 5, pp. 361378, 2017.
-
S. Dorafshan, R. J. Thomas, and M. Maguire, SDNET2018: An annotated image dataset for non-contact concrete crack detection using deep convolutional neural networks, Data Brief, vol. 21, pp. 16641668, 2018.
-
C. Shorten and T. M. Khoshgoftaar, A survey on image data augmentation for deep learning, J. Big Data, vol. 6, no. 60, pp. 148, 2019.
-
R. A. Swarna, et al., Concrete crack detection and segregation: Explainable AI-based approach, J. Imag., vol. 10, no. 2, pp. 118, 2024.
-
J. Y. Luo and Y. C. Liu, Adaptive and explainable deep learning-based rapid identification of architectural cracks, IEEE Access, vol. 12, pp. 115, 2024.
-
F. Wei, et al., Weakly-supervised segmentation with ensemble
explainable AI for crack detection, Rev. Sci. Instrum., vol. 96, no. 1,
pp. 112, 2025.
-
K. M. S. Asif, et al., UAV-based building surface defect detection with Grad-CAM explainability, in Proc. IEEE Int. Conf., 2025, pp. 16.
-
L. Zhang, F. Yang, Y. D. Zhang, and Y. J. Zhu, Road crack detection using deep convolutional neural network, in Proc. IEEE Int. Conf. Image Process. (ICIP), 2016, pp. 37083712.
-
S. Li, X. Zhao, and G. Zhou, Automatic pixel-level multiple damage detection of concrete structure using fully convolutional network, Comput.-Aided Civil Infrastruct. Eng., vol. 35, no. 12, pp. 115, 2020.
-
P. J. Chun, S. Izumi, and T. Yamane, Automatic detection method of cracks from concrete surface imagery using deep learning-based image processing, Constr. Build. Mater., vol. 264, pp. 113, 2021.
-
Z. Liu, Y. Cao, Y. Wang, and W. Wang, Computer vision-based concrete crack detection using U-Net fully convolutional networks, Autom. Constr., vol. 104, pp. 129139, 2019.
-
F. Jiang, W. Liu, L. Wang, and Y. Wang, Explainable deep learning framework for structural damage detection and infrastructure monitoring, Eng. Appl. Artif. Intell., vol. 124, pp. 114,
S. Fallahy and N. Rezazadeh, Explainable domain adaptation for masonry crack detection, J. Build. Eng., vol. 92, pp. 114, 2025.
