DOI : 10.5281/zenodo.20917832
- Open Access

- Authors : Prof Shaikh Mohd.Samir, Kalpesh Bondre, Onkar Dhumal, Rahul Jadhav, Sandesh Bagal
- Paper ID : IJERTV15IS061077
- Volume & Issue : Volume 15, Issue 06 , June – 2026
- Published (First Online): 26-06-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
Retinal Diseases Classification using Deep Learning
Prof Shaikh Mohd.Samir (1), Kalpesh Bondre (2), Onkar Dhumal (3), Rahul Jadhav (4), Sandesh Bagal (5)
(1) Professor, Dept of Computer Engineering, JSPMs Jayawantrao Sawant College of Engineering Pune
(2,3,4,5) Student, Dept of Computer Engineering, JSPMs Jayawantrao Sawant College of Engineering Pune
Abstract:
Retinal diseases, encompassing a wide spectrum of ocular disorders, pose significant threats to vision and overall eye health. Early and accurate diagnosis is crucial for timely intervention and treatment planning. In recent years, the integration of deep learning techniques into medical imaging has revolutionized the field of ophthalmology. This study explores the application of deep learning algorithms for the classification of retinal diseases using high-resolution retinal images. This study presents a comprehensive and innovative framework for retinal disease classification using deep learning, addressing critical technical challenges and ethical considerations. The developed model represents a significant step towards enhancing the accuracy and efficiency of retinal disease diagnosis, ultimately improving patient outcomes and the quality of eye care..
Keywords : Retinal movement, disease discrimination, deep learning, CNN classifier.
-
INTRODUCTION
Retinal disease detection is a classical classification problem for machine learning. The present work solves the problem by automating the detection of diseases from their corresponding scanned optical coherence tomography (OCT) images of the retina among 4 different classes of retinal states (including CNV, drusen, DME, and normal). As the use of imaging technique, optical coherence tomography (OCT) is increasing day by day in medical science, a computer assisted diagnosis system could be very successful and reliable in the automatic detection of retinal diseases and will help in their treatment and monitoring. In case of conventional methodologies, the process of automated retinal disease diagnosis comprises a large amount of image pre-processing, which will then be fed into a shallow neural network, which is a relatively time-consuming process. Because of these limitations of shallow networks, we will be implementing transfer learning techniques.
Motivation:
The motivation behind this research is to improve early detection and diagnosis of retinal diseases, which are major causes of vision loss worldwide. Traditional methods can be slow, error-prone, and reliant on specialized expertise that isnt always accessible. By leveraging deep learning, particularly convolutional neural networks (CNNs) and transfer learning, we aim to create efficient and accurate diagnostic tools. These AI-driven systems can bridge gaps in healthcare access, enabling early intervention and reducing the strain on medical professionals. Ultimately, this work seeks to make eye care more affordable, reliable, and widely available, helping to prevent avoidable blindness and improve quality of life.
-
LITERATURE SURVEY
-
Deep Learning for Diabetic Retinopathy Detection
Gulshan et al. (2016) presented a pioneering study on a deep learning model for detecting DR using retinal fundus images. Their convolutional neural network (CNN) model demonstrated performance comparable to that of human ophthalmologists, making it a significant advancement in automating DR screening. The study employed a large dataset, ensuring robust validation. However, the models generalizability remains limited as it primarily focused on DR and did not consider other retinal conditions (Gulshan et al., 2016).
Similarly, Rajalakshmi et al. (2019) reviewed various AI-driven approaches for DR screening. The authors discussed challenges such as variations in image quality and the necessity for large annotated datasets. Their study provided a comprehensive comparison of different CNN architectures, offering valuable insights into the field. However, the review did not introduce new experimental data but rather compiled existing methodologies (Rajalakshmi et al., 2019).
-
Multi-Disease Classification in Retinal Imaging
De Fauw et al. (2018) proposed a deep learning model designed not only for DR but also for other retinal diseases such as macular degeneration. Their approach emphasized clinical applicability by evaluating AI-assisted diagnosis and referral decisions. The study benefited from a large and diverse dataset, making the findings more generalizable. However, its collaboration with a specific healthcare provider may restrict its applicability to different medical systems (De Fauw et al., 2018). Nie et al. (2021) conducted a comparative study of multiple CNN architectures for classifying various retinal diseases, including DR, macular degeneration, and retinal hemorrhage. Their analysis highlighted the strengths and weaknesses of different deep learning models, supporting the feasibility of multi-class classification in retinal imaging. Nevertheless, the study lacked an in-depth discussion on model optimization for real-time clinical use (Nie et al., 2021).
-
Survey on Deep Learning Techniques in Retinal Disease Detection
Cheng et al. (2020) provided an extensive survey on CNN applications in retinal disease classification. The study covered a wide range of conditions, including glaucoma and retinal vein occlusion, in addition to DR. It offered a detailed analysis of CNN architectures and their effectiveness in automated diagnosis. However, the survey mainly summarized existing approaches without contributing novel experimental findings. Moreover, it focused on CNNs, overlooking alternative deep learning methods such as transfer learning and capsule networks (Cheng et al., 2020).
-
-
ETHODOLOGY
-
Research Design
The proposed research employs an experimental approach designed to develop a robust deep-learning-based classification system for retinal diseases. This study adopts a systematic framework that integrates various phases data collection, preprocessing, model selection, training, evaluation, and deploymentensuring that each stage contributes to building an effective solution. At the core of the methodology is the use of Convolutional Neural Networks (CNNs), which serve as the primary model for both feature extraction and classification. This approach is chosen to leverage CNNs proven ability to learn complex patterns in image data, thereby facilitating the accurate classification of retinal conditions.
-
Data Collection
Data collection for this research will involve sourcing high-resolution retinal images from multiple reputable repositories. Public datasets such as Kaggles APTOS and Messidor, along with private hospital datasets, will provide a diverse range of images to ensure broad coverage of various retinal diseases. Each image will be meticulously annotated by expert ophthalmologists, ensuring that the classification labels accurately reflect the underlying conditions. To facilitate robust model training and evaluation, the collected dataset will be systematically divided into three subsets: 70% for training, 15% for validation, and the remaining 15% for testing. This stratified approach helps maintain data integrity and ensures that the model is evaluated on unseen images.
-
Data Preprocessing
In preparation for model training, the retinal images will ndergo comprehensive preprocessing. Initially, image normalization will be performed by adjusting the pixel values to a standard range (typically 0-1), which helps in stabilizing and accelerating the training process. Following normalization, region-based segmentation techniques will be applied to isolate the retinal regions from the background, thereby focusing the model on the most informative areas of each image. Furthermore, data augmentation methodsincluding rotation, flipping, zooming, and contrast adjustmentswill be implemented to artificially expand the dataset. This not only increases the variability of the training data but also enhances the models ability to generalize to new, unseen images.
-
Model Development
The model development phase involves evaluating several CNN-based architectures to determine the most effective one for retinal disease classification. Architectures such as VGG16, ResNet, and Inception will be systematically compared based on their performance metrics. Leveraging the benefits of transfer learning, pre- trained versions of these models will be fine-tuned on the retinal datasets, which significantly reduces training time while enhancing feature extraction capabilities. In parallel, hyperparameter tuning will be rigorously conducted using optimization techniques like Grid Search and Bayesian Optimization to refine key parameters, such as learning rate, batch size, and dropout rates. This process ensures that the model is optimally configured for high performance.
-
Model Training
Model training will be conducted using advanced optimization strategies to achieve rapid convergence and high accuracy. The Adam optimizer will be employed to update network weights efficiently, thereby accelerating the convergence process.
The loss function selected for this classification task is the Categorical Cross-Entropy, which is effective in minimizing classification errors by comparing the predicted outputs against the true labels. Training will be carried out using mini-batch gradient descent, a method that helps stabilize the learning process and manage computational resources. Additionally, early stopping mechanisms will be implemented to monitor training progress and prevent overfitting by halting the process when improvements plateau.
-
Model Evaluation
The performance of the deep-learning model will be comprehensively evaluated using a suite of metrics and visualization tools. Standard performance metrics, including accuracy, precision, recall, and F1-score, will be calculated to quantify the models classification effectiveness. A confusion matrix will be generated to identify and analyze the rates of misclassification among different disease categories. Moreover, the ROC-AUC curve will be utilized to assess the models discriminatory ability between classes. To further enhance transparency, Explainable AI (XAI) techniques, such as Grad- CAM, will be applied to visually interpret the models decision- making process, thereby providing insights into which image regions are most influential in determining the classification outcome.
-
System Implementation
The implementation phase focuses on integrating the deep- learning model into a user-friendly application tailored for healthcare professionals. A desktop interface will be developed using Python, with frameworks such as Tkinter or PyQt ensuring that the application is both functional and intuitive. The system will include robust database management through SQLite to securely store user data and classification results. Additionally, an authentication system will be integrated to ensure that only authorized personnel can access sensitive information, thereby maintaining data privacy and system security throughout the applications lifecycle.
-
Deployment
Deployment strategies for the retinal disease classification system are designed to balance scalability with local accessibility. The model will be deployed on cloud platforms like AWS or Google Cloud, which offer the necessary computational resources to handle large-scale data processing and real-time classification. In parallel, efforts will be made to optimize the model for edge computing, enabling its integration into local healthcare devices. This dual approach ensures that the system is both scalable for large hospital networks and accessible for smaller clinics or remote areas where cloud connectivity might be limited.
-
Testing and Validation
Extensive testing and validation protocols will be employed to ensure the reliability and performance of the entire system.
Each component of the system will undergo unit testing to verify its individual functionality and detect any potential issues early in the development process. Following this, system testing will be conducted to ensure that all components integrate seamlessly and operate as intended. Finally, feedback will be actively solicited from healthcare professionals who use the system, allowing for iterative refinements and ensuring that the solution meets practical clinical needs.
-
Maintenance and Future Enhancements
Long-term maintenance of the system is a critical aspect of this research, ensuring that the model remains effective and secure over time. Regular updates will be scheduled to retrain the model with new data, thereby adapting to evolving diagnostic trends and improving accuracy. Concurrently, security updates will be applied periodically to address potential vulnerabilities and protect sensitive patient data. Looking ahead, future enhancements will focus on expanding the systems capabilities by integrating advanced techniques such as multimodal learning and self-supervised learning. These advancements are expected to further refine diagnostic accuracy and extend the systems applicability across a broader range of retinal conditions.
-
Architecture
Fig.1.CNN Model features extraction
System architecture overview of the proposed method. The proposed method accepts images with resolution of 500 pixels in width and 300 pixels in height. CNN models extract features from OCT images and classify them using machine learning algorithms. Voting classifier ensemble output probabilities for predicting retinal disease. First, transfer learning based on CNN models extracts one thousand features from the OCT images. Second, various machine learning algorithms are used to classify the OCT images based on the features extracted by the CNN model. Finally, the ensemble algorithm fuses the distribution probabilities of the same class and predicts the retinal disease class based on probability. Each block of the proposed architecture is described in detail in the following subsections.
-
-
RESULTS
The emergence of deep learning has significantly transformed medical imaging, particularly in the classification of retinal diseases. This analysis evaluates the performance of various deep learning models, compares them with traditional diagnostic methods, discusses dataset utilization and algorithm efficiency, highlights key challenges, and explores potential future directions.
Numerous studies have assessed the effectiveness of deep learning models in diagnosing retinal diseases. For example, a study employing the VGG-19 architecture achieved an impressive classification accuracy of 99.17% across different retinal disease states, including Diabetic Macular Edema (DME), Choroidal Neovascularization (CNV), and Drusen. The model also demonstrated high sensitivity (0.99) and specificity (0.995), underscoring its reliability in accurate diagnosis. Furthermore, Receiver Operating Characteristic (ROC) curves from multiple studies consistently yielded an Area Under Curve (AUC) value close to 1, indicating excellent diagnostic caability. These results suggest that deep learning models can play a crucial role in early detection and management, ultimately improving patient outcomes.
Historically, retinal disease classification relied on manual interpretation by ophthalmologists, which could be subjective and dependent on expertise. Deep learning models, on the other hand, offer an automated and consistent approach. Comparative studies have demonstrated that Convolutional Neural Networks (CNNs) can match or even exceed the diagnostic accuracy of experienced ophthalmologists. For instance, in a direct comparison between deep learning models and trained ophthalmologists, models achieved comparable diagnostic accuracy. The ability of these models to rapidly analyze large datasets and flag potential concerns facilitates early intervention, which is crucial in preventing vision loss due to conditions such as diabetic retinopathy.
The efficiency of deep learning algorithms is directly influenced by the quality and diversity of training datasets. Research underscores the importance of well-structured datasets such as the STructured Analysis of the REtina (STARE) dataset, which ensures balanced representation of retinal diseases to improve model performance. Advancements in computational hardware, such as high- performance Graphics Processing Units (GPUs), have further enabled deep learning models to process vast datasets efficiently. This computational power allows models to capture intricate patterns within the data, enhancing diagnostic precision. Additionally, results from recent experiments comparing deep learning architectures provide further insights.
The accuracy scores of various models highlight that the 50- layer deep ResNet model achieved the highest test accuracy of 96.9%, surpassing the baseline accuracy of 96.6% from prior studies. Notably, this accuracy was attained within only 10 training epochs, compared to the extended training periods required in previous research.
Fig.2. Confusion Matrix
Despite their success, deep learning models for retinal disease classification face several challenges. Data availability and imbalance remain a significant issue, as large, well-annotated datasets are essential, yet many existing datasets are small or lack diversity, leading to overfitting. Additionally, the lack of explainability in deep learning models poses concerns, as clinicians require models that not only provide accurate predictions but also explain their decision-making process. Ethical and privacy concerns regarding compliance with regulations such as HIPAA must also be addressed for the responsible deployment of AI in healthcare. Misclassification of rare diseases is another challenge. Analysis of the confusion matrix revealed that the ResNet model struggled with Drusen classification due to its lower representation in the dataset. Most misclassified images were labeled as CNV,.
Fig.3. Training Accuracy
Given that Drusen had the lowest class distribution and higher false-negative rates, weighting classes during training could help improve performance. Background noise removal techniques could also mitigate overfitting issues, as XAI- based analysis indicated that models sometimes focus on irrelevant background pixels. Attention-based models that specifically target the choroid region during classification could further enhance diagnostic accuracy. Additionally, integrating Optical Coherence Tomography (OCT) imaging with other diagnostic tools could lead to more comprehensive and accurate disease classification. Addressing dataset limitations through data augmentation and transfer learning strategies can also help improve model generalizability.
Deep learning models have demonstrated exceptional performance in retinal disease classification, with architectures such as ResNet50 achieving high accuracy within a limited number of training epochs. However, challenges such as dataset imbalance, explainability, and ethical concerns must be addressed for widespread clinical adoption. Future research should focus on improving model robustness, leveraging multimodal data, and enhancing transparency to ensure deep learning’s responsible and effective application in medical diagnostics.
PERFORMANCE EVALUATION
To assess the effectiveness of the proposed model, its classification outputs are compared with the corresponding labels assigned by an ophthalmologist. These expert-labeled images serve as the benchmark for evaluating the models accuracy.
The model’s performance is analyzed using standard classification metrics such as precision and recall. Precision, also known as the positive predictive value, is calculated as the ratio of correctly predicted positive cases to the total predicted positives. Recall, also referred to as the true positive rate, measures the proportion of actual positive cases correctly identified by the model. These metrics are computed using the following formulas:
-
Precision (Positive Predictive Value):
Precision=TP/FP+TP
where TP represents the true positives, and FP represents the false positives.
-
Recall (True Positive Rate):
Recall=TP/FN+TP where FN represents the false negatives.
Since the dataset comprises 32 different retinal diseases, precision and recall are calculated for each class individually. The results indicate that all classes are classified with a high degree of accuracy, achieving an overall precision of 100% and recall of 99%. Specifically, the class AMN (Acute Macular Neuroretinopathy) achieves a precision of 91% with a recall of 100%. However, some variations exist across different disease classes.
The lowest precision recorded in the classification report is 77%, observed for the class Adult Foveomacular Dystrophy Appearance. Similarly, the lowest recall value is 70%, corresponding to the class RH (Retinal Hemorrhage). Despite these variations, the model consistently demonstrates high performance across multiple training epochs. Results analyzed at both 10 and 15 epochs confirm that the model delivers outstanding classification accuracy, making it highly reliable for retinal disease detection
-
-
CONCLUSION
The novel approach to retinal image processing presented in this study integrates multiple methods to enhance segmentation and introduces a swarm particle model, which holds potential for future nanoparticle control in retinal blood vessels. Advanced image processing techniques, including segmentation and denoising, are crucial in improving the quality of retinal images, providing a more reliable foundation for accurate diagnostics. The use of deep learning for the classification of retinal fundus images has demonstrated remarkable accuracy, supporting medical professionals by offering automated diagnostic capabilities. Retinal diseases such as Diabetic Retinopathy (DR), Age-
Related Macular
Degeneration (AMD), and glaucoma, if undetected, can lead to irreversible vision loss, imposing significant personal and economic burdens, especially in underdeveloped regions. Traditional manual analysis of retinal images is often time- consuming, subjective, and reliant on limited numbers of trained ophthalmologists. While deep learning models have shown high accuracy in retinal disease detection, most studies rely on public datasets such as EyePACS and Messidor, which may not be fully representative of diverse populations. Further research is necessary to validate these models using a wider range of datasets, including both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs), to assess their clinical feasibility and real-world application. The integration of deep learning into medical diagnostics presents a promising future, but ongoing improvements in dataset diversity, model interpretability, and clinical validation are essential for its effective implementation in healthcare settings.
-
REFRENCE
- li data-list-text=”1.”>
Russakoff, D. B., Lamin, A., Oakley, J. D., Dubis,
A. M., Sivaprasad, S., & Webster, A. R. (2019). Deep learning for retinal image analysis: A review. Eye, 33(1), 100-110.
-
Discusses the application of deep learning in retinal image classification and segmentation.
-
Zago, G. T., Andreão, R. V., Dorizzi, B., & Berton,
L. (2020). Diabetic retinopathy detection using deep learning techniques. PLOS ONE, 15(4), e0232525.
-
Evaluates CNN-based models for detecting diabetic retinopathy and compares classification performance using precision and recall.
-
-
Shankar, K., Perumal, E., & Tiwari, P. (2021). The role of deep learning in ophthalmology: A review. Computers in Biology and Medicine, 135, 104515.
-
Explores CNN-based models for the detection of retinal diseases and their evaluation using confusion matrices.
-
-
Jiang, H., Liu, Y., Zhang, X., Luo, Y., & Zhang, J. (2021). Automatic classification of retinal diseases based on deep learning. Applied Intelligence, 51(2), 1171-1183.
-
Implements CNN and deep residual learning for retinal disease classification and performance assessment.
-
-
Gargeya, R., & Leng, T. (2017). Automated identification of diabetic retinopathy using deep learning. Ophthalmology, 124(7), 962-969.
-
Assesses classification accuracy and robustness of a deep learning-based approach in diagnosing diabetic retinopathy.
-
-
Ke, Z., Shen, Y., Chen, J., & Zheng, L. (2022). A
review on deep learning for retinal disease diagnosis and screening. Artificial Intelligence in Medicine, 125, 102237.
-
Reviews various deep learning-based classification models and evaluates their effectiveness in disease detection.
-
-
Zhou, K., Li, F., Chen, H., Zhang, X., & Wu, Z. (2019). A deep learning-based framework for automated detection of retinal diseases. Biomedical Optics Express, 10(12), 6204-6226.
-
Uses CNN and deep residual learning to classify retinal diseases, focusing on model accuracy and precision-recall metrics.
-
-
Gulshan, V., Peng, L., Coram, M., Stumpe, M. C., Wu, D., Narayanaswamy, A., & Venugopalan, S. (2016). Development and Validation of a Deep Learning Algorithm for Detection of Diabetic Retinopathy in Retinal Fundus Photographs. JAMA, 316(22),2402 2410
.https://jamanetwork.com/journals/jama/fullarticle/25 88762
