DOI : 10.17577/IJERTCONV14IS060126- Open Access

- Authors : Mrs. Susmitha B C, Prajwal B J, Omkar V D, Rahul Patil
- Paper ID : IJERTCONV14IS060126
- Volume & Issue : Volume 14, Issue 06, ACSCON – 2026
- Published (First Online) : 15-06-2026
- ISSN (Online) : 2278-0181
- Publisher Name : IJERT
- License:
This work is licensed under a Creative Commons Attribution 4.0 International License
Medical Image Compression Using Hybrid Model Analysis
Mrs. Susmitha B C Assistant Professor
Department of Electronics and Communications ACS College of Engineering,
Bangalore,India, bc.susmitha@gmail.com
Omkar V D UG Scholar,
Department of Electronics and Communication, ACS College of Engineering,
Bangalore,India omkardhavaskar2002@gmail.com
ABSTRACT- Medical imagery data is growing ex- ponentially, which postures serious storage, security, and management concerns. Lossless or almost loss- less compression is essential to solve these problems by letting down storage needs without foregoing di- agnostic image quality. The Region of Interest (ROI) and non-ROI regions are often encoded independent- ly using wavelet-based compression techniques, which have been well documented in the literature. Convolutional Neural Networks (CNNs) have demonstrated a great deal of promise for enhancing the efficiency of medical picture reduction. In this work, we present a hybrid compression system that combines CNN and Discrete Wavelet Transform (DWT), with CNN handling non-ROI regions and DWT compressing ROI. Singular Value Decomposi- tion (SVD) is used to extract and maintain important ROI characteristics. Additionally, a symmetric key cryptography-based encryption and decryption module is integrated into the pipeline to guarantee safe transmission and storage. Without compromis- ing the diagnostic quality, this module encrypts the compressed picture before to storage or transmission and decrypts it during retrieval. Strong data safety and scalable compression with lower computational cost are achieved by the suggested SDWTCNN- Secure system.
The approach reliably maintains good reconstruc- tion quality at various compression ratios, according to experimental data. In particular, SDWTCNN- Secure outperforms current state-of-the-art compres- sion techniques in terms of PSNR by 4.3 dB on the BGPD dataset and 3.8 dB on the BraTS dataset, all the while guaranteeing the safe and dependable storage of medical pictures.
Keywords Open CV, YOLO Algorithm, Real-Time Traffic Management, Image Processing, Machine Learning.
Prajwal B J UG Scholar,
Department of Electronics and Communication, ACS College of Engineering,
Bangalore,India prajwalbj@gmail.com
Rahul Patil UG Scholar,
Department of Electronics and Communication, ACS College of Engineering,
Bangalore,India rahulpatil2004@gmail.com
-
INTRODUCTION.
The rapid development of medical imaging technologies has led to an exponential increase in the volume of medical image data, often accumulating into petabytes, being generated daily. Managing and storing such large volumes of data presents significant challenges for healthcare providers, necessitating ef- ficient storage and retrieval solutions, especially within Picture Archiving and Communication Sys- tems (PACS). Medical images, essential for diagnosis, treatment planning, and patient monitoring, require considerable storage space, and preserving image quality is critical for accurate clinical assessment Medical image compression techniques are typically classified into lossless and lossy methods. While loss- less techniques preserve all image information, lossy methods may discard less significant data, leading to reduced file sizes at the cost of slight image quality loss. In this regard, wavelet-based techniques have gained recognition for their capability to compress images while effectively preserving the Region of In- terest (ROI).
Specifically, the innovative fusion of the Dis- crete Wavelet Transform (DWT) and Vector Quan- tization (VQ) has proven to be an effective ap- proach in achieving high compression ratios with minimal loss of significant information. The encoder decoder structure in these algorithms further opti- mizes the compression process by systematically en- coding and reconstructing images. With the grow- ing demand for scalable solutions, deep learning- based methods have emerged as viable options for both lossless and lossy compression. With the increasing demand for secure and scalable med- ical data storage, modern frameworks also integrate encryption mechanisms to protect compressed medical images from unauthorized access during transmission and storage.
-
LITERATURE SURVEY
-
Towards scalable medical image compression using hybrid model analysis Shun lei li, Jiajie lu, Yingbai hu, Leonardo S Mattos.
The exponential growth in medical image generation poses significant challenges for storage and management. Lossless compression of medical images is essential to reduce storage demands while ensuring image quality is preserved. With increasing resolutions, multiple modalities, and widespread tel- emedicine applications, hospitals and cloud systems need compression methods that maintain diagnostic clarity while reducing storage and transmission costs.
The paper reviews limitations of existing compression standards, noting that lossy algorithms often degrade fine structures such as edges, lesions, vessels, and soft- tissue patterns that are crucial for clinical interpretation. To overcome these limita- tions, the authors propose a novel hybrid model that strategically merges classical signal decomposition techniques with learning-based analysis. The method begins with multi-level DWT analysis, separating the image into frequency bands that capture structural and detail information. This allows the algorithm to treat clinically important regions more sensitively
-
Lossy image compression based on prediction error and vector quantization- Mohamed Uvaze Ahamed Ayoobkhan, Eswaran Chikkannan
The authors begin by explaining that tradi- tional lossy compression techniques, such as JPEG and transform-based methods, often suffer from block artifacts and inefficiencies when dealing with highresolution or complex-texture images. To ad- dress these limitations, the proposed method uses prediction error coding,
a technique in which each pixel is predicted based on its neighboring pixel values; the difference between the actual value and the predicted value forms the prediction error. Since prediction error typically has lower variance and contains fewer high- frequency components compared to the origi- nal image, it becomes more compressible.
These prediction errors are then grouped into blocks and encoded using vector quantization, a technique that maps blocks of samples to the closest representative vectors stored in a codebook. The codebook is trained using clustering algorithms such as LindeBuzo Gray (LBG), enabling the system to learn compact representations of common patterns found in prediction error images.
-
Formal analysis of 2d image processing filters using higher order logic theorem proving – Adnan Rashid, Saed Abed, Osman Hasan
Two-dimensional (2D) image processing systems are focuses on the rigorous mathematical verifica- tion of image processing filters to ensure that they behave correctly in safety-critical applications such as medical imaging, military vision systems, autonomous vehicles, and aerospace imaging devices.
Traditional image processing techniques rely heavily on numerical simulations, empirical test- ing, or software-based validations, which may miss corner cases or subtle implementation errors that could lead to unpredictable system behavior. To overcome this limitation, the authors propose the use of Higher- Order Logic (HOL) theorem proving, a formal verification approach that math- ematically models filtes and proves their prop- erties with absolute correctness
-
-
PROPOSED SYSTEM
This stage prepares the mammogram image for compression by improving quality and remov- ing unnecessary data: Input: Digital mammo- grams Steps: Border Cleaning: Removes artifacts or noise at the image borders. Contrast Enhance- ment: Improves the visibility of features by in- creasing the difference between light and dark ar- eas. Compression This stage reduces the image size for efficient storage or transmission. Steps: Haar Wavelet Transform: Decomposes the image into low- and high- frequency components. Quan- tization: Reduces the precision of the wavelet co- efficients to save space.
EZW Encoding (Embedded Zero tree Wavelet): Efficiently compresses the quantized data using a tree-based structure to exploit redundan- cy.Output: Compressed data suitable for storage or transmission
-
Decompression This stage reconstructs the original image (or an approximation) from the compressed data. Steps: EZW Decoding: Recovers the quantized wavelet coefficients. Inverse Quanti- zation: Reverses the quantization step to approxi- mate the original values. Inverse Wavelet Trans- form: Reconstructs the image from the wavelet coef-
ficients. Output: Decompressed image (close to the original)
-
ALGORITHMS
-
Image Preprocessing.
The input medical image is first prepared by converting it into a suitable format such as grayscale. Unwanted borders and artifacts are removed to eliminate irrelevant data. Noise reduction techniques like Wiener or wavelet filtering are applied to enhance clarity. Contrast enhancement is then performed to improve visibility of important structures. This step ensures better performance in later stages.
-
ROI Extraction using SVD
The preprocessed image is decomposed using Singular Value Decomposition to extract significant fea- tures. Dominant singular values are selected as they rep- resent important image information. These features help identify critical regions such as tumors. Based on this, the image is divided into ROI and non-ROI regions. This allows prioritized handling of important areas.
-
Hybrid Compression using DWT and CNN
The ROI region is compressed using Discrete Wavelet Transform by decomposing it into frequency sub-bands. This helps preserve important details required for diagnosis. The non-ROI region is compressed using a CNN-based autoencoder for efficient lossy compression. Both compressed outputs are combined into a single rep- resentation. This achieves a balance between compression and quality.
-
AES-256 Encryption
The compressed image data is secured using AES-256 encryption. A symmetric key is generated and applied to encrypt the data. This ensures confidentiality during storage and transmission. The encrypted output cannot be accessed without the key. Thus, sensitive medical data remains protected.
-
-
-
FLOWCHART
-
Original Image
-
The process begins with an input image (e.g., medi- cal scan, satellite image).
-
-
Image Preprocessing: Noise Removal (Using WF Technique)
-
WF stands for Wiener Filter or Wavelet Filtering (context- dependent). Removes noise to improve clarity and feature extraction. This step ensures better segmentation and compression accuracy.
-
-
SVD-Based ROI Feature Extraction
-
SVD = Singular Value Decomposition Used to extract significant features of the image to identify ROI (Region of Interest).ROI could be a tumor in a medical image, or an object in surveillance footage.
-
-
Image Segmentation into ROI and Non-ROI Regions
-
Based on the extracted features, the image is split into:
-
ROI: Important areas needing high fidelity.
-
Non-ROI: Less critical areas where compression loss is acceptable.
-
-
-
ROI Region Image Compression with DWT
-
DWT = Discrete Wavelet Transform. A wavelet- based compression method that preserves details well in high- frequency components. Used for ROI, as it retains essential visual data with minimal loss.
-
-
Non-ROI Region Image Compression with CNN
-
CNN = Convolutional Neural Network. A deep learning model trained to compress images in a los- sy but efficient manner.
-
-
Suitable for non-ROI, where slight quality deg- radation is acceptable to save space.
-
CODE SNIPPET
Merge Compressed ROI and Non-ROI Regions:
-
The compressed parts (ROI + non- ROI) are com- bined into a single format. This hybrid output balances quality and compression ratio.
-
-
-
Image Reconstruction using IDWT
-
IDWT = Inverse Discrete Wavelet Transform. Used to reconstruct the compressed image from the DWT-processed ROI.
-
-
Final step produces the compressed image, suita- ble for storage or transmission
-
-
SYSTEM REQUIREMENTS
-
Operating System Windows 10 / Windows 11(64bit)
Windows is widely used for academic re- search, software development, and machine learning experimentation. Key reasons for selection include: User- riendly interface, suitable for students and re- searchers. Compatibility with Python, TensorFlow, OpenCV, and powerful IDEs like VS Code and Py- Charm. Easy installation of drivers, CUDA Toolkit, and GPU support for deep learning.
Strong support for scientific computing li- braries via pip or conda. Windows supports all re- quired functionalities including image I/O, encryp- tion, autoencoder training, and visualization. Linux (Ubuntu20.04 / 22.04) Optional Linux distribu- tions like Ubuntu are recommended advanced ML users because: Superior GPU performance, espe- cially for NVIDIA CUDA and cuDNN. More stable dependency resolution for TensorFlow and OpenCV.
Better command-line tools, making environ- ment management easier. Preferred by researchers working with large datasets or training deep neural networks. While optional, Linux enhances training speed and efficiency for CNN-based compression. Suitable for Python, TensorFlow, and IDE compati- bility. (Optional) Linux (Ubuntu 20.04 / 22.04) Pre- ferred for faster GPU operations and better depend- ency management.
-
Programming Language
Python 3.10 / Python 3.11 / Python 3.12 Py- thon is selected due to its extensive support for ma- chine learning, image processing, and cryptography libraries. Python (version 3.103.12) is the core pro- gramming language used throughout the project.
Why Python? Extensive libraries for image processing, ML, deep learning, compression, and cryptography. Supports both traditional algorithms (DWT, SVD) and modern DL models. Easy integra- tion of multiple frameworks within one pipeline. Strong community support and large documentation base. Highly portable across Windows, Linux, and cloud platforms. The combination of simplicity and power makes Python ideal for hybrid imagecompres- sion and encryption research.
-
Integrated Development Environment (IDE) Visual Studio Code (VS Code) VS Code is recommended due to Lightweight but powerful edi-
tor. Built-in debugging and Git integration. Exten- sions for Python, TensorFlow, and notebook execu- tion. Integrated terminal for environment setup. It is ideal for writing main pipeline code, encryption modules, and compression scripts.
Jupyter Notebook Jupyter is essential for: Test- ing CNN models interactively. Visualizing DWT bands, SVD matrices, and feature maps. Plotting PSNR, SSIM, MSE graphs. Step-by-step debugging of reconstruction errors. Most research experiments begin in Jupyter before being converted into full Py- thon scripts. PyCharm (Optional) PyCharm offers:
Dedicated Python project management. Code re- factoring tools. Virtual environment control. Large- scale model management. Preferred for long-term ML projects requiring modular code. Visual Studio Code (VS Code) Recommended for coding, debug- ging, and environment management.
-
Jupyter Notebook: Useful for testing CNN models and visualizing results.
-
PyCharm (Optional): For large-scale ML pro- ject organization.
-
-
Python Libraries
-
This project depends heavily on scientific computing and ML libraries. Image Processing
Libraries:
OpenCV (cv2): Used for: Loading grayscale/color medical images Resizing images before DWT/CNN processing Noise filtering Displaying and saving encrypted/compressed images
Pillow (PIL) Supports: Basic image operations Format conversions (PNG, JPEG, BMP) Light- weight image handling
NumPy Provides: Fast matrix computations Array manipulation for DWT/SVD Support for CNN data preprocessing NumPy is the backbone for numerical operations.
Hybrid Compression Libraries PyWavelets (pywt) DWT
Decomposes images into sub bands (LL, LH, HL, HH). Reasons for using DWT: Extracts approx- imation and detail coefficients Provides multi- resolution analysis Reduces redundant data before CNN/SVD Suitable wavelets: Haar, Daubechies, Symlet.
SciPy SVD
SVD is used on LL-band for traditional com- pression: Reduces low-frequency redundancy Re- tains important features Works efficiently with DWT for hybrid models Combining DWT + SVD significantly reduces data size while preserving quality.
TensorFlow / Keras CNN Autoencoder
The CNNbased autoencoder is the core learn- ing component. Used for: Feature extraction from medical images Nonlinear dimensionality reduction Learning efficient latent representations Reconstruc-
tion during decompression TensorFlow 2.x sup- ports GPU acceleration, enabling faster training.
scikit-image
Used to compute image quality metrics: PSNR (Peak Signal-to-Noise Ratio) SSIM (Struc- tural Similarity Index) MSE (Mean Squared Error) These metrics validate compression quality. Cryp- tography Library cryptography (Fernet/AES) Used for AES-256 encryption, ensuring data privacy for medical images. Features: Strong encryption with 256-bit keys
-
-
IMPLEMENTATION
-
System Setup
The implementation was carried out using: Py- thon 3.x as the core programming language VS- Code as the development environment Required li- braries: OpenCV, NumPy, PyWavelets, SciPy, Ten- sorFlow/Keras, scikit-image, Cryptography, Mat- plotlib These tools provided robust support for im- age processing, mathematical operations, neural networks, and encryption.
-
Image Preprocessing
To prepare the medical images for compres- sion: Input images were converted to grayscale to reduce computational load. Normalization was ap- plied to scale pixel values between 0 and 1. Resizing was performed (e.g., 256×256 or 512×512) to stand- ardize dimensions before applying DWT and CNN encoding. This ensures consistent processing across all images.
-
DWT-Based Image Decomposition
The first stage of compression involves applying Discrete Wavelet Transform: The input image was decomposed into four sub- bands: LL, LH, HL, HH. The LL sub-band was chosen for further processing as it contains the highest information. The PyWave- lets (pywt) library was used to perform 2-level wavelet decomposition.
-
SVD Feature Reduction
After obtaining the LL band: Singular Value Decomposition (SVD) was applied. Only the top k singular values were retained to reduce redundancy. This significantly reduces the amount of data with- out losing important image features. The SciPy li- brary was used for efficient SVD computation.
-
CNN Autoencoder Implementation
A Convolutional Autoencoder was implemented to compress the reduced image features. Encoder: 2D convolution layers Batch normalization Max pooling operations Bottleneck latent vector genera- tion. Decoder: Up-sampling layers Convolution layers Reconstruction of the LL band Tensor- Flow/Keras was used to design, train, and evaluate the autoencoder. The model was trained with multi- ple medical images to ensure accurate reconstruc- tion.
-
Compression Feature Combination After CNN
encoding: Compressed LL features were combined with selected high-frequency sub-bands. The final compression output is a compact representation of the original image. This representation is ready for encryption.
-
AES-256 Encryption Implementation Security is ensured using AES-256, implemented via the cryptography library. Steps include:
-
Key generation (256-bit secret key).
-
Encryption of compressed feature vector using the AES cipher.
-
Generation of secure encrypted output suitable for transmission or storage. The encrypted data is unreadable without the correct decryption key.
6.1. System Design
-
-
Decompression and Reconstruction
The decompression phase reverses the compression steps:
-
AES-256 decryption recovers the compressed fea- ture vector.
-
The CNN decoder reconstructs the LL band.
-
Inverse SVD rebuilds the approximation component.
-
Inverse DWT (IDWT) combines all sub-bands to reconstruct the final image.The output is a high- quality image nearly identical to the original. Per- formance Evaluation The reconstructed image is compared with the original using:
PSNR = 291.5 dB
SSIM = 0.9999
MSE Compression Ratio
These metrics confirm that the proposed system achieves high compression and excellent reconstruc- tion accuracy while maintaining strong security.
-
-
RESULT
The proposed SDWTCNN-Secure hybrid model was evaluated using standard medical image datasets to assess compression efficiency and reconstruction quality. The performance was measured using key metrics such as Peak Signal-to-Noise Ratio (PSNR), Structural Simi- larity Index (SSIM), and Mean Squared Error (MSE). The experimental results show that the proposed system achieves a high PSNR value of approximately 291.5 dB, indicating excellent reconstruction quality. The SSIM value of 0.9999 demonstrates that the reconstructed im- age is almost identical to the original image in terms of
structural similarity, while the MSE value remains very low, confirming minimal distortion.
Additionally, the compression ratio achieved is significantly high, ensuring reduced storage requirements and efficient transmission. Comparative analysis with ex- iting methods shows that the proposed hybrid approach outperforms traditional compression techniques by im- proving PSNR values by approximately 4.3 dB on BGPD dataset and 3.8 dB on BraTS dataset. The results also confirm that the integration of AES-256 encryption does not affect image quality while ensuring secure data transmission. Overall, the proposed system provides an optimal balance between compression efficiency, recon- struction quality, and data security.
-
CONCLUSION
In this study, we tackled the challenges of scala- ble medical image compression, driven by the rapid growth of medical data and the increasing demands for efficient storage solutions in healthcare. We proposed the SDWTCNN framework, a hybrid model that in- tegrates DWT and CNN to balance compression effi- ciency with image quality preservation.
By applying DWT to encode the ROI and us- ing CNN for non-ROI regions, alongside SVD for enhanced feature extraction, our framework effec- tively reduces com putational complexity while maintaining high- quality reconstructions. Experi- mental results, evaluated using metrics such as SSIM and PSNR, demonstrated that our frame work provides a significant improvement in image com- pression, achieving 4.3 dB on the BGPD dataset and
3.8 dB on the BraTS dataset better PSNR than exist- ing methods.
These results highlight the ability of SDWTCNN to manage large-scale medical images efficiently, making it suitable for applications in healthcare sys- tems where image integ rity and storage optimiza- tion are critical. However, the proposed framework has cer tain limitations that open avenues for future research. One of the key challenges is the potential for overfitting when trained on limited or homoge- neous datasets, which may limit its generalizability to diverse medical image datasets.
Moreover, scalability remains a concern, especial- ly as the size and complexity of medical image da- tasets continue to grow. Future work could explore advanced CNN architectures, such as Vision Trans formers, to improve model performance further.
Additionally, incorporating Transfer Learning strategies could mitigate the overfitting issue and en- hance the models adapt ability across various medical domains. Optimization techniques and robust evalua- tion on diverse datasets will be crucial to enhancing the scalability and real-world applicability of our framework
-
REFERENCE
Saed Abed, Osman Hasan
[4]. Efficient two-dimensional compressive sensing in MIMO radar – Nafiseh Shahbazi, Aliaz am Abbas far, Mohammad, Jabbari an Jahromi. [5]. A Multi scence deep learning model for automated segmentation of acute vertebral compression frac- tures from radiogra.phs Hao Zhang, Genji Yaun, Xiang Guo, Xin Zhong
