Machine Learning Models for Fault Detection and Classification in Predictive Maintenance

doi:10.17577/IJERTV15IS020287

Volume 15, Issue 02 (February 2026)

Machine Learning Models for Fault Detection and Classification in Predictive Maintenance

DOI : 10.17577/IJERTV15IS020287

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 3
Authors : Mohammed Sahil Ps, Gopika Shaju, Saranya Raju, Thomas Paul, Dr. N. Vishwanath
Paper ID : IJERTV15IS020287
Volume & Issue : Volume 15, Issue 02 , February – 2026
Published (First Online): 21-02-2026
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Machine Learning Models for Fault Detection and Classification in Predictive Maintenance

Mohammed Sahil PS, Gopika Shaju, Saranya Raju, Thomas Paul, Dr. N. Vishwanath

Computer Science and Engineering

Toc H Institute of Science and Technology, Arakkunnam, Ernakulam 682313

Abstract Modern industrial operations demand continuous, reliable machine performance to maintain productivity and ensure worker safety. Traditional fault detection systems, which depend on xed threshold values and manual monitoring, often fail when faced with dynamic operating conditions. This research introduces a hybrid machine learning framework designed to detect and classify mechanical faults in rotating machinery with both accuracy and computational efciency. We leverage the widely-recognized Case Western Reserve Univer- sity bearing vibration dataset and implement a two-stage diag- nostic approach. Initially, a One-Class Support Vector Machine learns normal operational patterns from healthy bearing data, establishing a baseline for anomaly detection. When deviations occur, a Random Forest classier steps in to pinpoint the exact fault typewhether its an inner race defect, outer race damage, or ball bearing failure. Our feature extraction process combines time-domain metrics like RMS and kurtosis with frequency-domain characteristics obtained through FFT analy- sis. To ensure our results reect real-world performance rather than overtted patterns, we employ le-level data splitting that completely separates training and testing datasets. Robustness validation includes noise injection experiments and cross- domain testing under varied operating conditions. The system is purposefully designed with lightweight algorithms suitable for edge computing environments, and weve developed an interactive web dashboard that makes predictions accessible to maintenance personnel without requiring data science expertise. Our experimental ndings conrm that this approach delivers reliable fault diagnosis capabilities appropriate for Industry 4.0 predictive maintenance deployments.

INTRODUCTION
1. Background Information
  The fourth industrial revolution has transformed how we approach equipment maintenance in manufacturing environ- ments. Where facilities once waited for machines to break down before taking action, we now have the capability to predict failures before they occur [2]. This shift is particularly crucial given that unexpected equipment downtime can cost manufacturers thousands of dollars per hour in lost produc- tion, not to mention the safety risks associated with sudden mechanical failures.
  
  Traditional maintenance approaches fall into two cate- gories: reactive maintenance, where repairs happen after breakdowns, and scheduled preventive maintenance, where components are replaced at xed intervals regardless of their actual condition. Both methods have signicant drawbacks.
  
  The reactive approach leads to unpredictable downtime and potential cascade failures, while preventive maintenance often replaces parts that still have substantial useful life remaining, wasting both materials and labor [7].
  
  Machine learning has emerged as a game-changer in this landscape. Unlike rule-based systems that struggle when conditions deviate from their programmed parameters, ML algorithms can learn complex patterns directly from sensor data [1]. For rotating machinery, vibration signals provide particularly rich information about bearing health, as differ- ent types of defects produce distinctive vibration signatures that trained models can recognize.
2. Evolution of Predictive Maintenance
  The journey toward intelligent predictive maintenance began with simple vibration monitoring, where technicians would manually inspect frequency spectra looking for tell- tale peaks indicating specic bearing faults. This required signicant expertise and was time-consuming, making it impractical for facilities with hundreds or thousands of monitored assets.
  
  The introduction of Fast Fourier Transform-based auto- mated analysis represented a major step forward, allowing systems to ag potential issues based on predened fre- quency patterns [9]. However, these systems remained brittle in the face of varying operating conditionswhat looks like a fault signature at one speed or load might be perfectly normal at another.
  
  Industry 4.0 has brought unprecedented connectivity and computational power to the factory oor. Modern industrial equipment often ships with built-in accelerometers, temper- ature sensors, and network interfaces, generating continuous streams of condition monitoring data. This data abundance, combined with advances in machine learning, has enabled truly adaptive diagnostic systems that improve their perfor- mance over time as they encounter more examples [8].
3. Research Problem
  Despite these advances, several challenges prevent widespread adoption of ML-based predictive maintenance. First, fault events are relatively rare in well-maintained fa- cilities, creating severely imbalanced datasets where normal
  
  operation examples vastly outnumber fault examples. Sec- ond, the labeled data required to train supervised classiers is expensive to obtain, as it requires either running equipment to failure (unacceptable in production environments) or creating faults in controlled test rigs. Third, models trained on data from one machine or operating condition often perform poorly when applied to different equipment or environments. Finally, many ML solutions operate as “black boxes,” pro- viding predictions without explanationsa serious barrier to trust in safety-critical applications where maintenance decisions have signicant consequences.
4. Signicance of the Research
Our work addresses these challenges through several key innovations. The hybrid architecture separates anomaly de- tection from fault classication, allowing the system to ag unusual behavior even when it hasnt seen that specic fault type during training. The One-Class SVM approach means we can train an effective anomaly detector using only normal operation data, which is abundant and doesnt require running equipment to failure [3]. By combining multiple complementary featuresboth time-domain statis- tics and frequency-domain characteristicswe create robust representations that generalize better across operating condi- tions. The Random Forest classier provides both accurate predictions and interpretable feature importance rankings, helping maintenance teams understand which signal charac- teristics drive each diagnosis [10]. Finally, our emphasis on lightweight algorithms and edge deployment ensures the sys- tem can operate in real-time on modest hardware, enabling rapid response to developing faults without dependence on cloud connectivity.
LITERATURE REVIEW
1. Overview of Relevant Literature
  The application of machine learning to industrial fault detection has gained substantial research attention across multiple domains. Recent comprehensive reviews have high- lighted both the promise and the persistent challenges in this eld [2]. Studies have demonstrated successful imple- mentations in diverse settings including HVAC systems [1], electrical power grids [6], renewable energy installations [5], and heavy industrial equipment [8]. A common thread across this research is the superiority of data-driven approaches over traditional threshold-based methods, particularly in coplex systems where fault signatures vary with operating condi- tions.
2. Key Theories and Concepts
  Several foundational concepts underpin modern ML-based predictive maintenance. Anomaly detection, as implemented through techniques like One-Class SVM, learns the char- acteristics of normal system behavior and ags deviations without requiring extensive fault examplesa crucial advan- tage given the rarity of failure events in operational datasets [3]. Ensemble learning methods, particularly Random Forest and gradient boosting algorithms, combine predictions from
  
  multiple base learners to achieve both higher accuracy and better generalization than individual models [4]. Feature engineering remains critical despite advances in deep learn- ing; carefully selected statistical and spectral features often outperform raw signal inputs, especially when training data is limited [1]. The shift from reactive to predictive maintenance represents more than just a technological upgradeits a fundamental reimagining of maintenance strategy enabled by the convergence of affordable sensors, edge computing, and sophisticated analytics [7].
3. Gaps in the Literature
While the research literature demonstrates many success- ful laboratory demonstrations, several gaps hinder real-world deployment. Data quality issues plague practical implemen- tations, as real industrial environments introduce noise, sen- sor drift, and operating condition variations rarely captured in benchmark datasets [2]. The interpretability problem is particularly acutemany high-performing models provide accurate predictions but offer little insight into their rea- soning, creating barriers to adoption in industries where maintenance decisions must be explainable and defensible [5]. Generalization challenges persist, with models often showing impressive performance on the specic systems they were trained on but degrading signicantly when ap- plied to different equipment or facilities [6]. The eld also lacks standardized evaluation protocols, making it difcult to meaningfully compare different approaches or reproduce published results [4]. Perhaps most importantly, theres a notable scarcity of research focusing on edge-deployable solutionsmost published work assumes cloud computing resources that may not be available or practical in many industrial settings [8].
METHODOLOGY
1. Research Design
  We designed our system around a two-stage architecture that mirrors how maintenance decisions actually happen in industrial settings. When monitoring equipment, the rst question is always “Is something wrong?” followed by “What specically is wrong?” Our approach reects this logic. Stage one employs unsupervised learning to answer the rst ques- tionspecically, a One-Class SVM trained exclusively on healthy bearing data learns what normal operation looks like and ags anything that deviates signicantly [3]. Only when an anomaly is detected does stage two activate, applying a supervised Random Forest classier to determine whether were seeing an inner race fault, outer race fault, or ball bearing defect.
  
  This design offers several practical advantages over mono- lithic classication approaches. Most importantly, it ad- dresses the labeled data scarcity problemwe can build an effective anomaly detector without any fault examples, using only the abundant normal operation data that every facility generates. This also makes the system more adaptable to novel fault types not seen during training; while the classier
  
  might struggle with an unfamiliar fault pattern, the anomaly detector can still ag it as abnormal, prompting investigation. We deliberately chose lightweight machine learning algo- rithms over deep learning approaches. While neural networks can achieve impressive results, they demand substantial computational resources and large training datasets. Our target deployment environmentedge devices in industrial facilitiescalls for models that can run on modest hardware and train effectively with limited data. Random Forest and
  
  One-Class SVM t these requirements perfectly [10].
2. Data Collection Methods
  Our experiments utilize the Case Western Reserve Univer- sity bearing dataset, a widely-cited benchmark in the bearing diagnostics community [9]. This dataset offers several valu- able characteristics: it includes high-quality vibration signals sampled at 12 kHz from bearings in both healthy and various fault conditions; faults were introduced through precision machining rather than natural wear, ensuring controlled and reproducible defect characteristics; and multiple motor load conditions are represented, allowing evaluation of model robustness across operating points.
  
  Fig. 1. Organization of the CWRU bearing vibration dataset showing folder structure for different fault conditions
  
  The dataset structure, illustrated in Figure 1, organizes signals by fault type and severity. Each recording captures continuous acceleration measurements from the drive-end bearing, with faults ranging from 0.007 to 0.028 inches in diameter. For our purposes, we segment these continuous signals into overlapping analysis windowsa standard prac- tice in vibration analysis that provides multiple independent observations from each recording.
3. Sample Selection
  Preventing data leakage is critical for honest performance evaluation, yet many published studies overlook this issue. The problem arises when samples from the same physical recording appear in both training and testing sets. Even after segmenting into windows, these samples share underlying characteristicssensor noise patterns, mounting resonances, specic operating conditionsthat arent representative of true generalization to new data.
  
  We implement strict le-level splitting: entire vibration recordings are assigned exclusively to either training or testing. This ensures the model never sees data from the same experimental run during both training and evaluation. For the anomaly detection stage, we reserve all fault recordings exclusively for testingthe One-Class SVM trains only on normal data and encounters faulty bearings for the rst time during evaluation. For the classication stage, we split the
  
  labeled fault data at the le level, maintaining complete independence between training and testing sets.
  
  This conservative approach means our reported perfor- mance metrics reect genuine generalization capability rather than memorization of specic experimental conditionsa crucial distinction when evaluating systems intended for deployment on real industrial equipment.
4. Data Analysis Techniques
Raw vibration signals require transformation into mean- ingful features before machine learning algorithms can pro- cess them effectively. We segment each signal into 1024- sample windows with 50% overlap, balancing the competing needs for temporal resolution and statistical reliability.

From each window, we extract a comprehensive fea- ture set spanning both time and frequency domains. Time- domain features capture basic signal characteristics: mean and standard deviation describe central tendency and spread; RMS (root mean square) quanties overall vibration energy; kurtosis and skewness measure impulsiveness and asym- metry, both of which increase with bearing defects; and crest factor (peak-to-RMS ratio) highlights transient impacts characteristic of fault events.

Frequency-domain analysis complements these time- domain metrics. We apply FFT to each window and ex- tract spectral features including peak frequency (indicating dominant vibration components), spectral centroid (char- acterizing the distribution of spectral energy), and energy content in low, mid, and high frequency bands. This multi- band approach provides robustness against noise, as bering faults typically affect specic frequency ranges while leaving others relatively unchanged.

Fig. 2. Complete workow from raw vibration data through feature extraction to fault diagnosis

Figure 2 illustrates our complete processing pipeline. For the anomaly detection model, we train the One-Class SVM with an RBF kernel exclusively on features extracted from normal operation windows. The kernels gamma parameter controls the boundarys smoothness, while the nu parameter determines the expected fraction of outliers. During testing, the models decision function produces a score indicating how far each sample lies from the learned normal re- gionlarge negative scores indicate strong anomalies.

The Random Forest classier receives all labeled data (both normal and faulty) during training. We employ bal- anced class weights to prevent the model from simply predicting the majority class, and limit tree depth to avoid overtting to training idiosyncrasies. The ensemble nature of Random Forestcombining predictions from hundreds of decision treesprovides both accuracy and stability.

Before training, we standardize all features to zero mean and unit variance. This ensures that features with naturally large ranges (like RMS) dont dominate the learning process compared to smaller-scale features (like certain spectral characteristics). Model evaluation employs multiple metrics: classication accuracy, precision, recall, F1-score, and con- fusion matrices for detailed error analysis. Beyond standard performance measures, we conduct robustness experiments adding synthetic noise at various levels and cross-domain tests applying models trained at one load condition to data from different loads.
RESULTS
1. Presentation of Findings
  Our experimental evaluation demonstrates that the hy- brid two-stage approach achieves strong performance across multiple evaluation criteria. The One-Class SVM anomaly detector successfully identied abnormal vibration patterns with high sensitivity, agging the vast majority of fault cases while maintaining acceptably low false alarm rates on normal data. This validates the core premise that normal operation characteristics can be learned without requiring fault examples during training.
  
  The Random Forest classier delivered robust perfor- mance in the fault identication stage, achieving over 96% accuracy in distinguishing between inner race, outer race, and ball bearing faults. Equally important, inference time re- mained well within real-time constraintspredictions com- plete in milliseconds, fast enough for continuous monitor- ing applications. This computational efciency stems from our feature-based approach; by processing compact feature vectors rather than raw signal segments, we dramatically reduce the computational burden compared to deep learning alternatives.
2. Data Analysis and Interpretation
  Examining feature importance rankings from the Random Forest model provides valuable insights into which sig- nal characteristics drive fault classication decisions. RMS emerged as the dominant predictor, consistent with the under- standing that bearing faults increase overall vibration energy. Kurtosis proved nearly as important, reecting its sensitiv- ity to the impulsive shock events generated when rolling elements pass over defects. Frequency-domain features, par- ticularly spectral centroid and mid-band energy, contributed signicantly to distinguishing between fault typesdifferent defect locations produce characteristic shifts in the vibration spectrum.
  
  Figure 3 illustrates typical feature behavior during normal operation. Both RMS and kurtosis remain stable across sequential windows, with RMS showing moderate values reecting routine operational vibration and kurtosis staying near 3 (the value for Gaussian-distributed signals). This stability provides the baseline against which anomalies are detected.
  
  The contrast becomes immediately apparent in Figure 4, which shows the same features extracted from a faulty
  
  Fig. 3. RMS and kurtosis signatures from healthy bearing operation showing stable, low-amplitude characteristics
  
  Fig. 4. Dramatic elevation in RMS and sharp kurtosis spikes characterizing faulty bearing behavior
  
  bearing. RMS jumps to signicantly higher levels, indicat- ing increased vibration energy. More dramatically, kurtosis exhibits sharp spikessometimes exceeding 10revealing the impulsive character of impacts as rolling elements strike defects. These distinctive signatures enable reliable fault detection.
  
  Figure 5 quanties each features contribution to classi- cation accuracy. The dominance of RMS and kurtosis aligns with physical understanding of bearing fault mechanisms. Notably, several frequency-domain features also show sub- stantial importance, validating the decision to include both time and frequency characteristics in the feature set. This information could guide future optimizationfeatures with very low importance might be candidates for removal to further reduce computational cost.
  
  Confusion matrix analysis revealed that classication er- rors, while rare, tended to occur between similar fault types. For instance, the model occasionally confused outer race faults with ball faults, both of which can produce similar spectral signatures depending on defect geometry and load distribution. These error patterns suggest directions
  
  laboratory test rigs; sensor electrical noise, electromagnetic interference, and ambient vibration all degrade signal quality.
  
  Fig. 5. Relative importance of different features in the Random Forest classication model
  
  for improvement, perhaps through addition of features more specically targeted at these challenging cases.
3. Robustness and Generalization Testing
  Real-world deployment demands robustness to conditions beyond those seen during training. We evaluated this through two complementary experiments. First, cross-domain testing assessed generalization across different operating conditions. Training on data from one motor load and testing on different loads reveals how well the model captures fundamental fault signatures versus memorizing load-specic patterns.
  
  Fig. 6. Classication accuracy within-domain and cross-domain, highlight- ing the generalization challenge
  
  Figure 6 presents these results. Within the same operating domain (train and test on same load), accuracy approaches 100%, demonstrating that the model can indeed learn to distinguish fault types when conditions match. Cross-domain performance drops noticeably but remains respectable, typi- cally above 80%. This degradation is unsurprisingvibration signatures do shift with operating conditionsbut the main- tained accuracy shows the feature set captures substantial generalizable information.
  
  Second, we tested robustness to measurement noise by adding Gaussian noise at various signal-to-noise ratios. In- dustrial environments rarely provide the clean signals of
  
  Fig. 7. Model performance degrades gracefully as noise levels increase, maintaining useful accuracy even under challenging conditions
  
  Figure 7 shows accuracy versus noise level. At high SNR (low noise), performance matches clean-signal re- sults. As noise increases, accuracy gradually declines but remains above 80% even at moderate SNR levels compa- rable to real industrial environments. Only at very high noise levelscorresponding to severely degraded sensors or extreme interferencedoes performance drop substantially. This graceful degradation is exactly what we want; the system maintains utility across a wide range of realistic operating conditions.
4. Support for Research Question
These results collectively validate our central hypothesis: a carefully designed hybrid machine learning system can effectively detect and classify bearing faults with accuracy suitable for practical deploymen, while maintaining com- putational efciency compatible with edge computing envi- ronments. The two-stage architecture successfully addresses the limited labeled data challengeanomaly detection works with only normal examples, while the classier achieves high accuracy despite modest training set sizes. Feature engineer- ing combining time and frequency domain characteristics provides robust performance across operating conditions. Most importantly, the system demonstrates the right balance of accuracy, interpretability, and computational efciency for real-world industrial application.
DISCUSSION
1. Interpretation of Results
  Several factors contribute to the hybrid approachs strong performance. The One-Class SVMs ability to learn complex decision boundaries in feature space allows it to capture the natural variability of normal operation while still agging genuinely anomalous patterns. Unlike simple threshold-based approaches that treat each feature independently, the SVM considers combinations of features, detecting anomalies that might not be obvious in any single measurement.
  
  The Random Forest classiers ensemble architecture pro- vides robustness against overttinga critical advantage when working with limited training data, as is typical in industrial fault diagnosis. Each tree in the forest sees a differ- ent random subset of features and training samples, learning partially independent decision rules. Averaging predictions across hundreds of such trees produces stable, reliable clas- sications less prone to the quirks of any individual tree.
  
  Perhaps most importantly, the combination of carefully selected features spanning both time and frequency domains creates representations that capture the underlying physics of bearing faults. RMS and kurtosis respond to fault-induced energy and impulsiveness in the time domain, while spectral features detect characteristic frequency patterns associated with defect locations. This multi-faceted view provides re- dundancyeven if one feature is corrupted by noise or un- usual operating conditions, others often maintain diagnostic value.
  
  The strong performance we observe suggests that machine learning-based systems are ready to move beyond laboratory demonstrations into real industrial deployment, where they can provide substantial value by enabling proactive mainte- nance strategies [2], [7].
2. Comparison with Existing Literature
  Our results align well with recent ndings in the industrial fault detection literature while offering some notable im- provements. Leite et al.s comprehensive review highlighted interpretability as a key barrier to adoption of ML-based fault detection systems [2]. Our approach directly addresses this through feature importance analysis and the inherent in- terpretability of tree-based modelsmaintenance personnel can understand that a fault diagnosis stems from elevated RMS and kurtosis values, concepts familiar from traditional vibration analysis.
  
  Compared to studies focusing on simulated data [6], our validation on the widely-recognized CWRU benchmark dataset provides stronger evidence of real-world applicabil- ity. While simulation offers perfect control of experimental conditions, it often fails to capture the complexity and variability of actual industrial environments. The CWRU dataset, though still laboratory-based, includes real sensor noise and mechanical system dynamics.
  
  The accuracy we achieve exceeds results reported in some recent electrical fault classication studies [4], while our computational efciencymeasured in milliseconds per predictionsurpasses deep learning approaches that require seconds or minutes even on GPU hardware [8]. This ef- ciency advantage is crucial for edge deployment scenarios where models must run on embedded processors with limited computational resources.
3. Implications and Limitations
The practical implications of this work extend beyond the specic accuracy numbers. By demonstrating that effective fault diagnosis can be achieved with lightweight algorithms

and limited labeled data, we lower the barriers to adop- tion for facilities that might lack data science expertise or extensive fault databases. The interpretable nature of the approachshowing which features drive each diagno- sisbuilds trust with maintenance personnel who rightly demand understanding, not just predictions, from decision support tools.

The web-based dashboard we developed makes these capabilities accessible to end users without requiring pro- gramming skills or ML knowledge. Operators can upload vibration data, receive immediate fault diagnoses, and view feature trendsall through an intuitive interface that inte- grates into existing maintenance workows.

However, several limitations deserve attention. Our val- idation used exclusively bearing fault data from a single test rig, albeit under multiple operating conditions. Gener- alization to other machine typesgearboxes, pumps, com- pressorsremains to be demonstrated. While the feature extraction approach is broadly applicable, optimal feature sets likely vary across equipment types.

Real industrial environments introduce challenges beyond our test scenarios. Installations vibration can couple from nearby equipment, making it harder to isolate the target machines signature. Operating conditions may change more rapidly and unpredictably than in our controlled experiments. Sensors degrade over time, introducing drift that could be mistaken for developing faults.

Perhaps most signicantly, our models assume stationary operating conditions during each analysis window. Equip- ment experiencing frequent starts, stops, or load changes requires different analysis approaches, possibly incorporating dynamic models that account for transient behaviors.

Finally, while cross-domain and noise robustness tests show encouraging generalization, signicant operating con- dition changes still degrade performance. Practical deploy- ment likely requires periodic model updates as equipment ages and conditions evolvean operational consideration that extends beyond the technical scope of this research.
CONCLUSION

Summary of Key Findings
This research successfully developed and validated a practical hybrid machine learning framework for predic- tive maintenance in rotating machinery. Our two-stage ap- proachcombining One-Class SVM anomaly detection with Random Forest fault classicationachieved accurate fault diagnosis while maintaining computational efciency suit- able for edge deployment. The system demonstrated over 96% accuracy in classifying bearing faults across multiple fault types and operating conditions, with inference times measured in milliseconds.

Key ndings include the effectiveness of learning nor- mal operation patterns from unlabeled data, eliminating the need for extensive fault examples; the value of combining time-domain and frequency-domain features for robust fault characterization; the importance of rigorous data splitting to prevent optimistic performance estimates; and the feasibility

of high-accuracy fault diagnosis using lightweight algorithms appropriate for resource-constrained industrial environments. The interactive web dashboard successfully bridges the gap between ML model outputs and actionable maintenance decisions, demonstrating that sophisticated diagnostic capa- bilities can be made accessible to personnel without data

science backgrounds.
Contributions to the Field
Our work makes several contributions to the predictive maintenance research community. The hybrid two-stage ar- chitecture offers a practical solution to the labeled data scarcity problem that plagues supervised learning approaches in fault diagnosis. By separating anomaly detection from fault classication, we eable effective monitoring even when comprehensive fault databases are unavailable.

The comprehensive feature engineering framework com- bining statistical and spectral characteristics provides a tem- plate for vibration analysis applications. Our feature im- portance analysis helps identify which signal characteristics carry the most diagnostic valueinformation useful for both understanding fault mechanisms and optimizing future implementations.

Methodologically, we advance best practices through rig- orous attention to data leakage prevention via le-level splitting, and comprehensive robustness evaluation including cross-domain and noise injection experiments. These prac- tices, unfortunately rare in published research, are essential for honest assessment of real-world performance.

The emphasis on edge deploymentdemonstrated through lightweight algorithms and real-time inference capabili- tiesaddresses practical deployment constraints often over- looked in research focused solely on maximizing accuracy regardless of computational cost. Our detailed implementa- tion documentation facilitates reproducibility and provides a foundation for further research.
Recommendations for Future Research

Several promising directions emerge from this work. Inte- gration with IoT sensor networks would enable continuous automated data collection, supporting online learning where models update themselves as new data arrives. This could address the model drift problem, helping systems adapt as equipment ages and operating patterns evolve.

While we deliberately chose classical ML approaches for their efciency and interpretability, hybrid systems combin- ing feature-based methods with deep learning for automatic feature discovery warrant exploration. LSTM networks might capture temporal dependencies across windows that our xed-window approach misses, while CNN architectures could learn fault-specic time-frequency patterns directly from spectrograms.

Transfer learning offers potential to address the generaliza- tion challenge. Rather than training models from scratch for each new installation, transfer learning could adapt models pre-trained on one machine to new equipment with minimal

site-specic dataa crucial capability for industrial deploy- ment across diverse facilities.

Extension from diagnosis to prognosisestimating re- maining useful life rather than just identifying current faultswould provide even greater value for maintenance planning. Combining our fault classication capabilities with degradation models could predict not just what is wrong, but when intervention will become necessary.

Finally, eld deployment and validation in operating in- dustrial facilities remains essential. Laboratory benchmark datasets serve important purposes, but only real-world de- ployment exposes the full complexity of industrial environ- ments and validates practical utility. Partnerships with indus- trial sites willing to host pilot deployments would provide invaluable insights for system renement and identify de- ployment challenges not apparent in controlled experiments.

ACKNOWLEDGMENT

The authors express sincere gratitude to the management of Toc H Institute of Science and Technology for providing the necessary facilities and continuous support throughout this research. We are particularly thankful to Dr. N. Vish- wanath, our project guide, whose valuable insights and guidance were instrumental in shaping this work. We also acknowledge Dr. Sreela Sreedhar, Head of the Department of Computer Science and Engineering, for her encouragement and support.

References

B. Wu, W. Cai, and X. Zhang, A fault detection model for air handling units based on the machine learning algorithms, in Proc. IEEE IECON, Oct. 2021, doi: 10.1109/IECON43393.2021.9254466.
D. Leite, E. Andrade, D. Rativa, and A. M. A. Maciel, Fault Detection and Diagnosis in Industry 4.0: A Review on Challenges and Opportunities, Sensors, Dec. 2024, doi: 10.3390/s25010060.
H. Won and Y. Kim, Performance Analysis of Machine Learning Based Fault Detection for Cloud Infrastructure, in Proc. IEEE ICOIN, Jan. 2021, doi: 10.1109/ICOIN50884.2021.9333875.
J. Kim, S. Cho, S. Sim, C. Han, and S. Kim, Develop- ment for Electrical Fault Detection and Classication Analysis Model based on Machine Learning Algorithms, Apr. 2024, doi: 10.1109/SusTecp0925.2024.10553405.
J. Zwirtes, F. B. Líbano, L. A. d. L. Silva, and E. P. de Freitas, Fault Detection in Photovoltaic Systems Using a Machine Learning Approach, IEEE Access, vol. 13, Mar. 2025, doi: 10.1109/AC- CESS.2025.3547838.
L. Z. Rui, R. T. Naayagi, and G. Panda, Fault Detection and Diagnosis in Power Systems Using Machine Learning, Mar. 2025, doi: 10.1109/ICGEA64602.2025.11009801.
N. K. R. Pothireddy, Generative AI for Edge-Based Predictive Main- tenance in Smart Factories, Journal of Articial Intelligence & Cloud Computing, vol. 4, no. 1, Jan. 2025, doi: 10.47363/JAICC/2025(4)444.
R. de la Fuente, L. Radrigan, and A. S. Morales, Enhancing Predic- tive Maintenance in Mining Mobile Machinery through a TinyML- enabled Hierarchical Inference Network, IEEE Access, Apr. 2025, doi: 10.1109/ACCESS.2025.3557405.
Case Western Reserve University Bearing Data Center, Bearing Data Center Seeded Fault Test Data, Available: https://engineering.case.edu/bearingdatacenter
L. Breiman, Random Forests, Machine Learning, vol. 45, no. 1, pp. 532, 2001.