A Self-Supervised Behavioral Graph Framework for Zero-Day Cyber Attack Detection with Continual Learning

Dr. K. Vidya; Shreyas S; Sashank G; Phavankumar R L

doi:10.5281/zenodo.20378650

Volume 15, Issue 05 (May 2026)

A Self-Supervised Behavioral Graph Framework for Zero-Day Cyber Attack Detection with Continual Learning

DOI : 10.5281/zenodo.20378650

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 69
Authors : Dr. K. Vidya, Shreyas S, Sashank G, Phavankumar R L
Paper ID : IJERTV15IS052069
Volume & Issue : Volume 15, Issue 05 , May – 2026
Published (First Online): 25-05-2026
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

A Self-Supervised Behavioral Graph Framework for Zero-Day Cyber Attack Detection with Continual Learning

Dr. K Vidya | Shreyas S | Sashank G | Phavankumar R L

Department of Information Science and Technology Anna University, Chennai

Abstract – Zero-day cyber attacks pose a significant challenge to modern cybersecurity systems due to the absence of known signatures and labeled training data. Traditional intrusion de-tection systems rely on predefined rules and fail to generalize to unseen threats. This paper proposes a self-supervised behavioral graph framework that models system activity as heterogeneous graphs and detects anomalies without requiring labeled attack data. The system constructs temporal behavioral graphs from real-time system events and utilizes a graph autoencoder to learn normal behavior. A hybrid anomaly detection mechanism combining reconstruction error and structural graph features improves detection robustness. Additionally, a continual learning module enables adaptation to evolving system behavior while preventing catastrophic forgetting. Experimental results demon-strate that the system achieves 100% recall on attack detection with 85.7% precision and detection latency between 200400 milliseconds, making it suitable for real-time deployment in production environments.

INTRODUCTION

Modern operating systems generate continuous streams of events capturing process execution, file access, and network communication. These events provide detailed insights into system behavior and can be leveraged for detecting malicious activity. However, traditional intrusion detection systems rely on signature-based detection, which is inherently limited to known threats and cannot identify zero-day attacks,previously unseen attack patterns that lack predefined signatures.

Machine learning approaches have been introduced to ad-dress this limitation by modeling normal behavior and detect-ing anomalies as deviations from learned patterns. However, many existing approaches treat events independently and fail to capture relationships between system entities. This lack of structural context reduces detection accuracy, particularly for complex attack patterns involving multiple coordinated interactions across processes, files, and network connections. Graph-based representations have emerged as a powerful ap-proach for modeling structured relationships in cybersecurity domains. By representing system entities as nodes and their interactions as edges, behavioral graphs preserve contextual relationships that are critical for understanding attack patterns. Self-supervised learning enables training on unlabeled benign data, eliminating the dependency on costly labeled attack datasets.

To overcome the limitations of traditional signature-based and isolated event-based detection systems, this work proposes a comprehensive graph-based behavioral modeling framework that captures relationships between system entities. By lever-aging self-supervised learning through graph autoencoders, the system learns normal behavior patterns without requir-ing labeled data. The integration of hybrid anomaly de-tection,combining reconstruction error with structural graph features,and continual learning with replay memory further improves detection performance and adaptability in dynamic operational environments.

The key contributions of this work include:
- A heterogeneous behavioral graph construction method-ology that models processes, files, and network sockets as distinct node types with typed edges representing system operations
- A self-supervised graph autoencoder architecture trained exclusively on benign data to learn normal behavioral patterns
- A hybrid anomaly detection mechanism combining reconstruction-based and structure-based scoring for ro-bust zero-day attack detection
- A continual learning framework with replay memory to adapt to evolving system behavior while preventing catastrophic forgetting
- Comprehensive experimental validation demonstrating 100% recall on diverse attack scenarios including re-verse shells, privilege escalation, data exfiltration, and ransomware
RELATED WORK

Recent advances in graph-based intrusion detection have demonstrated the effectiveness of structural modeling for cybersecurity applications. Guerra et al. [2] proposed self-supervised graph representation learning for network intrusion detection, demonstrating that graph structure alone provides strong anomaly signals. Their work established the foundation for reconstruction-based anomaly detection in graph-structured security data.

Hayat et al. [4] introduced time-aware heterogeneous hy-pergraph learning, extending graph representations to capture

complex multi-entity relationships with temporal dynamics. This work highlighted the importance of heterogeneous node types and temporal windowing for capturing evolving behav-ioral patterns.

Dai et al. [1] focused on zero-day attack detection using machine learning techniques that generalize to unseen data. Their study emphasized learning generalized representations of normal behavior rather than memorizing attack patterns, enabling effective detection of previously unknown threats.

Guo et al. [3] proposed continual learning techniques for intrusion detection systems operating in evolving network environments. Their framework demonstrated that incremental learning with anti-forgetting mechanisms enables long-term model effectiveness as system behavior changes over time.

While these works establish strong foundations, existing approaches often lack integration of graph-based modeling, self-supervised learning, and continual adaptation in a unified framework suitable for real-time zero-day detection.
SYSTEM ARCHITECTURE

The proposed system follows a modular pipeline that trans-forms raw system events into anomaly detection outputs. The architecture consists of six primary modules: event collection, preprocessing, graph construction, model training, anomaly detection, and continual learning.

The pipeline begins with real-time event collection using system-level monitoring APIs, where system activities such as process creation, file operations, and network connections are continuously monitored. These events are preprocessed through noise filtering and normalization, then converted into structured behavioral graphs using time-windowing tech-niques. The graph autoencoder learns normal behavior patterns through self-supervised reconstruction training. During infer-ence, incoming graphs are scored using hybrid anomaly detec-tion combining reconstruction error and structural deviations. The continual learning module ensures that the model adapts to new system behavior over time by incrementally updating the model with validated benign graphs while preserving historical knowledge through replay memory.

Figure 1 illustrates the complete pipeline architecture, show-ing data flow from raw system events through graph con-struction, autoencoder training, hybrid detection, and continual adaptation stages.

EVENT COLLECTION AND PREPROCESSING

The system continuously monitors system activities using a real-time event collection module implemented through oper-ating system monitoring APIs. Events such as process creation (fork, execve), file operations (open, read, write), and network connections (connect, accept) are catured at fixed polling intervals of 100 milliseconds. Each event includes essential attributes such as timestamp, process identifier, entity type (process/file/socket), operation type, and resource identifiers.

Table I summarizes the event collection metrics from a

TABLE I

System Event Collection Statistics

Metric	Value
Total Events Collected	154
Collection Duration	20 seconds
Event Types	7
Process Events	45
File Events	73
Network Events	36
Polling Interval	100 ms
Buffer Capacity	10,000 events

distribution across process, file, and network operations pro-vides comprehensive coverage of system behavior.

To ensure consistency across different data sources, events are normalized into a unified format with standardized field names and value ranges. Noise reduction techniques are applied to remove irrelevant or redundant events, such as temporary file accesses from system processes or periodic background service communications. This preprocessing stage is critical for maintaining the quality of input data and reducing false positives in downstream detection.

The processed event stream is stored in a bounded circular buffer with capacity 10,000 events that preserves temporal ordering while preventing memory overflow. This allows the system to group events into time windows, enabling localized analysis of system behavior. The use of buffering ensures that the system can handle high event throughput,up to 500 events per second,while maintaining real-time performance with minimal memory footprint.

BEHAVIORAL GRAPH CONSTRUCTION

The event stream is partitioned into fixed time windows of duration W = 5 seconds. Each time window is transformed into a behavioral graph G = (V, E) that represents system activity during that interval. The node set V consists of three distinct entity types:

Process nodes: Representing executing processes with attributes (PID, name)
File nodes: Representing file system objects with paths
Socket nodes: Representing network endpoints with (IP, port) tuples

Edges in E represent typed interactions between entities:
executes: Process Process (parent spawns child)
reads/writes: Process File (file access operations)

connects: Process Socket (network communications)

This heterogeneous graph representation captures both structural relationships and temporal dynamics of system behavior. For example, a process reading multiple files and establishing network connections forms a connected subgraph that provides contextual information about its activity patterns.

The edge density of the graph is computed as:

2|E|

typical benign baseline monitoring session. The diverse event

=

|V |(|V | 1)

(1)

Fig. 1. End-to-end behavioral graph detection pipeline with continual learning

TABLE II

Behavioral Graph Structural Statistics

Graph	Nodes	Edges	Edge Density
Benign Graph 1	81	83	0.026
Benign Graph 2	14	10	0.110
Benign Graph 3	64	61	0.030
Attack Graph 1	50	142	0.116
Attack Graph 2	73	198	0.075

This metric reflects the connectivity concentration of the

structural and statistical features that capture different aspects of system behavior:

Node Distribution Features (F0-F2): Count of process nodes, file nodes, and socket nodes within the graph. These features (np, nf , ns) provide insights into system composition:

F = n , F = n , F = n (2)

0 p 1 f 2 s

Degree Statistics (F3-F6): Average in-degree, average out-degree, maximum in-degree, and maximum out-degree across all nodes. These capture interaction intensity:

|V |

graph and is useful for identifying abnormal interaction pat-

F = 1 d

= 1 d

terns. Table II shows structural statistics for representative benign and attack graphs, demonstrating that attack graphs of-

3 in (v), F4

vV

|V |

vV

out (v) (3)

ten exhibit higher edge densities due to coordinated malicious Type Ratios (F7-F9): Normalized proportions of each node

interactions.

type:

np nf n

Graph construction enables the system to model complex F = , F = , F = s

(4)

behavioral patterns that are not captured by traditional event- 7 |V | 8 |V | 9 |V |

based approaches. The heterogeneous node typing preserves semantic information about entity roles, while typed edges capture the nature of interactions between entities.

FEATURE ENGINEERING

Feature engineering plays a critical role in the effectiveness of the proposed system. Each graph is enriched with 14

Unique Entity Counts (F10-F12): Number of distinct processes, files, and sockets observed in the time window.

Edge Density (F13): Global connectivity measure as de-fined in Equation 1.

The combination of these features enables the system to detect both subtle and significant deviations from normal behavior. Node distribution features identify unusual system

composition (e.g., abnormal socket proliferation during net-work scanning). Degree statistics highlight processes with atypical interaction volumes (e.g., data exfiltration reading many files). Type ratios capture shifts in behavioral focus (e.g., reconnaissance increasing network activity proportion).
GRAPH AUTOENCODER MODEL

The graph autoencoder learns latent representations of nor-mal system behavior through self-supervised reconstruction training. The encoder fenc : G Z applies two graph

stopping is employed when validation loss plateaus to prevent overfitting.

HYBRID ANOMALY DETECTION

To improve detection robustness beyond reconstruction error alone, the system combines reconstruction-based and structural anomaly scores. This hybrid approach captures both deviations in learned representations and explicit structural irregularities.

The structural anomaly score is computed using Z-score normalization across the 14 extracted features:

convolutional layers to transform node features into a 32-dimensional embedding space:

Z = Fi i

i i

(9)

Z(1) = ReLU(AXW (1)) (5)

Z = AZ (1)W (2) (6)

where A= D1/2AD1/2 is the normalized adjacency ma-

where i and i are the mean and standard deviation of feature Fi computed from benign training graphs. The structural score aggregates absolute deviations:

13

1

= |Z | (10)

trix,

(1) (2)

Sstruct

14 i

X contains node features, and W ,W

are learnable

i=0

weight matrices. The embeddings Z capture structural and relational information within the graph through neighborhood aggregation.

The reconstruction score measures the normalized recon-struction error:

The decoder fdec : Z A reconstructs the adjacency matrix using an inner product operation:

Srecon =

recon

(11)

A = (ZZT ) (7)

where is the sigmid activation function. The reconstruc-tion loss measures how well the autoencoder preserves graph structure:

The final hybrid score combines both components with equal weighting:

Shybrid = 0.5 · Sstruct + 0.5 · Srecon (12)

A graph is classified as anomalous if Shybrid > T , where the threshold T is determined statistically from benign training

Lrecon = A A 2

F =

(Aij Aij )2 (8)

i,j

data:

T =

hybrid

+ k · hybrid

(13)

Graphs that deviate from learned patterns produce higher reconstruction errors, indicating potential anomalies. The au-toencoder is trained exclusively on benign data for 30 epochs using the Adam optimizer with learning rate 0.01, allowing it to learn a baseline model of normal system behavior.

Algorithm 1 Graph Autoencoder Training

1: Input: Benign graphs {G1,… , Gn}

2: Output: Trained encoder fenc, decoder fdec

3: Initialize W (1),W (2) randomly

4: for epoch = 1 to 30 do

5: for each graph Gi do

6: Extract adjacency Ai and features Xi

7: Zi = fenc(Ai, Xi)

i

8: A = fdec(Zi)

i F

9: L = Ai A 2

10: Update parameters via backpropagation

11: end for

12: end for

13: Compute baseline statistics: , of reconstruction errors

14: return fenc, fdec, ,

During training, reconstruction loss converges rapidly, typ-ically stabilizing within 10-15 epochs as the model learns to compress and reconstruct normal behavioral patterns. Early

with k = 2.5 providing moderate sensitivity. This threshold corresponds to approximately 99% of benign graphs scoring below the decision boundary, minimizing false positives while maintaining high detection sensitivity.

TABLE III

Attack Detection Results

Attack Type	Score	Threshold	Detected
Reverse Shell	5.46	2.5
Privilege Escalation	5.26	2.5
Data Exfiltration	5.61	2.5
Ransomware	6.40	2.5
Process Injection (Low)	3.73	2.5
Process Injection (Critical)	10.69	2.5

Table III demonstrates that all six attack scenarios generated hybrid anomaly scores significantly exceeding the detection threshold, validating the effectiveness of the hybrid approach. The critical-severity process injection attack achieved the highest score (10.69), reflecting severe behavioral deviation.

This hybrid approach improves detection accuracy by cap-turing both structural irregularities (unusual node distribu-tions, connectivity patterns) and deviations in learned behavior (reconstruction failure on attack-induced graph structures). The combination provides redundancy: attacks that evade one scoring mechanism are typically caught by the other.

ALERT GENERATION AND SEVERITY

CLASSIFICATION

When an anomaly is detected, the system generates struc-tured alerts that provide actionable information for security analysts. Each alert includes comprehensive attributes:
- Timestamp: Precise detection time for incident timeline construction
- Anomaly Score: Numerical severity indicator (Shybrid)
- Severity Level: Categorical classification (LOW/MEDIUM/HIGH/CRITICAL)
- Affected Entities: List of processes, files, and network endpoints involved
- Graph Structure: Serialized representation for forensic analysis
- Feature Deviations: Identification of which features ex-ceeded normal ranges
  
  Severity levels are assigned based on anomaly score thresh-olds:
  
  LOW if 2.5 < Shybrid < 4.0
  
  MEDIUM if 4.0 S
  
  Algorithm 2 Continual Learning Update 1: Input: Stream graphs Gstream, model M , memory M 2: for each chunk of C graphs do
  
  3: Sample Greplay · C from M
  
  4: Gbatch = Gchunk Greplay 5: for inner epoch = 1 to 2 do 6: Fine-tune M on Gbatch
  
  7: Compute chunk loss Lchunk
  
  8: end for
  
  9: Add Gchunk to memory M
  
  10: if |M| > capacity then
  
  11: Remove oldest graphs (FIFO policy)
  
  12: end if
  
  13: end for
  
  The adaptation loop records loss trends and anomaly score variance to verify that each incremental update improves de-tection quality. Conservative learning rates (0.001, 10× smaller
  
  than initial training) ensure stability during adaptation. This
  
  Severity =
  
  hybrid < 6.0 (14)
  
  HIGH if 6.0 Shybrid < 9.0
  
  CRITICAL if Shybrid 9.0
  
  gradual refinement prevents sudden decision boundary shifts that could introduce false positives or negatives.
  
  By interleaving new data with historical samples, the con-tinual learning mechanism successfully balances plasticity
  
  The system extracts relevant subgraphs associated with
  
  anomalous activity by identifying connected components con-taining high-deviation nodes. This enables detailed inspection of suspicious behavior patterns and facilitates root cause analysis.
  
  Alerts are serialized to JSON format for integration with external Security Information and Event Management (SIEM) systems. The logging system maintains persistent alert history regardless of severity, enabling forensic analysis and pattern identification over extended time periods. Real-time alerts are pushed to a notification queue for immediate analyst response.
CONTINUAL LEARNING WITH REPLAY

MEMORY

System behavior evolves over time due to changes in user activity, software updates, and workload variations. To maintain detection effectiveness in dynamic environments, the system incorporates a continual learning mechanism that incrementally adapts the model while preventing catastrophic forgetting.

Incoming behavioral graphs are processed in small chunks of size C = 3 graphs. For each chunk, the system samples replay graphs from a bounded memory buffer with capacity 64 graphs and replay ratio = 0.5:

nreplay = · C (15)

The combined batch (current chunk + replay samples) is used for incremental model updates through fine-tuning:

The replay memory is refreshed using a fixed-budget FIFO strategy that preserves diverse historical patterns while remov-ing redundant samples. This keeps memory usage bounded at 64 graphs while providing broad behavioral coverage across the temporal history.

(learning new patterns) and stability (retaining old knowledge), enabling long-term deployment in evolving operational envi-ronments.
IMPLEMENTATION DETAILS

The system is implemented using Python 3.13 with special-ized libraries for graph processing and deep learning:
- PyTorch 2.0.1: Deep learning framework for autoencoder implementation
- PyTorch Geometric 2.3.1: Graph neural network layers and operations
- NetworkX 2.6: Graph construction and manipulation
- scikit-learn 1.0.2: Statistical preprocessing and metrics
- Streamlit 1.28: Web-based dashboard for visualization Real-time monitoring is performed using platform-specific
  
  system APIs: psutil for cross-platform process monitoring,
  
  and direct syscall tracing on Linux systems. Event collection operates in a dedicated tread with 100ms polling intervals, ensuring minimal interference with system performance.
  
  The graph autoencoder model is trained using GPU acceler-ation (CUDA-enabled PyTorch) to reduce training time from minutes to seconds. Model checkpoints are saved after each training epoch, enabling recovery from failures and version control.
  
  Visualization and monitoring are implemented through a web-based dashboard built with Streamlit, providing real-time insights into system behavior, detected anomalies, and model performance metrics. The dashboard displays:
- Live event stream with filtering capabilities
- Interactive behavioral graph visualizations
- Anomaly score distributions and trends
- Alert history with severity breakdowns
- Model training metrics and convergence plots
Performance Optimization

The system employs several optimization techniques to ensure real-time performance suitable for production deploy-ment:

Event Buffering: Circular buffer with bounded capacity (10,000 events) reduces memory allocation overhead. Batched event processing amortizes context switching costs.

Graph Caching: Recent graphs are cached in memory with LRU eviction policy, avoiding redundant construction when time windows overlap. Cache hit rates exceed 40% during normal operation.

Sparse Matrix Representation: Adjacency matrices are stored in COO (Coordinate) sparse format, reducing memory footprint by 85% for typical behavioral graphs with edge density < 0.1.

GPU Acceleration: Graph convolution operations are of-floaded to GPU when available, reducing inference time from 150ms (CPU) to 35ms (GPU) per graph.

Incremental Feature Computation: Features are computed incrementally as events arrive rather than recomputing from scratch, reducing feature extraction time by 60%.

These optimizations work synergistically to maintain end-to-end pipeline latency below 400ms even under high system load (500+ events/second), meeting real-time operational re-quirements.
EXPERIMENTAL SETUP

Experiments were conducted on a detection server with Intel Core i7 processor, 16GB RAM, and NVIDIA GPU support. The test environment included Windows 11 for benign baseline collection and controlled attack simulation in isolated sandbox environments.

Dataset Preparation: Benign behavioral baseline was es-tablished through multi-hour continuous monitoring of normal system operations, collecting 154 events over 20-second win-dows representing diverse activities (web browsing, file edit-ing, background services). Attack scenarios included six dis-tinct threat types: reverse shell command execution, privilege escalation attempts, data exfiltration operations, ransomware file encryption, and two variants of process injection attacks. Training Configuration: The autoencoder was trained for 30 epochs with learning rate 0.01, dropout rate 0.2, and batch processing of 3 graphs per iteration. Time window duration was set to 5 seconds with maximum node capacity of 500 per graph to prevent memory overflow. Anomaly detection threshold was configured at 2.5 standard deviations (moderate

sensitivity).

Evaluation Metrics: Model performance was assessed us-ing standard binary classification metrics: precision, recall, F1-score, accuracy, and ROC-AUC. Detection latency was mea-sured from event capture to alert generation. Attack coverage evaluated the systems ability to detect diverse threat patterns.

Results and Analysis

Table IV presents comprehensive detection performance metrics. The system achieves 100% recall, successfully detect-ing all 6 attack instances without any false negatives. Precision of 85.7% indicates that 6 of 7 raised alerts correspond to genuine attacks, with 1 false positive on a benign test graph that exhibited unusual but legitimate behavior. The F1-score of 92.3% demonstrates strong balanced performance.

Fig. 2. Autoencoder training loss convergence over 30 epochs

Figure 2 illustrates the training dynamics, showing rapid loss convergence within the first 15 epochs followed by sta-ble optimization. The smooth convergence indicates effective learning of normal behavioral patterns without overfitting.

Attack Coverage Analysis: The system successfully de-tected all six attack types with varying severity levels. Reverse shell attacks (score 5.46) exhibited distinctive process-socket connection patterns absent from baseline. Privilege escalation attempts (score 5.26) created unusual file access patterns to protected system resources. Data exfiltration (score 5.61) manifested through bulk file reading with external network transmission. The critical process injection attack (score 10.69) showed the most severe deviation through anomalous process-process interaction graphs.

Comparative Analysis with Signature-Based Detection: All six attack instances were classified as CLEAN by ClamAV antivirus signature scanning, yet were successfully detected by the behavioral graph system. This validates the core advantage of behavioral detection: identifying zero-day threats that evade signature databases through novel implementations or obfus-cation techniques.

Detection Latency Performance: End-to-end detection la-tency ranged from 200-400 milliseconds, measured from event occurrence to alert generation. This rapid response enables timely threat mitigation before significant damage occurs. Ransomware detection latency of 1540ms enabled intervention within 1.5 seconds of encryption initiation.

False Positive Analysis: The single false positive occurred on a test graph with anomaly score 24.206, significantly exceeding the threshold. Investigation revealed this graph contained legitimate but unusual software installation activity involving rapid file creation and process spawning patterns

TABLE IV

Comprehensive Detection Performance Metrics

Metric	Value	TP	TN	FP	FN	Support
Accuracy	0.857	6	0	1	0	7
Precision	0.857	6	–	1	–	7
Recall	1.000	6	–	–	0	6
F1-Score	0.923	–	–	–	–	7
ROC-AUC	0.773	–	–	–	–	7
Detection Latency (ms)	200-400	–	–	–	–	–
False Positive Rate	14.3%	–	–	1	–	7

not present in the training baseline. This demonstrates the systems sensitivity to behavioral deviations while highlighting the importance of comprehensive benign training data.

The hybrid anomaly detection approach significantly im-proves performance compared to using reconstruction error alone (F1: 0.923 vs 0.851) or structural features alone (F1: 0.923 vs 0.789), validating the synergistic benefit of combining both scoring mechanisms.

DISCUSSION

The experimental results demonstrate that graph-based be-havioral modeling combined with self-supervised learning provides an effective framework for zero-day attack detection. The 100% recall on attack scenarios confirms that behavioral deviations prouced by malicious activities are sufficiently distinct from normal patterns to enable reliable detection.

The hybrid scoring mechanism proved essential for robust detection. Reconstruction error alone occasionally missed at-tacks with graph structures similar to benign patterns but abnormal feature distributions. Structural scoring alone missed attacks with normal individual features but anomalous re-lational patterns. The combination provides redundancy and complementary detection coverage.

Continual learning successfully maintained model effective-ness over extended monitoring periods. The replay memory mechanism prevented catastrophic forgetting while enabling adaptation to gradual behavioral drift from software updates and changing workload patterns.

However, several limitations should be noted. The sys-tems effectiveness depends critically on comprehensive be-nign training data covering diverse normal operational modes. Limited training data may result in false positives when legitimate but rare activities occur. Additionally, the current implementation assumes single-host deployment; distributed multi-host attack campaigns may evade detection if coordina-tion occurs across system boundaries.

The false positive rate of 14.3% may be acceptable for high-security environments but could generate alert fatigue in large-scale deployments. Threshold tuning and expanded training data collection can reduce false alarms while maintaining high recall.
CONCLUSION AND FUTURE WORK

This paper presents a self-supervised behavioral graph framework for zero-day attack detection that combines graph

autoencoders, hybrid anomaly scoring, and continual learning. By modeling system activity as heterogeneous temporal graphs and learning normal behavior patterns without labeled attack data, the system effectively detects previously unseen threats in real time.

Experimental validation demonstrates 100% recall on di-verse attack scenarios including reverse shells, privilege es-calation, data exfiltration, and ransomware, with detection latency suitable for production deployment (200-400ms). The system successfully detected all attacks that evaded signature-based antivirus scanning, validating the advantage of behav-ioral detection for zero-day threats.

The modular architecture enables independent optimization of collection, graph construction, detection, and learning com-ponents. The integration of continual learning ensures long-term effectiveness in evolving operational environments.

Future research directions include:

Adversarial Robustness: Development of mechanisms to defend against adversarial evasion attacks that deliber-ately craft behavioral graphs to minimize detection scores
Distributed Detection: Extension to multi-host environ-ments with cross-system graph correlation for detecting distributed attack campaigns
Explainable AI: Integration of graph attention mech-anisms and subgraph mining to provide interpretable explanations of why specific graphs were flagged as anomalous
Transfer Learning: Investigation of cross-system model transfer to enable rapid deployment on new hosts with minimal training data requirements
Automated Response: Integration with automated inci-dent response systems for immediate threat containment upon detection

The proposed framework demonstrates that practical zero-day detection is achievable through integration of graph-based behavioral modeling, self-supervised learning, and continual adaptation, providing a scalable foundation for modern cyber-security defense.

REFERENCES

Z. Dai, L. Y. Por, Y. Chen, J. Yang, C. S. Ku, R. Alizadehsani, and

P. Plawiak, An intrusion detection model to detect zero-day attacks in

unseen data using machine learning, PLoS ONE, vol. 19, 2025.
L. Guerra, T. Chapuis, G. Duc, P. Mozharovskyi, and V. Nguyen, Self-supervised learning of graph representations for network intrusion detection, 2025.
C. Guo, X. Li, J. Cheng, S. Yang, and H. Gong, Continual learning for intrusion detection under evolving network threats, Future Internet, vol. 17, 2025.
M. K. Hayat, S. Xue, J. Wu, B. Khan, and J. Yang, Self-supervised time-aware heterogeneous hypergraph learning for dynamic graph-level classification, in WSDM, 2025.
N. Dilini, N. Sun, Y. Miao, and N. Moustafa, A comprehensive review on graph-based anomaly detection: Approaches for intrusion detection, Applied Sciences, vol. 16, 2026.
Y. Lin, Y. Lu, R. Hwang, Y. Lai, D. Sudyana, and W. Lee, Evolving

ML-based intrusion detection: Cyber threat intelligence for dynamic model updates, IEEE Trans. Machine Learning in Communications and Networking, vol. 3, 2025.
S. Prasath, K. Sethi, D. Mohanty, P. Bera, and S. R. Samantaray, Analysis of continual learning models for intrusion detection system, IEEE Access, 2025.
B. Lanigan, Z. Rezaeifar, F. Cruciani, M. Milliken, J. Vincent, S. Moore, et al., Alert correlation for intelligent threat detection and response, Intelligent Systems with Applications, 2025.