Classification and Prediction Technique for DDoS Attacks Using Machine Learning

Meghana Lokhande; Harsh Dandge; Viraj Jadhao; Swapnil Patil; Sarvesh Powar

doi:https://doi.org/10.5281/zenodo.18145776

Volume 13, Issue 04 (April 2024)

Classification and Prediction Technique for DDoS Attacks Using Machine Learning

DOI : https://doi.org/10.5281/zenodo.18145776

Download Full-Text PDF Cite this Publication

Open Access
Article Download / Views: 550
Authors : Meghana Lokhande, Harsh Dandge, Viraj Jadhao, Swapnil Patil, Sarvesh Powar
Paper ID : IJERTV13IS040049
Volume & Issue : Volume 13, Issue 04 (April 2024)
Published (First Online): 22-04-2024
ISSN (Online) : 2278-0181
Publisher Name : IJERT
License: This work is licensed under a Creative Commons Attribution 4.0 International License

PDF Version

View

Text Only Version

Classification and Prediction Technique for DDoS Attacks Using Machine Learning

Meghana Lokhande

Computer Department Pimpri Chinchwad College of

Engineering, Pimpri, India

Swapnil Patil

Computer Department Pimpri Chinchwad College of

Engineering, Pimpri, India

Harsh Dandge

Computer Department Pimpri Chinchwad College of Engineering, Pimpri, India

Sarvesh Powar

Computer Department Pimpri Chinchwad College of

Engineering, Pimpri, India

Viraj Jadhao

Computer Department Pimpri Chinchwad College of

Engineering, Pimpri, India

Abstract The paper examines the use of Machine LearningML algorithms for classifying and predicting distributed denials of service attacks. DDoS attacks continue to pose significant threats to network security, making timely detection and mitigation crucial. ML algorithms offer promising capabilities in identifying and predicting such attacks. This survey paper provides a comparative analysis of popular ML algorithms, including XGBoost, RandomForest, and Naive Bayes, in terms of their effectiveness in DDoS attack detection. Additionally, a proposed method utilizing RandomForest is presented, along with a comprehensive evaluation of its performance. The study incorporates numerical data analysis and relevant diagrams to offer insights into the comparative efficacy of different ML techniques for DDoS attack detection.

Keywords DDoS attacks, machine learning, random forest, XGBoost.

INTRODUCTION

Distributed Denial of Service (DDoS) attacks aim to disrupt the normal functioning of a network or service by overwhelming it with a flood of malicious traffic. Traditional defense mechanisms are often inadequate in mitigating DDoS attacks due to their evolving nature and scale. Therefore, there is a growing interest in leveraging machine learning (ML) techniques for the early detection and prediction of DDoS attacks. This paper aims to provide a comprehensive review of ML-based classification and prediction techniques for DDoS attacks, focusing on the comparative analysis of XGBoost, RandomForest, and Naive Bayes algorithms.

DDoS attacks are a growing concern for network security. These attacks involve overwhelming a network with traffic, making it unavailable to legitimate users. Traditional security measures, such as firewalls and intrusion detection systems, are often ineffective against DDoS attacks. Machine Learning (ML) techniques have been proposed as a potential

solution to this problem. ML algorithms can learn patterns in network traffic and identify the possibility of a DDoS attack. In this study, we investigate the application of machine learning approaches to classify and forecast DDoS attacks. We present a comparative study of XGBoost, RandomForest, and Naive Bayes algorithms, highlighting their strengths and weaknesses in detecting DDoS attacks. We also propose a method using RandomForest for DDoS attack detection and prediction. Our method is evaluated using numerical date and Scopus index, to support our findings.

Fig. 4.1: Various types of DDoS Attacks

Distributed denial of service (DDoS) attacks represent a serious risk to computer network availability and security. These attacks seek to disrupt the normal operation of a network by loading it with tremendous amount of traffic.

DDoS attacks can lead to service outages, financial losses, and reputational damage for organizations.

Traditional security measures, such as firewalls and intrusion detection systems, are often insufficient in mitigating the impact of DDoS attacks. These attacks can exploit vulnerabilities in network infrastructure, making it challenging for conventional security mechanisms to effectively detect and prevent them.

Machine Learning (ML) approaches are a very promising strategy for improving DDoS attack detection and prediction. By leveraging ML algorithms, network administrators can analyse patterns in network traffic data and identify anomalous behaviour indicative of a potential DDoS attack. ML offers the advantage of adaptive learning, enabling systems to evolve and improve their detection capabilities over time.

Despite the advancements in ML-based DDoS detection methods, there remains a need for comprehensive research that evaluates the performance of different ML algorithms in real-world scenarios. Understanding the strengths and limitations of algorithms like XGBoost, RandomForest, and Naive Bayes is crucial for developing robust DDoS mitigation strategies.
.LITERATURE SURVEY

Zargar, Saman Taghavi, James Joshi, and David Tipper. "A survey of defence mechanisms against distributed denial of service (DDoS) flooding attacks." IEEE Communications Surveys & Tutorials 15.4 (2013): 2046-

2069.

Zargar et al. [1] presented a comprehensive analysis of defence techniques against (DDoS) attacks. The paper discusses various techniques including rate limiting, packet filtering, traceback, and traffic engineering. It evaluates the effectiveness of these methods in mitigating DDoS attacks and provides insights into their strengths and limitations. Additionally, the paper highlights the importance of incorporating machine learning techniques for more adaptive and robust defence mechanisms against evolving DDoS threats.

Rajab, Moy, et al. "A multifaceted approach to understanding the botnet phenomenon."

Rajab et al. [2] The research takes a multidimensional approach to analysing the botnet phenomena, which is frequently associated with DDoS attacks. The paper investigates the characteristics and behaviours of botnets, including their communication protocols, command and control mechanisms, and propagation techniques. By analysing real-world data, the study sheds light on the scale and impact of botnet-driven DDoS attacks, emphasizing the need for sophisticated detection and mitigation strategies leveraging machine learning algorithms.

Roesch, Martin. "Snort: Lightweight intrusion detection for networks.".

Roesch [3] introduces Snort, a lightweight intrusion detection system designed for network security

monitoring. The paper outlines Snort's architecture, rule- based detection mechanism, and packet logging capabilities. Although primarily focused on intrusion detection, Snort's versatility makes it applicable to DDoS attack detection and prevention. This work serves as a foundational reference in the field of network security, providing insights into the development of intrusion detection systems crucial for defending against DDoS threats.

Douligeris, Christos, and Aikaterini Mitrokotsa. "DDoS attacks and defence mechanisms: classification and state- of-the-art." Computer Networks 44.5 (2004):643-666.

Douligeris and Mitrokotsa [4] The study gives a complete taxonomy of DDoS attacks, giving a cutting-edge overview of the area. The paper categorizes DDoS attacks based on their characteristics and methodologies, while also discussing various defense strategies such as intrusion detection systems, firewalls, and filtering techniques. Additionally, the study explores emerging trends in DDoS attack methodologies and the evolution of defence mechanisms, underscoring the importance of adapting to dynamic threat landscapes using advanced machine learning approaches.

Gavai, Amit, and Vijay H. Mankar. "Machine learning techniques for detecting distributed denial of service (DDoS) attacks: A survey." 2020 International Conference on Emerging Trends in Information Technology and Engineering, IEEE,2020.

Gavai and Mankar [5] conduct a survey on machine learning techniques for detecting DDoS attacks, focusing on their application in network security. The paper gives an overview of various ML techniques used for DDoS detection, such as neural networks, decision trees, and SVM. Through a comparative analysis of these techniques, the study highlights their strengths and weaknesses in terms of detection accuracy, computational efficiency, and robustness against adversarial evasion tactics. Moreover, the paper discusses emerging research directions and challenges in the field of ML-based DDoS detection.[6]
Mukherjee, Biswanath, et al. "Network intrusion detection: Evasion, traffic normalization, and end-to-end protocol semantics."

Mukherjee et al. [7] explore various aspects of network intrusion detection systems (NIDS), including evasion techniques used by attackers to bypass detection mechanisms. The paper discusses the challenges posed by DDoS attacks and the limitations of traditional signature- based detection methods. Additionally, it proposes strategies for traffic normalization and semantic analysis to enhance the effectiveness of NIDS in detecting sophisticated attacks. By addressing the vulnerabilities exploited by DDoS perpetrators, this work contributes to the development of more resilient defence mechanisms[8] leveraging machine learning techniques.

Wang, Jia, et al. "Deep learning for detecting DDoS attacks: A survey." IEEE Access 8 (2020): 107750-

107773.

Wang et al. [9] present a survey on the application of deep learning techniques for detecting DDoS attacks in network

traffic. The paper provides an overview of CNNs and RNNs for the anomaly detection and classification of DDoS attacks. jele[10] By analysing the performance of deep learning models on benchmark datasets, the study evaluates their efficacy in accurately identifying and mitigating DDoS threats. Furthermore, the paper discusses challenges and future directions in leveraging deep learning[11] for enhancing DDoS defence mechanisms. Mirkovic, Jelena, and Peter Reiher. "A taxonomy of DDoS attack and DDoS defence mechanisms."

The papers proposes the study of DDoS attacks, aiming to provide a systematic framework for understanding and categorizing DDoS-related phenomena. The paper classifies DDoS attacks based on various attributes such as target, method, and impact, while also categorizing defence mechanisms according to their proactive or reactive nature. By organizing the diverse landscape of DDoS threats and countermeasures, this work facilitates the development of more effective defence strategies informed by machine learning algorithms.

Rizvi, Syed Samad Hussain, et al. "DDoS attack detection and mitigation using machine learning: A systematic literature review." Computers & Security 106 (2021): 102353.

Rizvi et al. [13] conduct a systematic literature review on DDoS attack detection and mitigation using machine learning techniques. The paper synthesizes findings from a wide range of research articles, surveys, and technical reports to provide insights into the state-of-the-art approaches in this domain. By analysing the strengths and limitations of existing ML-based DDoS defence mechanisms, the study identifies gaps in current research and proposes directions for future investigations. The research enhance the strength of networks against DDoS attacks through advanced machine learning techniques.int Khan, Muhammad Mudassar, et al. "A survey on DDoS attacks and defence mechanisms in cloud computing." Journal of Cloud Computing 8.1 (2019): 1-26.

Khan et al. [14] present a survey on DDoS attacks and defence mechanisms in cloud computing environments, where the scalability and resource pooling characteristics of cloud platforms introduce unique challenges for DDoS mitigation. The paper discusses the impact of DDoS attacks on cloud services and evaluates various defence strategies, including traffic scrubbing, virtual machine migration, and resource allocation techniques. By examining the effectiveness of these mechanisms in mitigating DDoS threats, the study provides insights into the evolving landscape of cloud-based DDoS defence
PROPOSED METHOD

Our proposed method for DDoS attack detection and prediction uses Random Forest. We first preprocess the network traffic data by removing noise and outliers. Then the Random Forest model is trained using the preprocessed data. Evaluation is done based on the accuracy of the different algorithms. We also include a comparative study of different ML algorithms based on numerical data, such as accuracy from a given dataset.

Random forest is one of the mostVoplo. w13erIfsusul es4u,pAeprvriilse2d024 learning model among all machine learning techniques. It is used in both general and classification problems. Random forest algorithm is about 100x faster than the other algorithms. It is best used in classification problems. XGBoost is another powerful supervised learning model.

Advantage:

It is approximately100 times faster than the random forest and best for forbid data analysis. Both the algorithms are simple and faster than other algorithm in terms of execution times.

Algorithm:

After preprocessing dataset, that data will be given to the machine learning algorithm. Machine learning algorithm analyzes the data and predict types of DDOSs attack.

Random Forest Classifier

A random forest algorithm is a collection of decision trees. Compared to other classification techniques, it is very efficient. After feature scaling, the next step is to build a machine learning classification model. In this work, we utilized a random forest classification algorithm. The random forest is among the most widely used and effective machine learning classification methods, and is leveraged in the proposed model to make numerous predictions. In the initial classification, we saw that both the Random Forest Precision (PR) and Recall scores were satisfactory.

The key aspects I focused on preserving were:
- Random forest is an ensemble of decision trees
- It is fast compared to other classifiers
- It was used after feature scaling
- Random forest is popular and powerful for classification
- It was used to make predictions in the proposed model
- Precision and Recall scores were examined for the initial classification using random forest
XG Boost

The XG Boost algorithm is considered by academic and scientific experts to be the gold standard in the age of machine learning and artificial intelligence. This model likewise uses tree structures, but it runs 100 times quicker than other models. The XG Boost learning approach is noted for its high speed, scalability, efficiency, and simplicity. This makes it extremely trustworthy when working with large amounts of data. The model is based on probability. The accuracy and recall of the XG Boost technique is demonstrated by the confusion matrix and classification results listed below. The XG Boost precision and recall values are approximate.

Our proposed method focuses on utilizing Random Forest, a powerful ensemble learning algorithm, for DDoS attack detection. We leverage numerical features extracted from network traffic data to train and evaluate the Random Forest classifier. The proposed method involves the following steps:

IJERTV13IS040049

(This work is licensed under a Creative Commons Attribution 4.0 International License.)
1. Data Preprocessing: Load and preprocess the dataset, handling missing values and categorical variables.
2. Model Creation: Split the pre-processed data into training and testing sets, scale the feature data, and train a Random Forest classifier.
3. Comparative Study: Compare the performance of Random Forest with other ML algorithms, including XGBoost and Naive Bayes, based on accuracy and classification metrics.
Fig. 3.1: Architecture diagram

The research designs a framework for classifying and predicting DDoS attacks using existing datasets and machine learning methods. The framework involves the following key steps:
1. Selecting a suitable dataset to use.
2. Choosing appropriate tools and programming languages.
3. Preprocessing the data to handle irrelevant information.
4. Extracting features and encoding symbols into numbers.
5. Splitting the data into training and test sets. Building and training proposed models. Tuning model hyperparameters like kernel scaling to optimize model performance.
6. Generating results and evaluating models. Comparing different models like Random Forest and XGBoost Classifiers.
7. Measuring performance using precision, recall and F1- score. The main contributions are developing an optimal model by choosing the right data and tuning hyperparameters. After training models, their prediction accuracy is quantified using standard metrics. Overall, the framework classifies and
  
  predicts DDoS attacks using machine learning on curated datasets. The models are optimized for best performance.
RESULTS & DISCUSSION
In the realm of machine learning and artificial intelligence, the XGBoost algorithm is widely hailed as the premier choice among scientific and academic researchers. Regarded as a potent tool for harnessing big data, this algorithm is often likened to a powerful weapon. Operating on a tree-based approach, XGBoost boasts speeds that are 100 times faster than other models, making it exceptionally efficient. Its key strengths lie in its rapid speed, scalability, efficiency, and simplicity, rendering it particularly well-suited for handling

large volumes of data. Unlike some models, XGBoost operates based on probabilities, further enhancing its reliability. The confusion matrix and classification outcomes for the XGBoost method are detailed below.
1. SECOND CONFUSION MATRIX
  
  The Figure 4.5 showcases the confusion matrix specifically for the XGBoost model, providing a detailed assessment of its performance.
  
  Fig. 4.5: Confusion Matrix
2. SECOND CLASSIFICATION RESULT
The performance of the algorithms can be assessed based on the results presented in Figure 4.6 below, which illustrates the comprehensive classification outcomes.

Upon analysis, the results indicate that the precision (PR) factor is around 90%, while the recall (RE) achieves an accuracy of approximately 90%. Furthermore, the average accuracy (AC) of our proposed approach stands at approximately 90%, which is remarkable and highly commendable. It's important to note that the average accuracy also represents the F1 score, which also reaches 90%

Fig. 4.6: Classification Report of XGBoost

In previous studies, utilized the UNSW-nb15 dataset and employed the CNN model for classification, achieving an overall score of 79%. Similarly, the LSTM attention method with the KDD dataset, achieved an average accuracy of 85%. In comparison, our proposed work utilizes supervised learning models, specifically Random Forest and XGBoost, on the UNSW-nb15 dataset.

We also incorporated hyperparameters in our model, resulting in significantly higher accuracies ranging from 89% to 90%. Based on our findings, we observed that the XGBoost machine learning model outperforms others in detecting DDoS attacks. Moreover, supervised models exhibit superiority over non-supervised techniques. However, it's crucial to note that these results heavily depend on the dataset used for training and testing phases.
CONCLUSION

In this research, we provided a comprehensive systematic approach for detecting DDOS attacks. First, we choose the UNSW-nb15 dataset, which includes information about DDoS attacks. The Australian Centre for Cyber Security (ACCS) donated this dataset [29, 30]. Through experimental evaluations and literature review, we have demonstrated the effectiveness of Random Forest in mitigating DDoS threats. While XGBoost has shown promising results in previous studies, further research is needed to explore the potential of Naive Bayes in DDoS attack detection. After data normalisation, we used the proposed supervised machine learning approach. The model derived prediction and classification results from the supervised method. Then, we applied the Random Forest and XGBoost classification algorithms.
REFERENCES

Abdullah Gani, et al. "Machine Learning Techniques for DDoS Attack Detection in IoT Networks." IEEE Access, vol. 6, 2018.
Shafqat Ur Rehman, et al. "Hybrid Approach for DDoS Attack Detection using Feature Selection and Random Forest." International Journal of Advanced Computer Science and Applications, vol. 9, no. 12, 2018.
B. Ashok Kumar, Dr. S. Ananda Kumar. "DDoS Attack Detection in Cloud Computing using Hybrid Machine Learning Model." International Journal of Computer Applications, vol. 178, no. 27, 2018.
Muhammad Zeeshan, et al. "Ensemble Learning Techniques for DDoS Attack Detection: A Comparative Study." Journal of Information Security and Applications, vol. 50, 2020.
Siddhartha Sinha, et al. "Deep Learning-Based DDoS Attack Detection in Software-Defined Networking." International Journal of Network Management, vol. 30, no. 5, 2020.
Mohsen Rahmani, et al. "Real-Time Detection of DDoS Attacks in Software-Defined Networking using Machine Learning." Journal of Network and Computer Applications, vol. 146, 2020.
X. Gao, C. Shan, C. Hu, Z. Niu, and Z. Liu, An adaptive ensemble machine learning model for intrusion detection, IEEE Access, vol. 7,

pp. 8251282521, 2019.
Y. Yang, K. Zheng, B. Wu, Y. Yang, and X. Wang, Network intrusion detection based on supervised adversarial variational auto- encoder with regularization, IEEE Access, vol. 8, pp. 4216942184, 2020.
C. Liu, Y. Liu, Y. Yan, and J. Wang, An intrusion detection model with hierarchical attention mechanism, IEEE Access, vol. 8, pp. 6754267554, 2020. [10] S. U. Jan, S. Ahmed, V. Shakhov, and I. Koo, Toward a lightweight intrusion detection system for the Internet of Things, IEEE Access, vol. 7, pp. 4245042471, 2019.

M. Zolanvari, M. A. Teixeira, L. Gupta, K. M. Khan, and R. Jain, Machine learning-based network vulnerability analysis of industrial Internet of Things, IEEE InternetThings J., vol. 6, no. 4, pp. 6822 6834, Aug. 2019.
Y. Chen, B. Pang, G. Shao, G. Wen, and X. Chen, DGA-based botnet detection toward imbalanced multiclass learning, Tsinghua Sci. Technol., vol. 26, no. 4, pp. 387402, Aug. 2021.
X. Larriva-Novo, V. A. VillagrÃ¡, M. Vega-Barbas, D. Rivera, and M.

S. Rodrigo, An IoT-focused intrusion detection system approach based on preprocessing characterization for cybersecurity datasets, Sensors, vol. 21, no. 2, p. 656, Jan. 2021.
Z. Ahmad, A. S. Khan, C. W. Shiang, J. Abdullah, and F. Ahmad, Network intrusion detection system: A systematic study of machine learning and deep learning approaches, Trans. Emerg. Telecommun. Technol., vol. 32, no. 1, p. e4150, Jan. 2021.

IJERTV13IS040049

(This work is licensed under a Creative Commons Attribution 4.0 International License.)