Technique for preparation of large data for machine learning algorithms to generate intrusion detection system
Full Text |
Pdf
|
Author |
S. P. Senthilkumar and Aranga Arivarasan
|
e-ISSN |
1819-6608 |
On Pages
|
1539-1546
|
Volume No. |
18
|
Issue No. |
13
|
Issue Date |
September 13, 2023
|
DOI |
https://doi.org/10.59018/0723193
|
Keywords |
machine learning, security breaches, data preparation, shell scripting, data transformation.
|
Abstract
Machine Learning has become the norm for creating models that predict and detect security breaches in computer systems. Modelling of machine learning systems requires a huge amount of data for training and testing the machine learning algorithm. The present research is concerned with preparing the data prior to feeding it to machine learning algorithms. The necessity for such preparation occurs due to the physical limitations of spreadsheet applications (with regard to the number of records they can handle). This research describes a simple shell scripting approach to combine, extract, and weed out unwanted records and finally prepare the data for feeding to machine learning algorithms.
Back