Abstract
Anomaly Detection (AD) is an important means of reliably detecting malicious behavior and attacks on computer systems. Log data is a rich source of information about systems and thus provides a suitable input for AD. Given the sheer amount of log data available today, Machine Learning (ML) and its further development, Deep Learning (DL), have been applied for years to create models for AD. Especially when processing complex log data, DL often achieves better performance than ML. To detect anomalous patterns that span multiple log lines, it is necessary to group these log lines into log-event sequences. This work uses a Long Short-Term Memory (LSTM) model for AD, which is one of the most important approaches for representing long-range temporal dependencies in log-event sequences of arbitrary length. This means that we use past information to predict whether future events are normal or anomalous. For the LSTM model, we adapt a state-of-the-art open-source implementation called LogDeep. For the evaluation, we use a Hadoop Distributed File System (HDFS) data set, which is well studied in current research, and an open-source Audit data set provided by the Austrian Institute of Technology (AIT). In this paper, we show that without padding, a common preprocessing step that strongly influences the AD process and artificially improves detection results (and thus accuracy in lab testing), it is not possible to achieve the same high quality of results reported in the literature. Furthermore, we analyze limitations of DL approaches applied to AD and list future research priorities and design challenges.
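The grouping and padding steps mentioned in the abstract can be illustrated with a minimal sketch. It is not taken from LogDeep or from the paper: the function names, the `(session_id, event_id)` input format, the `PAD` key of 0, and the right-padding scheme are all assumptions chosen for illustration. It only shows why padding changes which sequences produce samples for next-event prediction.

```python
from collections import defaultdict

PAD = 0  # hypothetical padding key, assumed not to collide with real event IDs


def group_by_session(log_lines):
    """Group (session_id, event_id) pairs into per-session event sequences."""
    sequences = defaultdict(list)
    for session_id, event_id in log_lines:
        sequences[session_id].append(event_id)
    return dict(sequences)


def make_windows(sequence, window_size, pad=False):
    """Return (history, next_event) pairs for next-event prediction.

    Without padding, a sequence shorter than window_size + 1 yields no samples.
    With padding (an assumed scheme), such a sequence is right-padded with PAD
    up to window_size + 1 events, so it yields exactly one sample.
    """
    seq = list(sequence)
    if pad and len(seq) < window_size + 1:
        seq = seq + [PAD] * (window_size + 1 - len(seq))
    return [
        (tuple(seq[i:i + window_size]), seq[i + window_size])
        for i in range(len(seq) - window_size)
    ]


if __name__ == "__main__":
    # Toy log: session "a" has four events, session "b" has only two.
    logs = [("a", 1), ("b", 5), ("a", 2), ("a", 3), ("a", 4), ("b", 6)]
    for session_id, events in group_by_session(logs).items():
        print(session_id, make_windows(events, 3), make_windows(events, 3, pad=True))
```

In this toy setup, the short session contributes no samples unless it is padded, and the padded sample consists partly of artificial events. Whether and how this affects detection results in a real pipeline depends on the implementation and the data set.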
Original language | English |
---|---|
Title of host publication | EICC 2023: European Interdisciplinary Cybersecurity Conference |
Editors | Aleksandra Mileva, Steffen Wendzel, Virginia Franqueira |
Pages | 71-77 |
Number of pages | 7 |
Publication status | Published - 14 Jun 2023 |
Event | EICC 2023: European Interdisciplinary Cybersecurity Conference - Stavanger, Norway. Duration: 14 Jun 2023 → 15 Jun 2023 |
Conference
Conference | EICC 2023: European Interdisciplinary Cybersecurity Conference |
---|---|
Country/Territory | Norway |
City | Stavanger |
Period | 14/06/23 → 15/06/23 |
Research Field
- Cyber Security
Keywords
- Security and privacy
- Intrusion/anomaly detection and malware mitigation