ModSec-Learn: Boosting ModSecurity with Machine Learning

Scano, Christian; Floris, Giuseppe; Montaruli, Biagio; Demetrio, Luca; Valenza, Andrea; Compagna, Luca; Ariu, Davide; Piras, Luca; Balzarotti, Davide; Biggio, Battista

doi:10.1007/978-3-031-76459-2_3


arXiv:2406.13547 (cs)

[Submitted on 19 Jun 2024]

Title: ModSec-Learn: Boosting ModSecurity with Machine Learning

Authors: Christian Scano, Giuseppe Floris, Biagio Montaruli, Luca Demetrio, Andrea Valenza, Luca Compagna, Davide Ariu, Luca Piras, Davide Balzarotti, Battista Biggio


Abstract: ModSecurity is widely recognized as the standard open-source Web Application Firewall (WAF), maintained by the OWASP Foundation. It detects malicious requests by matching them against the Core Rule Set (CRS), a collection of rules that identify well-known attack patterns. Each rule is manually assigned a weight based on the severity of the corresponding attack, and a request is blocked if the sum of the weights of the matched rules exceeds a given threshold. However, we argue that this strategy is largely ineffective against web attacks, as detection is based only on heuristics and is not tailored to the application being protected. In this work, we overcome this issue by proposing ModSec-Learn, a machine-learning model that uses the CRS rules as input features. Through training, ModSec-Learn tunes the contribution of each CRS rule to its predictions, thus adapting the severity levels to the web application being protected. Our experiments show that ModSec-Learn achieves a significantly better trade-off between detection rate and false positive rate. Finally, we analyze how sparse regularization can reduce the number of rules that are relevant at inference time, discarding more than 30% of the CRS rules. We release our open-source code and the dataset.
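The two detection strategies contrasted in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the rule names, their weights, the threshold, and the eight toy requests are hypothetical and are not taken from the CRS or from the paper's dataset. The learned model is a plain L1-regularized logistic regression trained by proximal gradient descent, chosen here only as one simple instance of "sparse regularization"; the abstract does not say which model or solver the authors use.

```python
import math

# Hypothetical rule names and severity weights (NOT real CRS rule IDs or values).
RULE_WEIGHTS = {"r_sqli": 5, "r_xss": 5, "r_proto": 3, "r_noise": 2}
RULES = list(RULE_WEIGHTS)

def crs_blocks(matched, threshold=4):
    """Baseline: block when the summed weights of matched rules exceed the threshold."""
    return sum(RULE_WEIGHTS[r] for r in matched) > threshold

def featurize(matched):
    """One binary feature per rule: 1.0 if the rule fired on the request."""
    return [1.0 if r in matched else 0.0 for r in RULES]

def soft_threshold(v, t):
    """Proximal operator of the L1 penalty: shrink v toward zero by t."""
    return math.copysign(max(abs(v) - t, 0.0), v)

def train_l1_logreg(X, y, lam=0.05, lr=0.5, epochs=500):
    """Logistic regression with an L1 penalty on the weights (bias unpenalized)."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            z = b + sum(wj * xj for wj, xj in zip(w, xi))
            err = 1.0 / (1.0 + math.exp(-z)) - yi  # predicted probability minus label
            grad_b += err / n
            for j in range(d):
                grad_w[j] += err * xi[j] / n
        b -= lr * grad_b
        # Gradient step followed by shrinkage; uninformative rules end at exactly 0.
        w = [soft_threshold(wj - lr * gj, lr * lam) for wj, gj in zip(w, grad_w)]
    return w, b

def learned_blocks(matched, w, b):
    """Block when the learned model scores the request as more likely malicious."""
    z = b + sum(wj * xj for wj, xj in zip(w, featurize(matched)))
    return z > 0.0

# Toy traffic for one hypothetical application: (matched rules, label), 1 = attack.
# Here r_proto and r_noise also fire on benign requests.
TRAFFIC = [
    ({"r_sqli"}, 1), ({"r_sqli", "r_noise"}, 1),
    ({"r_xss"}, 1), ({"r_xss", "r_noise"}, 1),
    (set(), 0), ({"r_noise"}, 0),
    ({"r_proto"}, 0), ({"r_proto", "r_noise"}, 0),
]
X = [featurize(m) for m, _ in TRAFFIC]
y = [label for _, label in TRAFFIC]
W, B = train_l1_logreg(X, y)

# Rules whose learned weight is exactly zero can be skipped at inference time.
PRUNED = [r for r, wj in zip(RULES, W) if wj == 0.0]
```

On this toy data the fixed-weight baseline blocks the benign request that triggers `r_proto` and `r_noise` together, because their hand-assigned weights add up past the threshold. The learned model sees that these rules also fire on benign traffic for this application and does not block it, while the L1 penalty leaves `r_noise` with a zero weight so it ends up in `PRUNED`.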

Comments:	arXiv admin note: text overlap with arXiv:2308.04964
Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2406.13547 [cs.LG]
	(or arXiv:2406.13547v1 [cs.LG] for this version)
 	https://doi.org/10.48550/arXiv.2406.13547
Related DOI:	https://doi.org/10.1007/978-3-031-76459-2_3

Submission history

From: Giuseppe Floris
[v1] Wed, 19 Jun 2024 13:32:47 UTC (609 KB)
