April 2, 2024, 7:42 p.m. | Patrick Darwinkel

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.00056v1 Announce Type: cross
Abstract: We explored leveraging state-of-the-art deep learning, big data, and natural language processing to enhance the detection of vulnerable web server versions. Focusing on improving accuracy and specificity over rule-based systems, we conducted experiments by sending various ambiguous and non-standard HTTP requests to 4.77 million domains and capturing HTTP response status lines. We represented these status lines through training a BPE tokenizer and RoBERTa encoder for unsupervised masked language modeling. We then dimensionality reduced and concatenated …

abstract accuracy and natural language processing art arxiv big big data cs.cr cs.lg cs.ni data deep learning detection domains http improving language language processing natural natural language natural language processing processing server servers specificity standard state systems through transformer type versions vulnerable web

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne