all AI news
VulBERTa: Simplified Source Code Pre-Training for Vulnerability Detection. (arXiv:2205.12424v1 [cs.CR])
cs.LG updates on arXiv.org arxiv.org
This paper presents VulBERTa, a deep learning approach to detect security
vulnerabilities in source code. Our approach pre-trains a RoBERTa model with a
custom tokenisation pipeline on real-world code from open-source C/C++
projects. The model learns a deep knowledge representation of the code syntax
and semantics, which we leverage to train vulnerability detection classifiers.
We evaluate our approach on binary and multi-class vulnerability detection
tasks across several datasets (Vuldeepecker, Draper, REVEAL and muVuldeepecker)
and benchmarks (CodeXGLUE and D2A). The evaluation …
arxiv code detection pre-training simplified training vulnerability