April 23, 2024, 4:42 a.m. | Darya Likhareva, Hamsini Sankaran, Sivakumar Thiyagarajan

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.13078v1 Announce Type: cross
Abstract: Researchers must stay current in their fields by regularly reviewing academic literature, a task complicated by the daily publication of thousands of papers. Traditional multi-label text classification methods often ignore semantic relationships and fail to address the inherent class imbalances. This paper introduces a novel approach using the SciBERT model and CNNs to systematically categorize academic abstracts from the Elsevier OA CC-BY corpus. We use a multi-segment input strategy that processes abstracts, body text, titles, …

abstract academic arxiv bert class classification cnn cs.cl cs.lg current daily fields literature modeling paper papers publication relationships research researchers semantic text text classification through topic modeling type

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York