April 18, 2024, 2:07 p.m. | /u/Melodic_Reality_646

Machine Learning www.reddit.com

Each text has no more than 15 words, and the classes are highly imbalanced. But they all have at least 30 or so instances.

I was successful with data of the same nature but with around 15 labels with an ensemble of gradient boosting models.

Before diving into testing a bunch models I wondered if there’s some strategy to tackle high-dimensional problems like this one.

Some problems are just not solvable, let’s face it. But what would you guys try?

boosting classification data ensemble gradient instances labels least machinelearning nature testing text transformers words

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US