May 31, 2023, 9:33 a.m. | /u/No_Refrigerator_1907

Natural Language Processing www.reddit.com

Hey,

I'm trying to build an NER model using a Bert-like model

My question is about the entities I'm trying to identify, All of them are composed of many words, they're usually seperatd by either a white space, comma, a dash or even a dot sometimes.

Here's an example :

* **QLQ-BR23** which is composed of 2 abreviations and seperated by a dash
* **Functional Assessment of Cancer Therapy - Lung**

I'm working on annotating my texts using Doccano but …

bert dash example hey identify languagetechnology ner space words

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

DevOps Engineer (Data Team)

@ Reward Gateway | Sofia/Plovdiv