Jan. 31, 2024, 11:13 p.m. | /u/Seankala

Machine Learning www.reddit.com

My team and I are currently trying to implement a search functionality for one of our products. As of now, we're trying to create a language model-based method and are comparing it against an Elasticsearch baseline (i.e., BM25).

The model that we've trained is a publicly available ELECTRA-based checkpoint. The model's been pre-trained on English and Korean data. We trained the model using sentence-level contrastive learning techniques introduced in various papers (e.g., the SimCSE model from EMNLP 2020). As of …

embeddings language language model machinelearning normal products search team vector vector search

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US