[D] Beyond embeddings for natural language search | allainews.com

Aug. 28, 2023, 7 p.m. | /u/marcus_hk

Machine Learning www.reddit.com

Paraphrasing some threads of the past few months, the bottleneck in QA-Retrieval seems to be retrieval, and vector search with embeddings does not, by itself, seem to be good enough for "production".

I recently came across [OpenEvidence](https://www.openevidence.com/) (not to be confused with [Open Evidence](https://open-evidence.com/)), which seems to do retrieval and references [pretty well](https://www.openevidence.com/ask/8bcf49ba-4103-4eb5-8c70-3d045897cef8). Digging into some of their [published papers](https://arxiv.org/abs/2302.08091) and [their LinkedIn page](https://www.linkedin.com/company/openevidence/about/), it looks like they built an ontology out of PubMed,

>By analyzing medical text and **extracting …

beyond biomedical embeddings good history language machinelearning medical natural natural language paraphrasing production relations retrieval science search text threads vector vector search

More from www.reddit.com / Machine Learning

[D] How did OpenAI go from doing exciting research to a big-tech-like company? 2 hours ago | www.reddit.com

capabilities engineering fast forward gpt4 +6

[D] Culture of Recycling Old Conference Submissions in ML 4 hours ago | www.reddit.com

conference conferences culture iclr +10

[D] How Do You Efficiently Conduct Ablation Studies in Machine Learning? 4 hours ago | www.reddit.com

fine-tuning grid insights machine +7

[P] N-way-attention 8 hours ago | www.reddit.com

algorithm attention concept every +12

[D] Is it possible to train ViTMAE with Hyperspectral Satellite Images? 19 hours ago | www.reddit.com

encoder format images learn +4

[D] Mamba Convergence speed 22 hours ago | www.reddit.com

class convergence dataset example +10

[P] Local RAG with RETSim, Ollama and Gemma 1 day ago | www.reddit.com

gemma machinelearning notebooks ollama +3

[Project] Tabletop HandyBot: low-cost robotic arm assistant for tabletop tasks 1 day, 2 hours ago | www.reddit.com

arm assistant cost functional +9

[R] Grounding DINO 1.5 Release: the most capable open-set detection model 1 day, 2 hours ago | www.reddit.com

building dataset detection foundation +12

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net