March 2, 2024, 4:37 p.m. | /u/gamerx88

Machine Learning www.reddit.com

Curious what everybody is using to implement LLM powered apps for production usage and your experience with these toolings and advice.

This is what I am using for some RAG prototypes I have been building for users in finance and capital markets.

**Pre-processing\ETL:**
Unstructured.io + Spark, Airflow

**Embedding model:**
Cohere Embed v3
Previously using OpenAI Ada but Cohere has significantly better retrieval recall and precision for my use case. Also exploring other open weights embedding models

**Vector Database:**
Elasticsearch previously …

advice airflow apps building capital cohere embed embedding etl experience finance llm machinelearning markets pre-processing processing production rag spark stack tech tech stack unstructured usage

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Scientist

@ Publicis Groupe | New York City, United States

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India