[P] Explore baseball history with vector search

June 9, 2023, 11:47 p.m. | /u/davidmezzetti

Machine Learning www.reddit.com

This project explores baseball history using similarity search, the [Baseball Databank](https://github.com/chadwickbureau/baseballdatabank) dataset available on GitHub, [Streamlit](https://github.com/streamlit/streamlit) and [txtai](https://github.com/neuml/txtai).

Raw data is automatically downloaded from the Baseball Databank project and indexed. Two separate indexes are created, one for batting stats and one for pitching stats. The indexing pipeline is the same for both and is shown below.
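To make the two-index idea concrete, here is a minimal sketch in plain Python. It is not the project's actual txtai pipeline: it assumes each season row is reduced to a handful of numeric columns and scaled to unit length. The column names follow the Baseball Databank CSV headers, but the choice of columns, the helper names `to_vector` and `build_index`, and the sample rows are all hypothetical.

```python
import math

# Assumed column subsets; the names follow the Batting.csv and Pitching.csv headers.
BATTING_COLUMNS = ["AB", "H", "HR", "RBI", "BB", "SO"]
PITCHING_COLUMNS = ["W", "L", "IPouts", "H", "ER", "SO"]


def to_vector(row, columns):
    """Convert one season row (a dict of strings) into a unit-length vector."""
    values = [float(row.get(column) or 0) for column in columns]
    norm = math.sqrt(sum(value * value for value in values))
    return [value / norm for value in values] if norm else values


def build_index(rows, columns):
    """Map (playerID, yearID) to a normalized stats vector.

    Simplification: a player with several stints in one year would
    overwrite earlier rows under the same key.
    """
    return {
        (row["playerID"], int(row["yearID"])): to_vector(row, columns)
        for row in rows
    }


# Made-up rows for illustration only, not real statistics.
batting_rows = [
    {"playerID": "player_a", "yearID": "2001", "AB": "500", "H": "150",
     "HR": "30", "RBI": "100", "BB": "60", "SO": "90"},
    {"playerID": "player_b", "yearID": "2002", "AB": "450", "H": "120",
     "HR": "10", "RBI": "55", "BB": "40", "SO": "70"},
]
pitching_rows = [
    {"playerID": "player_c", "yearID": "2001", "W": "15", "L": "8",
     "IPouts": "600", "H": "180", "ER": "70", "SO": "190"},
]

# The same function builds both indexes; only the column list differs.
batting_index = build_index(batting_rows, BATTING_COLUMNS)
pitching_index = build_index(pitching_rows, PITCHING_COLUMNS)
```

Sharing one build function across both datasets mirrors the post's point that the pipeline is the same for batting and pitching, with only the input columns changing.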

https://preview.redd.it/9bn3gb5yw25b1.png?width=720&format=png&auto=webp&v=enabled&s=76f4cc6a6778b59c4c2ab2aec95421fd18bab1f1

The application shows the name of the player, the year, a trend of their OPS+ over time and the 10 most similar seasons. This …
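As a rough illustration of how the 10 most similar seasons could be looked up, the sketch below ranks seasons by cosine similarity over small stat vectors. This is an assumption about the general technique, not the application's code; txtai handles the search in the real project. The function names `cosine` and `most_similar` and the sample vectors are hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def most_similar(index, key, limit=10):
    """Return up to `limit` (season, score) pairs, best match first."""
    query = index[key]
    scored = [
        (other, cosine(query, vector))
        for other, vector in index.items()
        if other != key  # skip the query season itself
    ]
    return sorted(scored, key=lambda item: item[1], reverse=True)[:limit]


# Made-up vectors for illustration only, not real statistics.
seasons = {
    ("player_a", 2001): [500, 150, 30],
    ("player_b", 2001): [510, 155, 31],
    ("player_c", 2001): [200, 40, 1],
}

results = most_similar(seasons, ("player_a", 2001))
for season, score in results:
    print(season, round(score, 4))
```

A brute-force scan like this is fine for a sketch; a dedicated vector index becomes worthwhile once every season in the dataset has to be searched interactively.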

