April 30, 2022, 9:13 p.m. | /u/Jarros

Data Science www.reddit.com

I'm just a beginner in statistics and DS, who wants to build some sort of open source data model like word2vec but for Wikipedia entries, so anyone could use that model to calculate semantic distance between let's say an entry 'Google' and an entry 'Apple (company)', to group entries into clusters ('Tech firms') and so on.

So I figured out my next plan: since each Wikipedia article has got hyperlinks to other articles, first I'd make some sort of a …

articles datascience wikipedia

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analytics & Insight Specialist, Customer Success

@ Fortinet | Ottawa, ON, Canada

Account Director, ChatGPT Enterprise - Majors

@ OpenAI | Remote - Paris