May 2, 2022, 6:25 p.m. | /u/Public-Confusion4934

Data Science www.reddit.com

Hey guys,
I’m working on a DS project where I have a bunch of researcher cvs/resumes, and I need to get a list of all their publications (title and year) listed on their resume.

Right now Im using pdfminer to read the pdf, and then refextact to extract the publications, but not all the publication sections on the resumes are formatted the same and it has very low success rates with such cases (like 2/37, 200/937, 0/9 it varied a …

datascience python resume tools

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Enterprise Data Architect

@ Pathward | Remote

Diagnostic Imaging Information Systems (DIIS) Technologist

@ Nova Scotia Health Authority | Halifax, NS, CA, B3K 6R8

Intern Data Scientist - Residual Value Risk Management (f/m/d)

@ BMW Group | Munich, DE

Analytics Engineering Manager

@ PlayStation Global | United Kingdom, London

Junior Insight Analyst (PR&Comms)

@ Signal AI | Lisbon, Lisbon, Portugal