Oct. 26, 2023, 11:02 p.m. | Robin Linacre

Towards Data Science - Medium towardsdatascience.com

How effectively do different approaches to record linkage use information in the records to make predictions?

Wringing information out of data. Image created by the author using DALL·E 3

A pervasive data quality problem is to have multiple different records that refer to the same entity but no unique identifier that ties these entities together.

In the absence of a unique identifier such as a Social Security number, we can use a combination of individually non-unique variables such as name, …

author dall data data quality data science entity-resolution image information multiple predictions quality record-linkage records

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne