Aug. 18, 2023, 2:13 p.m. | Stefan Suwelack

Towards Data Science - Medium towardsdatascience.com

A short introduction to data-slicing methods including hands-on examples on the CIFAR-100 dataset.

Data slices on CIFAR-100. Source: created by the author.

tl;dr:

Data slices are semantically meaningful subsets of the data on which the model performs anomalously. When dealing with unstructured data (e.g. images, text), finding these slices is an important part of every data scientist’s job. In practice, this task relies heavily on individual experience and manual work. In this post, we present some methods and …
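To make the definition of a data slice concrete, here is a minimal sketch in plain Python. It is not the method from the post: it assumes each sample already carries a slice label from some metadata (the hypothetical `"bright"` / `"dark"` tags below), whereas for unstructured data the hard part is discovering such slices in the first place. The function names, the thresholds `min_size` and `min_gap`, and the toy labels are all illustrative assumptions.

```python
from collections import defaultdict


def slice_accuracies(slice_labels, y_true, y_pred):
    """Return {slice: (accuracy, sample_count)} for every slice label."""
    hits = defaultdict(int)
    counts = defaultdict(int)
    for s, t, p in zip(slice_labels, y_true, y_pred):
        counts[s] += 1
        hits[s] += int(t == p)
    return {s: (hits[s] / counts[s], counts[s]) for s in counts}


def anomalous_slices(slice_labels, y_true, y_pred, min_size=3, min_gap=0.3):
    """Return slices whose accuracy deviates from the overall accuracy.

    min_size and min_gap are arbitrary illustrative thresholds, not
    values taken from the article.
    """
    overall = sum(int(t == p) for t, p in zip(y_true, y_pred)) / len(y_true)
    per_slice = slice_accuracies(slice_labels, y_true, y_pred)
    return {
        s: acc
        for s, (acc, n) in per_slice.items()
        if n >= min_size and abs(acc - overall) >= min_gap
    }


# Toy data (hypothetical): 8 "bright" images, all classified correctly,
# and 4 "dark" images, of which only one is classified correctly.
slice_labels = ["bright"] * 8 + ["dark"] * 4
y_true = [0, 1, 2, 3, 0, 1, 2, 3, 0, 1, 2, 3]
y_pred = [0, 1, 2, 3, 0, 1, 2, 3, 0, 9, 9, 9]

flagged = anomalous_slices(slice_labels, y_true, y_pred)
print(flagged)
```

On this toy data, only the `"dark"` slice is flagged: its accuracy is far below the overall accuracy, while the `"bright"` slice stays within the chosen gap.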

data-centric-ai data visualization machine learning
