Jan. 4, 2024, 3:12 a.m. | /u/Cyraxess

Data Science www.reddit.com

Folks, I've been pondering a question and I've found it to be more complicated than I initially thought.

Assume we have two very large tables (large enough that discussing efficiency is worthwhile). Let's say one table is named 'course' and the other 'registration', recording the instances of students registering for courses. The goal is simply to filter out the courses that have been registered for by at least one student this year.

It is simple, we can do:

SELECT course.* …

course courses datascience efficiency filter found instances question recording registration sql students table tables thought

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Analyst (Digital Business Analyst)

@ Activate Interactive Pte Ltd | Singapore, Central Singapore, Singapore