May 22, 2023, 10:43 a.m. | /u/PeterUnemployed

Data Science www.reddit.com

I am trying to answer a hypothesis wherein companies with more heterogenous boards are more compliant and less likely to attempt fraud than companies with homogenous boards.

So in effect I have a dataset containing hundreds of thousands of companies and details about the people behind each and everyone of them (such as age, sex, income level etc)

Now I need to find a way to quantify similarity. The one approach I have tried is to calculate cosine distance from …

age boards companies datascience dataset fraud hypothesis income people sex

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US