all AI news
Question: how to balance a dataset for a multi-output network
I am currently solving a problem that has multiple outputs. For example, let’s say it classifies spam/ham and urgent/not urgent. However, these two classes are not balanced within the dataset.
How would one go about balancing such a dataset so that the number of spam and ham instances are similar and the number of urgent and not urgent instances are similar?submitted by /u/MasalaByte