[P] Need NLP resources to make sense of webdata
So basically I'm working with some scraped web data and need to clean it up and make sense of it. However, it's extremely unstructured, and I'd like to figure out how I could leverage NLP to make sense of it so that I can clean it.
Essentially, it's something like scraping a whole outfit along with details such as size, color, etc., but the details are not always in a structured format. Sometimes it's like, "<brand name> <color> <size> <shirt/skirt/trousers> <color of shirt> …