Nov. 3, 2023, 9:21 a.m. | /u/TeenColonistWrangler

Machine Learning www.reddit.com

I'm trying to cluster PDF files that I've converted into images, and I got a good suggestion: train an autoencoder with convolutional layers, then cluster in its latent space. I'm hoping to implement this with Keras.
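For concreteness, here is a minimal sketch of that setup, not a tuned implementation. The image size, layer widths, latent dimension, and the choice of k-means from scikit-learn are all assumptions made for illustration, and `build_autoencoder` and `cluster_pages` are hypothetical helper names. It expects square grayscale pages scaled to [0, 1] with a side length divisible by 4.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers
from sklearn.cluster import KMeans


def build_autoencoder(side=64, latent_dim=16):
    """Return (autoencoder, encoder) for side x side grayscale images."""
    inputs = keras.Input(shape=(side, side, 1))
    # Encoder: two strided convolutions halve the resolution twice.
    x = layers.Conv2D(16, 3, strides=2, padding="same", activation="relu")(inputs)
    x = layers.Conv2D(32, 3, strides=2, padding="same", activation="relu")(x)
    latent = layers.Dense(latent_dim, name="latent")(layers.Flatten()(x))
    # Decoder: mirror the encoder back up to the input resolution.
    q = side // 4
    x = layers.Dense(q * q * 32, activation="relu")(latent)
    x = layers.Reshape((q, q, 32))(x)
    x = layers.Conv2DTranspose(16, 3, strides=2, padding="same", activation="relu")(x)
    outputs = layers.Conv2DTranspose(1, 3, strides=2, padding="same", activation="sigmoid")(x)
    autoencoder = keras.Model(inputs, outputs)
    autoencoder.compile(optimizer="adam", loss="mse")
    # The encoder shares weights with the autoencoder and exposes the latent code.
    return autoencoder, keras.Model(inputs, latent)


def cluster_pages(pages, n_clusters=5, epochs=10):
    """Train on reconstruction, then run k-means on the latent codes."""
    autoencoder, encoder = build_autoencoder(side=pages.shape[1])
    autoencoder.fit(pages, pages, epochs=epochs, batch_size=32, verbose=0)
    codes = encoder.predict(pages, verbose=0)
    return KMeans(n_clusters=n_clusters, n_init=10).fit_predict(codes)
```

Calling `cluster_pages(pages)` on an array of shape `(n, side, side, 1)` returns one integer cluster label per page. The number of clusters is a guess that would need to be picked for the actual data.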

The problem I'm running into is that these PDF files are scans, so some of the files are slightly rotated, and some of them are rotated by a full 90 degrees. As far as I know, autoencoders are generally not rotation-invariant, and …
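To illustrate the 90-degree case only, here is a small NumPy sketch of one possible workaround: encode all four 90-degree rotations of a page and average the codes, so any two pages that differ by a multiple of 90 degrees get the same feature vector. This is an assumption about what might help, not something from the suggestion above. `rotation_pooled_code` is a hypothetical helper, and `toy_encode` is a stand-in random projection used only so the example runs without a trained model. It assumes square images and does nothing for slight skew.

```python
import numpy as np


def rotation_pooled_code(image, encode):
    """Average `encode` over the four 90-degree rotations of a square image."""
    rotations = [np.rot90(image, k) for k in range(4)]
    return np.mean([encode(r) for r in rotations], axis=0)


# Stand-in encoder for demonstration only: a fixed random linear projection.
rng = np.random.default_rng(0)
projection = rng.normal(size=(8, 32 * 32))


def toy_encode(image):
    return projection @ image.ravel()


page = rng.random((32, 32))
rotated = np.rot90(page)  # same page, turned by 90 degrees

# Distance between codes of the page and its rotated copy, without pooling.
plain_gap = np.linalg.norm(toy_encode(page) - toy_encode(rotated))
# Same distance after pooling over the four rotations.
pooled_gap = np.linalg.norm(
    rotation_pooled_code(page, toy_encode) - rotation_pooled_code(rotated, toy_encode)
)
print(plain_gap, pooled_gap)
```

The pooled distance comes out at zero up to floating-point error because both pages produce the same set of four rotations. In practice `toy_encode` would be replaced by the trained encoder, and small rotations would still need something else, such as deskewing or rotation augmentation during training.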

