Nov. 26, 2023, 6:19 a.m. | /u/APaperADay

Computer Vision www.reddit.com

**Paper**: [https://arxiv.org/abs/2311.08525](https://arxiv.org/abs/2311.08525)

**Code**: [https://github.com/tuggeluk/ffcv-imagenet/tree/rotation\_module](https://github.com/tuggeluk/ffcv-imagenet/tree/rotation_module)

**Abstract**:

>Humans and animals recognize objects irrespective of the beholder's point of view, which may drastically change their appearances. Artificial pattern recognizers also strive to achieve this, e.g., through translational invariance in convolutional neural networks (CNNs). However, both CNNs and vision transformers (ViTs) perform very poorly on rotated inputs. Here we present artificial mental rotation (AMR), a novel deep learning paradigm for dealing with in-plane rotations inspired by the neuro-psychological concept of mental rotation. Our simple …

abstract amr animals artificial change cnns computervision convolutional neural networks deep learning humans networks neural networks novel objects rotation through transformers vision vision transformers

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Director, Clinical Data Science

@ Aura | Remote USA

Research Scientist, AI (PhD)

@ Meta | Menlo Park, CA | New York City