Transformers have been at the heart of the Natural Language Processing (NLP)
and Computer Vision (CV) revolutions. Their significant success in NLP and CV
has inspired the exploration of Transformers for point cloud processing. However,
how do Transformers cope with the irregular and unordered nature of point
clouds? How suitable are Transformers for different 3D representations (e.g.,
point- or voxel-based)? How competent are Transformers for various 3D
processing tasks? As of now, there is still no systematic survey of the …

