ICML2025
Steerable Transformers for Volumetric Data
Soumyabrata Kundu, Risi Kondor
摘要
We introduce Steerable Transformers, an extension of the Vision Transformer that is equivariant to the action of the Special Euclidean group SE(d). We propose an steerable self-attention mechanism that operates on features extracted by steerable convolutions. Our experiments in both two and three dimensions show augmenting steerable convolutional networks with steerable transformer leads to improved performance.