AICurious Logo

What is: Bottleneck Transformer?

SourceBottleneck Transformers for Visual Recognition
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

The **Bottleneck Transformer (BoTNet) ** is an image classification model that incorporates self-attention for multiple computer vision tasks including image classification, object detection and instance segmentation. By just replacing the spatial convolutions with global self-attention in the final three bottleneck blocks of a ResNet and no other changes, the approach improves upon baselines significantly on instance segmentation and object detection while also reducing the parameters, with minimal overhead in latency.