What is: Sparse R-CNN?

Source: Sparse R-CNN: End-to-End Object Detection with Learnable Proposals
Year: 2020
Data Source: CC BY-SA - https://paperswithcode.com

Sparse R-CNN is a purely sparse method for object detection in images: it neither enumerates object positional candidates over all (dense) image grids nor lets object queries interact with the global (dense) image feature.

As shown in the figure, object candidates are given as a small, fixed set of learnable bounding boxes, each represented by a 4-d coordinate. On the COCO dataset, for example, only 100 boxes (400 parameters in total) are needed, in place of the boxes predicted from hundreds of thousands of candidates by a Region Proposal Network (RPN). These sparse candidates are used as proposal boxes to extract Region of Interest (RoI) features via RoIPool or RoIAlign.
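
The arithmetic behind the 100 boxes and 400 parameters can be illustrated with a minimal plain-Python sketch. It is not the authors' implementation: the function names are hypothetical, and the normalized (cx, cy, w, h) box format and the whole-image initialization are assumptions made for illustration. In a real detector the table would be a trainable tensor, and the resulting pixel boxes would be passed to RoIPool or RoIAlign.

```python
# Sketch only: a plain-Python stand-in for a learnable proposal-box table.
NUM_PROPOSALS = 100  # number of proposal boxes quoted for COCO
BOX_DIM = 4          # each box is a 4-d coordinate


def init_proposal_boxes(num_proposals=NUM_PROPOSALS):
    """Return one box per proposal as normalized (cx, cy, w, h).

    Assumption: every box starts as the whole image, (0.5, 0.5, 1.0, 1.0).
    In a real detector these values would be updated by backpropagation.
    """
    return [[0.5, 0.5, 1.0, 1.0] for _ in range(num_proposals)]


def to_pixel_xyxy(box, image_width, image_height):
    """Convert a normalized (cx, cy, w, h) box to pixel (x1, y1, x2, y2)."""
    cx, cy, w, h = box
    return (
        (cx - w / 2) * image_width,
        (cy - h / 2) * image_height,
        (cx + w / 2) * image_width,
        (cy + h / 2) * image_height,
    )


def count_parameters(boxes):
    """Total number of scalar values stored in the proposal-box table."""
    return sum(len(box) for box in boxes)


proposal_boxes = init_proposal_boxes()
num_box_parameters = count_parameters(proposal_boxes)  # 100 boxes * 4 values
first_roi = to_pixel_xyxy(proposal_boxes[0], image_width=640, image_height=480)
```

The same table is reused for every input image, which is what keeps the candidate set both small and fixed; only the conversion to pixel coordinates depends on the image size.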