- fully sparse
- SparseLIF, 24
- (mAP and NDS on nuScenes: 74, 77)
- (mAP on AV2 val set: 40)
- In its experiments, it outperforms **EA-LSS 23**.
- SparseFusion, 24
- (mAP and NDS on nuScenes: 70.1, 72.7)
- (mAP on AV2 val set: 39.8)
- SparseFusion: Fusing Multi-Modal Sparse Representations for Multi-Sensor 3D Object Detection, iccv23
- Futr3d: A unified sensor fusion framework for 3d detection, cvpr23
- Deepinteraction: 3d object detection via modality interaction, nips22
- sparse + global attention
- Cross Modal Transformer, iccv23 (mAP and NDS on nuScenes, test: 72, 74)
- UniTR: A unified and efficient multi-modal transformer for bird’s-eye-view representation, iccv23
- exhaustive global attention buries the advantages of the sparse paradigm and makes it difficult to benefit from long-term temporal information
- instance based
- Fully Sparse Fusion for 3D Object Detection, TPAMI24
- (mAP and NDS on nuScenes: 70, 74)
- (mAP on AV2 val set: 33)
- ObjectFusion, iccv23 (mAP and NDS on nuScenes: 71, 74)
- PoIFusion, 24 (mAP and NDS on nuScenes: 74, 75)
- Other
- **EA-LSS 23:** (mAP and NDS on nuScenes: 77, 78)
- EA-LSS [20] enhances depth estimation at the edge of objects
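
To compare the nuScenes numbers listed above at a glance, they can be collected into a small lookup and sorted. This is only a convenience sketch: the figures are copied from these notes as written, the dictionary and the `rank_by` helper are hypothetical names, and the notes do not always say which split a number comes from (only Cross Modal Transformer is marked as test), so the ordering is a rough orientation rather than a strict benchmark comparison.

```python
# nuScenes (mAP, NDS) figures copied from the notes above.
# Assumption: the numbers are comparable; the notes only mark the
# Cross Modal Transformer entry as a test-set result.
results = {
    "SparseLIF": (74.0, 77.0),
    "SparseFusion": (70.1, 72.7),
    "Cross Modal Transformer": (72.0, 74.0),
    "Fully Sparse Fusion": (70.0, 74.0),
    "ObjectFusion": (71.0, 74.0),
    "PoIFusion": (74.0, 75.0),
    "EA-LSS": (77.0, 78.0),
}


def rank_by(metric_index):
    """Return method names sorted from highest to lowest on one metric.

    metric_index 0 selects mAP, 1 selects NDS.
    """
    return sorted(results, key=lambda name: results[name][metric_index], reverse=True)


by_nds = rank_by(1)
for name in by_nds:
    m_ap, nds = results[name]
    print(f"{name:<26} mAP {m_ap:5.1f}  NDS {nds:5.1f}")
```

Sorting by NDS rather than mAP is an arbitrary choice here; calling `rank_by(0)` gives the mAP ordering instead.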