
features generated from Lift-Splat-Shot (LSS) [42] and our Sparse View Transformer. The pink area denotes non-empty voxels predicted from images.

SparseFusion: Efficient Sparse Multi-Modal Fusion Framework for Long-Range 3D Perception, 24
[18] BEVPoolv2: A cutting-edge implementation of BEVDet toward deployment, 22