operate on important voxels instead of non-empty voxels [sparser than SSC]

(a) Comparison of the foreground and background ratios. (b) By replacing SPRS-Conv with its counterpart in sparse CNN, unnecessary positions are effectively suppressed from being activated.
Dilation on important voxels instead of no dilation during downsampling [denser than SSC]

(a) SSC: Submanifold sparse convolution, 2017: but the receptive field is limited.
(b) Regular sparse convolution [14]: compute features for adjacent empty voxels ⇒ dilated feature ⇒ effectively expand the receptive field ⇒ but computation burden.
Consequently, after convolutional (stride > 1) down-sampling, the number of non-empty voxels might be increased rather than decreased as shown in Fig. 1 (b): the number of non-empty voxels even doubled (see stage 2) compared to the input (see stage1).
==================
2D image domain [29, 11]: add an auxiliary learnable module to predict a soft mask [21, 28] that locates areas to be skipped for computational efficiency.
The module often requires additional post-finetuning, auxiliary costs for integration, and incurs non-negligible computational overheads.

Illustration of magnitude-guided spatial sampling and spatial pruned submanifold sparse convolution (SPSS-Conv)

Illustration of spatial pruned regular sparse convolution (SPRS-Conv). shows the case of stride=2. 没看懂。
这块不讲,还是看图好。

regular sparse convolution [14]
