Multi-sensor fusion
Why BEV?

What is BEV and how?
‣

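The paper does not seem to discuss depth supervision, and it is unclear whether any loss is applied directly to depth, but for related discussion see: ‣
Others have done this before; in this paper, "we diagnose and lift key efficiency bottlenecks in the view transformation with optimized BEV pooling, reducing latency by more than 40×."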
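To make the term concrete, here is a rough sketch of what BEV pooling computes: features attached to 3D points are summed into the BEV grid cell each point falls in. This is a naive NumPy version for illustration only, not the paper's optimized kernel; the function name `bev_pool` and all of its parameters are hypothetical.

```python
import numpy as np

def bev_pool(points_xy, feats, x_range, y_range, cell):
    """Naive sum-pooling of per-point features into a BEV grid (illustrative only).

    points_xy: (N, 2) metric x, y coordinates of each feature point.
    feats:     (N, C) feature vectors, one per point.
    x_range, y_range: (min, max) extent of the grid in metres.
    cell:      side length of one BEV cell in metres.
    """
    nx = int(round((x_range[1] - x_range[0]) / cell))
    ny = int(round((y_range[1] - y_range[0]) / cell))

    # Map each point to its integer grid cell.
    ix = np.floor((points_xy[:, 0] - x_range[0]) / cell).astype(np.int64)
    iy = np.floor((points_xy[:, 1] - y_range[0]) / cell).astype(np.int64)

    # Drop points that fall outside the grid.
    keep = (ix >= 0) & (ix < nx) & (iy >= 0) & (iy < ny)
    flat = ix[keep] * ny + iy[keep]

    # Unbuffered scatter-add so points sharing a cell accumulate.
    bev = np.zeros((nx * ny, feats.shape[1]), dtype=feats.dtype)
    np.add.at(bev, flat, feats[keep])
    return bev.reshape(nx, ny, -1)
```

The scatter-add over a very large number of frustum points is the step that the quoted sentence identifies as the efficiency bottleneck.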

We just use it and do not care about its implementation.
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation, ICRA 23