FCOS3D-MVDet3D
| Authors | Tai Wang*, Qing Lian*, Chenming Zhu, Xinge Zhu, Wenwei Zhang |
| Description | We build a multi-view framework with temporal stereo modeling to convert multi-view features to a 3D grid space and perform 3D detection thereon. The backbone is based on FCOS3D++, pretrained on Waymo with only object annotations. We do not involve lidar depth labels both during training and inference. Code will be released at MMDetection3D. |
| Project Link | Link |
TYPE_VEHICLE
- Sensors: C
TYPE_PEDESTRIAN
- Sensors: C
TYPE_CYCLIST
- Sensors: C
ALL_NS
- Sensors: C