LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion

Xiong, Weiyi; Liu, Jianan; Huang, Tao; Han, Qing-Long; Xia, Yuxuan; Zhu, Bing

doi:10.1109/TIV.2023.3321240

Computer Science > Computer Vision and Pattern Recognition

arXiv:2307.00724 (cs)

[Submitted on 3 Jul 2023 (v1), last revised 3 Oct 2023 (this version, v4)]

Title:LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion

Authors:Weiyi Xiong, Jianan Liu, Tao Huang, Qing-Long Han, Yuxuan Xia, Bing Zhu

View PDF

Abstract:As an emerging technology and a relatively affordable device, the 4D imaging radar has already been confirmed effective in performing 3D object detection in autonomous driving. Nevertheless, the sparsity and noisiness of 4D radar point clouds hinder further performance improvement, and in-depth studies about its fusion with other modalities are lacking. On the other hand, as a new image view transformation strategy, "sampling" has been applied in a few image-based detectors and shown to outperform the widely applied "depth-based splatting" proposed in Lift-Splat-Shoot (LSS), even without image depth prediction. However, the potential of "sampling" is not fully unleashed. This paper investigates the "sampling" view transformation strategy on the camera and 4D imaging radar fusion-based 3D object detection. LiDAR Excluded Lean (LXL) model, predicted image depth distribution maps and radar 3D occupancy grids are generated from image perspective view (PV) features and radar bird's eye view (BEV) features, respectively. They are sent to the core of LXL, called "radar occupancy-assisted depth-based sampling", to aid image view transformation. We demonstrated that more accurate view transformation can be performed by introducing image depths and radar information to enhance the "sampling" strategy. Experiments on VoD and TJ4DRadSet datasets show that the proposed method outperforms the state-of-the-art 3D object detection methods by a significant margin without bells and whistles. Ablation studies demonstrate that our method performs the best among different enhancement settings.

Comments:	Accepted by IEEE Transactions on Intelligent Vehicles
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2307.00724 [cs.CV]
	(or arXiv:2307.00724v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2307.00724
Related DOI:	https://doi.org/10.1109/TIV.2023.3321240

Submission history

From: Weiyi Xiong [view email]
[v1] Mon, 3 Jul 2023 03:09:44 UTC (8,136 KB)
[v2] Fri, 7 Jul 2023 13:20:59 UTC (5,575 KB)
[v3] Sun, 27 Aug 2023 12:49:57 UTC (7,782 KB)
[v4] Tue, 3 Oct 2023 10:07:26 UTC (7,595 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:LXL: LiDAR Excluded Lean 3D Object Detection with 4D Imaging Radar and Camera Fusion

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators