Abstract
Collaborative perception by leveraging the shared semantic information plays a crucial role in overcoming the individual limitations of isolated agents. However, existing collaborative perception methods tend to focus solely on the spatial features of semantic information, while neglecting the importance of the temporal dimension. Consequently, the potential benefits of collaboration remain underutilized. In this article, we propose Select2Col, a novel collaborative perception framework that takes into account the spatial-temporal importance of semantic information. Within the Select2Col, we develop a collaborator selection method that utilizes a lightweight graph neural network (GNN) to estimate the importance of semantic information (IoSI) of each collaborator in enhancing perception performance, thereby identifying contributive collaborators while excluding those that potentially bring negative impact. Moreover, we present a semantic information fusion algorithm called HPHA (historical prior hybrid attention), which integrates multi-scale attention and short-term attention modules to capture the IoSI in feature representation from the spatial and temporal dimensions respectively, and assigns IoSI-consistent weights for efficient fusion of information from selected collaborators. Extensive experiments on three open datasets demonstrate that our proposed Select2Col significantly improves the perception performance compared to state-of-the-art approaches.
Original language | English |
---|---|
Pages (from-to) | 12556 - 12569 |
Number of pages | 14 |
Journal | IEEE Transactions on Vehicular Technology |
Volume | 73 |
Issue number | 9 |
DOIs | |
Publication status | Published - Sept 2024 |
MoE publication type | A1 Journal article-refereed |
Funding
This work was supported in part by the National Key Research and Development Program of China under Grant 2021YFB2900200, in part by the National Natural Science Foundation of China under Grant 62071425, in part by ZhejiangKey Research andDevelopment Plan underGrant 2022C01129, in part by the Zhejiang Provincial Natural Science Foundation of China under Grant LR23F010005, in part by the National Key Laboratory of Wireless Communications Foundation under Grant 2023KP01601, and in part by the Big Data and Intelligent Computing Key Lab of CQUPT under Grant BDIC-2023-B-001.
Keywords
- Collaboration
- Collaborative Perception
- Feature extraction
- Hybrid Attention
- Importance of Semantic Information
- Object detection
- Point cloud compression
- Roads
- Semantic Information Fusion
- Semantics
- Spatial-Temporal Dimensions
- Three-dimensional displays
- semantic information fusion
- Collaborative perception
- hybrid attention
- spatial-temporal dimensions
- importance of semantic information