Paper
15 August 2023 A category-level 6D pose estimation method based on end-to-end fast and efficient heterogeneous feature fusion
Huafeng Wang, Ao Chen
Author Affiliations +
Proceedings Volume 12719, Second International Conference on Electronic Information Technology (EIT 2023); 1271928 (2023) https://doi.org/10.1117/12.2685582
Event: Second International Conference on Electronic Information Technology (EIT 2023), 2023, Wuhan, China
Abstract
Monocular 6D pose estimation is a fundamental task in computer vision, and this paper focuses on class-level 6D pose estimation that can predict the pose of previously unknown objects. In the work carried out on the basis of RGB-D images, previous approaches have paid less attention to the distinction between different structural features in them and the consistency of their contributions in the pose estimation task when using deep learning for feature extraction, and thus most of the fusion is a direct stitching of heterogeneous information features. In particular, the experimental results of the current work on 6D pose estimation surface that there are still limitations in its pose estimation for multi-class target training. The sparse structure of the point cloud would make most methods easy to ignore the effective features. Moreover, the existing method studies do not sufficiently explore the complementarity between the effective information of heterogeneous features, which can lead to the fusion of the optimal combination methods that lack the contribution of each, thus bringing a large amount of redundant information and consuming computational resources. Therefore, this paper designs a more effective point cloud feature extraction method for dynamic graph structure for this task. To address the inherent requirements of complementary fusion, we design an adaptive method for further feature extraction on heterogeneous data and a fusion method with effective self-attentive disequilibrium contributions to extract core information from potential feature complements quickly, accurately, and efficiently. We conducted experiments on popular benchmark datasets, such as the NOCS-REAL [1] dataset. The experimental results show that our proposed method can perform the multi-class target 6D pose estimation task end-to-end and has a good performance on these datasets while achieving a real-time inference speed of almost 20 FPS.
© (2023) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
Huafeng Wang and Ao Chen "A category-level 6D pose estimation method based on end-to-end fast and efficient heterogeneous feature fusion", Proc. SPIE 12719, Second International Conference on Electronic Information Technology (EIT 2023), 1271928 (15 August 2023); https://doi.org/10.1117/12.2685582
Advertisement
Advertisement
RIGHTS & PERMISSIONS
Get copyright permission  Get copyright permission on Copyright Marketplace
KEYWORDS
Pose estimation

Feature fusion

Feature extraction

Point clouds

Education and training

Data fusion

RGB color model

Back to Top