Abstract
Affordance detection presents intricate challenges and has a wide range of robotic applications. Previous works have faced limitations such as the complexities of 3D object shapes, the wide range of potential affordances on real-world objects, and the lack of open-vocabulary support for affordance understanding. In this paper, we introduce a new openvocabulary affordance detection method in 3D point clouds, leveraging knowledge distillation and text-point correlation. Our approach employs pre-trained 3D models through knowledge distillation to enhance feature extraction and semantic understanding in 3D point clouds. We further introduce a new text-point correlation method to learn the semantic links between point cloud features and open-vocabulary labels. The intensive experiments show that our approach outperforms previous works and adapts to new affordance labels and unseen objects. Notably, our method achieves the improvement of 7.96% mIOU score compared to the baselines. Furthermore, it offers real-time inference which is well-suitable for robotic manipulation applications.
Original language | English |
---|---|
Title of host publication | 2024 IEEE International Conference on Robotics and Automation (ICRA) |
Pages | 13968-13975 |
DOIs | |
Publication status | Published - 8 Aug 2024 |
Event | IEEE International Conference on Robotics and Automation - Yokohama, Japan Duration: 13 May 2024 → 17 May 2024 https://2024.ieee-icra.org/ |
Publication series
Name | 2024 IEEE International Conference on Robotics and Automation (ICRA) |
---|
Conference
Conference | IEEE International Conference on Robotics and Automation |
---|---|
Abbreviated title | ICRA |
Country/Territory | Japan |
City | Yokohama |
Period | 13/05/24 → 17/05/24 |
Internet address |
Research Field
- Complex Dynamical Systems