Open-vocabulary Affordance Detection in 3d Point Clouds

Toan Nguyen, Minh Nhat Vu, An Vuong, Dzung Nguyen, Thieu Vo, Ngan Le, Anh Nguyen

Research output: Chapter in Book or Conference ProceedingsConference Proceedings with Oral Presentationpeer-review

Abstract

Affordance detection is a challenging problem with a wide variety of robotic applications. Traditional affordance detection methods are limited to a predefined set of affordance labels, hence potentially restricting the adaptability of intelligent robots in complex and dynamic environments. In this paper, we present the Open-Vocabulary Affordance Detection (OpenAD) method, which is capable of detecting an unbounded number of affordances in 3D point clouds. By simultaneously learning the affordance text and the point feature, OpenAD successfully exploits the semantic relationships between affordances. Therefore, our proposed method enables zero-shot detection and can be able to detect previously unseen affordances without a single annotation example. Intensive experimental results show that OpenAD works effectively on a wide range of affordance detection setups and outperforms other baselines by a large margin. Additionally, we demonstrate the practicality of the proposed OpenAD in real-world robotic applications with a fast inference speed. Our project is available at https://openad2023.github.io.
Original languageEnglish
Title of host publication2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Pages5692-5698
DOIs
Publication statusPublished - 14 Dec 2023
Event2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) - Detroit, United States
Duration: 1 Oct 20235 Oct 2023

Publication series

Name2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

Conference

Conference2023 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
Country/TerritoryUnited States
CityDetroit
Period1/10/235/10/23

Research Field

  • Complex Dynamical Systems

Fingerprint

Dive into the research topics of 'Open-vocabulary Affordance Detection in 3d Point Clouds'. Together they form a unique fingerprint.

Cite this