We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Grounded Affordance from Exocentric View.
- Authors
Luo, Hongchen; Zhai, Wei; Zhang, Jing; Cao, Yang; Tao, Dacheng
- Abstract
Affordance grounding aims to locate objects' "action possibilities" regions, an essential step toward embodied intelligence. Due to the diversity of interactive affordance, i.e., the uniqueness of different individual habits leads to diverse interactions, which makes it difficult to establish an explicit link between object parts and affordance labels. Human has the ability that transforms various exocentric interactions into invariant egocentric affordance to counter the impact of interactive diversity. To empower an agent with such ability, this paper proposes a task of affordance grounding from the exocentric view, i.e., given exocentric human-object interaction and egocentric object images, learning the affordance knowledge of the object and transferring it to the egocentric image using only the affordance label as supervision. However, there is some "interaction bias" between personas, mainly regarding different regions and views. To this end, we devise a cross-view affordance knowledge transfer framework that extracts affordance-specific features from exocentric interactions and transfers them to the egocentric view to solve the above problems. Furthermore, the perception of affordance regions is enhanced by preserving affordance co-relations. In addition, an affordance grounding dataset named AGD20K is constructed by collecting and labeling over 20K images from 36 affordance categories. Experimental results demonstrate that our method outperforms the representative models regarding objective metrics and visual quality. The code is available via: github.com/lhc1224/Cross-View-AG.
- Subjects
KNOWLEDGE transfer; PROBLEM solving
- Publication
International Journal of Computer Vision, 2024, Vol 132, Issue 6, p1945
- ISSN
0920-5691
- Publication type
Article
- DOI
10.1007/s11263-023-01962-z