D³Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding

Chen, DZ; Wu, Q; Niessner, M; Chang, AX

Chen, DZ (corresponding author), Tech Univ Munich, Munich, Germany.

COMPUTER VISION - ECCV 2022, PT XXXII, 2022; 13692: 487