LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Example of 3D dense captioning.
Xin Chen
Xin Chen
陈欣 | Research Scientist

My research interests include generative AI, human agents, 3D and human motion generation.