I am a Ph.D. candidate at the Center for Vision, Cognition, Learning, and Autonomy (VCLA), University of California, Los Angeles (UCLA), advised by Prof. Song-Chun Zhu.
I received dual bachelor degrees in Computer Engineering from University of Illinois at Urbana-Champaign and Zhejiang University.
My research interests are focused on the fields of computer vision, robotics, and cognitive science. I actively engaged in pushing the boundaries of generalizable object understanding and enhancing 3D vision through diffusion models, visual-language models (VLMs), and large language models (LLMs).