Yunhao Fang 方云浩

Research Scientist Intern,
Nvidia
Email: seerkfang [@] gmail [DOT] com
Twitter / GitHub / LinkedIn

About me

I'm a Research Scientist Intern at Nvidia, advised by Dr. Jason Lu and Prof. Song Han, and a core contributor to Nvidia’s multimodal model, VILA.

I hold a Master's degree from the Department of Computer Science and Engineering at the University of California San Diego, where I was fortunate to be advised by Prof. Hao Su. Before that, I earned my B.Eng. in Electronic Engineering from Zhejiang University. I have also spent time at Shanghai AI Laboratory as the maintainer of the open-source codebase mmtracking.

My long-term research goal is to advance multimodal intelligence by developing automated learning systems that integrate closed-loop data pipelines, efficient algorithms, and robust evaluation tools.

Research interests

My research interests include:

Perception
- Generalized Representation
- Synergy between Understanding and Generation

Reasoning and Common Sense
- Concept Emergence and Common Sense
- Advanced Reasoning for Scientific Discoveries

Generative Modeling
- Efficient World Model
- Learning from (Human or AI) Feedback

Selected Publications & Preprints

Papers are sorted by year. The full list is available on Google Scholar.

2024

VILA^2: VLM Augmented VLM with Self-Improvement
Yunhao Fang*, Ligeng Zhu*, Yao Lu, Yan Wang, Pavlo Molchanov, Jang Hyun Cho, Marco Pavone, Song Han, Hongxu Yin
In Submission

2023

Deductive Verification of Chain-of-Thought Reasoning
Zhan Ling*, Yunhao Fang*, Xuanlin Li, Zhiao Huang, Hao Su
Neural Information Processing Systems (NeurIPS) 2023
[Code]

Distilling Large Vision-Language Model with Out-of-Distribution Generalizability
Xuanlin Li*, Yunhao Fang*, Minghua Liu, Zhan Ling, Zhuowen Tu, Hao Su
International Conference on Computer Vision (ICCV) 2023
[Code]

Professional Services

Conference Reviewer: ECCV 2024, ICLR 2024, CVPR 2024

Teaching

Teaching Assistant: CSE 275: Deep Learning for 3D Data at UC San Diego, Fall 2023

Awards

China National Scholarship, 2022