Yunhao Fang 方云浩

Research Scientist Intern,
Nvidia
Email: seerkfang [@] gmail [DOT] com
Twitter / GitHub / LinkedIn

About me

I'm a Research Scientist Intern at Nvidia, advised by Dr. Jason Lu and Prof. Song Han, and a core contributor to Nvidia’s multimodal model, VILA.

I hold a Master's degree from the Department of Computer Science and Engineering at the University of California San Diego, where I was fortunate to be advised by Prof. Hao Su. Before that, I earned my B.Eng. in Electronic Engineering from Zhejiang University. I have also spent time at Shanghai AI Laboratory, where I maintained the open-source codebase mmtracking.

My long-term research goal is to advance multimodal intelligence by developing automated learning systems that integrate closed-loop data pipelines, efficient algorithms, and robust evaluation tools.

Research interests

My research interests include

  • Perception

    • Generalized Representation

    • Synergy between Understanding and Generation

  • Reasoning and Common Sense

    • Concept Emergence and Common Sense

    • Advanced Reasoning for Scientific Discoveries

  • Generative Modeling

    • Efficient World Model

    • Learning from (Human or AI) Feedback

Selected Publications & Preprints

Papers are sorted by year. The full list is available on Google Scholar.

2024

VILA^2: VLM Augmented VLM with Self-Improvement
Yunhao Fang*, Ligeng Zhu*, Yao Lu, Yan Wang, Pavlo Molchanov, Jang Hyun Cho, Marco Pavone, Song Han, Hongxu Yin
In Submission

2023

Deductive Verification of Chain-of-Thought Reasoning
Zhan Ling*, Yunhao Fang*, Xuanlin Li, Zhiao Huang, Hao Su
Neural Information Processing Systems (NeurIPS) 2023
[Code]

Distilling Large Vision-Language Model with Out-of-Distribution Generalizability
Xuanlin Li*, Yunhao Fang*, Minghua Liu, Zhan Ling, Zhuowen Tu, Hao Su
International Conference on Computer Vision (ICCV) 2023
[Code]

Professional Services

  • Conference Reviewer: ECCV 2024, ICLR 2024, CVPR 2024

Teaching

  • Teaching Assistant: CSE 275: Deep Learning for 3D Data at UC San Diego, Fall 2023

Awards

  • China National Scholarship, 2022