Wenbo (Gordon) Hu

whu at cs dot ucla dot edu

Hi! I am Wenbo. I'm a first-year CS PhD student at University of California, Los Angeles advised by Prof. Kai-Wei Chang and Prof. Nanyun Peng. I also obtained my M.S. in CS from UCLA in 2024. Previously, I obtained my B.S. in Data Science at the Halicioglu Data Science Institute (HDSI) from University of California, San Diego in 2023. I'm fortunate to have worked with Prof. Zhuowen Tu and Prof. Hao Su during my undergraduate study.

My primary research interest lies in the intersection of vision, language, and agentic. Particularly, I have worked on 2D and 3D vision-language models in visual understanding and embodied tasks, and evaluation benchmarks for multimodal models. My long-term research goal is to build intelligent systems that can perceive, understand and interact with the complex physical world.

I'm actively looking for strong and motivated graduate and undergraduate students to collaborate. If you have similar research interests or interested in working on research projects in general, feel free to reach out to me!

CV  /  GitHub  /  Google Scholar /  LinkedIn  /  Email  /  Twitter (X)


News
  • 01/2025: MRAG-Bench is accepted at ICLR 2025!
  • 10/2024: We release MRAG-Bench: Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models.
  • 09/2024: MQT-LLaVA: Matryoshka Query Transformer for Large Vision-Language Models is accepted at NeurIPS 2024!
  • 05/2024: Released Matryoshka Query Transformer for Large Vision-Language Models, check our new work here.
  • 05/2024: VALOR-EVAL is accepted by ACL 2024 Findings!
  • 12/2023: BLIVA is accepted by AAAI 2024!
  • 10/2023: Joined PLUS lab and UCLA NLP Group at UCLA.
  • 04/2023: Joined Machine Learning, Perception, and Cognition Lab (mlPC) at UCSD.
  • 02/2022: Joined Hao Su Lab at UCSD for computer vision and robotics research.

  • Publications

    Loading...

    Teaching Experience
    Teaching Assistant: CSE151A: Intro to Machine Learning
    UCSD (Winter 2023)

    Awards
    UCLA CS Departmental Fellowship Award

    Service
    Reviewer for ICLR, NeurIPS, CVPR, ICCV, AAAI, ACL, EMNLP, NAACL, ACL Rolling Review, and IEEE Transactions on Multimedia.



    Pageviews


    Inspired by this and this.