Research Profile

Yihuan Yao

Bachelor's degree: Harbin Institute of Technology (Weihai)

Master's degree: The Chinese University of Hong Kong, Shenzhen

About Me

Welcome to my research homepage. I am a researcher focused on artificial intelligence and computer vision, primarily working on autonomous driving systems and multimodal learning.

My research interests cover Vision-Language Models (VLM), Vision-Language-Action Models (VLA), and world models for autonomous driving systems. I believe in building intelligent systems that can understand and interact with the world meaningfully.

I am passionate about open-source research and about advancing AI technologies that make autonomous driving systems safer and more reliable.

Research Interests

Autonomous Driving, Multimodal Learning, Vision-Language Models (VLM), Vision-Language-Action Models (VLA), World Models

Programming Languages

Python, LaTeX, HTML, Markdown

Frameworks & Tools

PyTorch, MMCV, MMDetection, OpenCV, NumPy, TensorBoard

Research Projects

Mashiro's Blog

Personal blog for sharing research insights and learning notes. It supports user login and registration and includes dynamic visual effects.

VLM Research

Vision-Language Model research exploring multimodal understanding and reasoning capabilities.

Autonomous Driving System

Deep learning-based perception and decision-making systems for autonomous driving.
