Junlin Yang

Hi, I'm Junlin Yang, a third-year student in Department of Computer Science and Technology at Tsinghua University. I'll be fortunate to visit Siebel School of Computing and Data Science at University of Illinois at Urbana-Champaign as a summer research intern in 2025 summer, where I'll be advised by Prof. Hao Peng. I've had the privilege of being a research intern at the XLANG Lab at The University of Hong Kong, advised by Prof. Tao Yu.

Feel free to reach out if you're interested in my research, looking for collaboration, or just want to chat!

Email  /  Twitter  /  Bluesky  /  Github  /  Google Scholar

profile photo

Research

I am particularly interested in Machine Learning, with a focus on NLP , Reinforcement Learning and Multimodal Learning. My research has been focused on building embodied agents, especially computer agents, that can excel in solving human tasks and collaborating effectively with people. Recently, I am interested in developing scalable methods towards general, self-evolving reasoning intelligence.

* indicates equal contribution
OpenCUA: Open Foundations for Computer-Use Agents
Xinyuan Wang*, Bowen Wang*, Dunjie Lu*, Junlin Yang*, Tianbao Xie*, Junli Wang*, Jiaqi Deng, Xiaole Guo, Yiheng Xu, Chen Henry Wu, Zhennan Shen, Zhuokai Li, Ryan Li, Xiaochuan Li, Junda Chen, Boyuan Zheng, Peihang Li, Fangyu Lei, Ruisheng Cao, Yeqiao Fu, Dongchan Shin, Martin Shin, Jiarui Hu, Yuyan Wang, Jixuan Chen, Yuxiao Ye, Danyang Zhang, Yipu Wang, Heng Wang, Diyi Yang, Victor Zhong, Y.Charles, Zhilin Yang, Tao Yu
COLM 2025 Workshop AIA (Oral)
project page

TL; DR: We present OpenCUA, a comprehensive open-source framework for scaling CUA data and foundation models which includes an annotation infrastructure, the first large-scale computer-use task dataset and a scalable pipeline that transforms demonstrations into state–action pairs with reflective long Chain-of-Thought reasoning. Our end-to-end agent model, OpenCUA-32B achieves an average success rate of 32.5% on OSWorld-Verified, establishing a new state-of-the-art (SOTA) among open-source models and surpassing OpenAI CUA (GPT-4o).

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Tianbao Xie*, Jiaqi Deng*, Xiaochuan Li*, Junlin Yang*, Haoyuan Wu, Jixuan Chen, Wenjing Hu, Xinyuan Wang, Yuhui Xu, Zekun Wang, Yiheng Xu, Junli Wang, Doyen Sahoo, Tao Yu, Caiming Xiong
arXiv, 2025
project page / arXiv / Code

TL; DR: GUI grounding is essential for computer-use agents but current benchmarks oversimplify the task. We introduce OSWorld-G, a detailed benchmark, and Jedi, the largest GUI grounding dataset with 4M examples. Models trained on Jedi outperform existing methods and boost AI agents' task success on OSWorld from 5% to 27%. Our studies show that specialized, diverse data improves generalization to new interfaces.

Projects

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments
Code
TL; DR: OSWorld is a first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across operating systems. It has been adopted by CUA research teams at OpenAI, Anthropic, Bytedance, etc. It has gained over 2k stars on GitHub.

Selected Awards and Honors

  • Overall Excellence Scholarship, Tsinghua University, 2024
  • Overall Excellence Scholarship, Tsinghua University, 2023
  • Freshman Scholarship, Tsinghua University, 2022
  • Outstanding Student Cadre, Tsinghua University, 2023

Languages

  • Mandarin (Native)
  • English (Fluent)
  • TOEFL: 110 (R:29, L:29, S:25, W:27)

Service and Leadership

I founded the first alumni mutual-help platform Chi Mu Zhi Jie(尺木之阶) in Shenzhen Experimental High School, my high school alma mater. Our platform aims to bridge the information gap for high school students from diverse economic, familial, and cognitive backgrounds by providing them with comprehensive insights into university academic life. By sharing experiences, offering guidance, and fostering a supportive community, the platform has become a vital resource for students who might otherwise lack access to such information. To date, it has received over 50k reads and attracted more than 2.5k followers.

Miscellanea

Senior Mentors

At Tsinghua, I was fortunate to meet some incredibly kind, talented, and supportive seniors, including Yuxuan Li and Zirui Cheng. During my internship at HKU, I was lucky to work closely with Tianbao Xie and Yiheng Xu. I'm also grateful to have collaborated with Xinyuan Wang and Bowen Wang on projects like AgentNet.

Hobbies

  • Athletics: Tennis, Badminton, jogging
  • Arts: Music, film, reading

The source code is inspired by Jon Barron. Thanks for his sharing! 🙏