Research
I am particularly interested in Machine
Learning, with a focus on NLP
, Reinforcement Learning and
Multimodal Learning. My research has
been focused on building embodied agents, especially
computer agents, that can excel in solving human tasks and
collaborating effectively with people. I remain curious
and
open to exploring various research questions in ML.
|
|
Scaling Computer-Use Grounding
via User Interface Decomposition and Synthesis
Tianbao
Xie*,
Jiaqi
Deng*,
Xiaochuan
Li*,
Junlin
Yang*,
Haoyuan
Wu,
Jixuan
Chen,
Wenjing
Hu,
Xinyuan
Wang,
Yuhui
Xu,
Zekun
Wang,
Yiheng
Xu,
Junli
Wang,
Doyen
Sahoo,
Tao
Yu†,
Caiming
Xiong†
arXiv, 2025
project
page
/
arXiv
TL; DR:
GUI grounding is essential for computer-use agents but
current benchmarks oversimplify the task. We introduce
OSWorld-G, a detailed benchmark, and Jedi, the largest
GUI grounding dataset with 4M examples. Models trained on
Jedi outperform existing methods and boost AI agents' task
success on OSWorld from 5% to 27%. Our studies show that
specialized, diverse data improves generalization to new
interfaces.
|
Selected Awards and Honors
|
- Overall Excellence Scholarship,
Tsinghua University, 2024
- Overall Excellence Scholarship,
Tsinghua University, 2023
- Freshman Scholarship, Tsinghua
University, 2022
- Outstanding Student Cadre, Tsinghua
University, 2023
|
Languages
|
- Mandarin (Native)
- English (Fluent)
- TOEFL: 110 (R:29, L:29, S:25, W:27)
|
Service and Leadership
|
|
I founded the
first alumni mutual-help
platform in my high school.
Our platform aims to bridge the information gap for high
school students from diverse economic, familial, and
cognitive backgrounds by providing them with
comprehensive
insights into university academic life. By
sharing
experiences, offering guidance, and fostering a supportive
community, the platform has become a vital resource for
students who might otherwise lack access to such
information. To date, it has received over 50k reads and
attracted more than 2.5k followers.
|
Miscellanea
|
Senior Mentors
|
At Tsinghua, I was fortunate to meet some incredibly kind,
talented, and supportive seniors, including Yuxuan
Li and Zirui
Cheng. During my
internship at HKU, I was lucky to work closely with
Tianbao Xie and Yiheng
Xu. I'm also grateful to have
collaborated with Xinyuan
Wang and Bowen
Wang on projects like
AgentNet.
|
Hobbies
|
- Athletics: Tennis, Badminton, jogging
- Arts: Music, film, reading
|
|
The source code is inspired by Jon Barron. Thanks for his
sharing! 🙏
|
|