Selected Publications

See all articles on Google Scholar.
SWE-Bench Mobile: Can Large Language Model Agents Develop Industry-Level Mobile Applications?
Muxin Tian*, Zhe Wang*, Blair Yang, Zhenwei Tang, Kunlun Zhu, Honghua Dong, Hanchen Li, Xinni Xie, Guangjing Wang, Jiaxuan You
arXiv preprint, 2026
Where LLM Agents Fail and How They can Learn From Failures
Kunlun Zhu*, Muxin Tian*, Zijia Liu*, Bingxuan Li*, Yingxuan Yang, Jiaxun Zhang, Pengrui Han, Qipeng Xie, Fuyang Cui, Weijia Zhang, Xiaoteng Ma, Xiaodong Yu, Gowtham Ramesh, Jialian Wu, Zicheng Liu, Pan Lu, James Zou, Jiaxuan You
arXiv preprint, 2025
OasisSimp: An Open-source Asian-English Sentence Simplification Dataset
Hannah Liu*, Muxin Tian*, Iqra Ali, Haonan Gao, Qiaoyiwen Wu, Blair Yang, Uthayasanker Thayasivam, En-Shiun Annie Lee, Pakawat Nakwijit, Surangika Ranathunga, Ravi Shekhar
LREC, 2026

* denotes equal contribution

denotes project leader