Run-Ze Fan

PhD Student
Manning College of Information & Computer Sciences
University of Massachusetts Amherst
Email runze.fan(at)icloud(dot)com
runzefan(at)umass(dot)edu

zhihu

Profile

I am an incoming PhD student in Manning College of Information & Computer Sciences, University of Massachusetts Amherst, asvised by Prof. Hamed Zamani. Currently, I am a research assistant at Generative AI Research Lab (GAIR) to explore Generative AI, fortunately working with Prof. Pengfei Liu. Before that, I received the M.S. degree in Computer Technology at Institute of Computing Technology (ICT) of Chinese Academy of Sciences (CAS) supervised by Prof. Jiafeng Guo in 2024 and the B.E. degree in computer science and technology from Shanghai Maritime University in 2021.

Research Interests: My primary research interests include natural language processing, large language models, and machine learning. Specifically, My current research focuses:
  • LLMs Pre-training, Post-training, and Evaluation
  • Data Science and Engineering
  • LLM for Science (Especially Mathematics)
  • Digital Agent

I am happy to collaborate and/or answer questions about my research. If you are interested in research collaboration or have any inquiries about my experience, please send me an email.

News

Publications

google scholar | semantic scholar | dblp
(* indicates equal contribution)
  • Generative AI Act II: Test Time Scaling Drives Cognition Engineering
    Shijie Xia, Yiwei Qin, Xuefeng Li, Yan Ma, Run-Ze Fan, Steffi Chern, Haoyang Zou, Fan Zhou, Xiangkun Hu, Jiahe Jin, Yanheng He, Yixin Ye, Yixiu Liu, Pengfei Liu
    arXiv, 2025
    [PDF] [Abstract] [Bib] [Code] [Page] [机器之心]
    GitHub Repo stars
  • PC Agent: While You Sleep, AI Works -- A Cognitive Journey into Digital World
    Yanheng He*, Jiahe Jin*, Shijie Xia, Jiadi Su, Runze Fan, Haoyang Zou, Xiangkun Hu, Pengfei Liu.
    arXiv, 2024
    [PDF] [Abstract] [Bib] [Code] [Page] [机器之心]
    GitHub Repo stars
  • Data Contamination Report from the 2024 CONDA Shared Task
    Oscar Sainz, Iker García-Ferrero, Alon Jacovi, Jon Ander Campos, Yanai Elazar, Eneko Agirre, Yoav Goldberg, Wei-Lin Chen, Jenny Chim, Leshem Choshen, Luca D'Amico-Wong, Melissa Dell, Run-Ze Fan, Shahriar Golchin, Yucheng Li, Pengfei Liu, Bhavish Pahwa, Ameya Prabhu, Suryansh Sharma, Emily Silcock, Kateryna Solonko, David Stap, Mihai Surdeanu, Yu-Min Tseng, Vishaal Udandarao, Zengzhi Wang, Ruijie Xu, Jinglin Yang.
    ACL 2024 The 1st Workshop on Data Contamination (CONDA)
    [PDF] [Abstract] [Bib] [Page] [Data Contamination Database]
  • OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI
    Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang, Dahua Lin, Yu Qiao, Pengfei Liu.
    NeurIPS 2024 Datasets and Benchmarks
    [PDF] [Abstract] [Bib] [Code] [Page] [Featured by AK] [机器之心]
    GitHub Repo stars
  • Benchmarking Benchmark Leakage in Large Language Models
    Ruijie Xu*, Zengzhi Wang*, Run-Ze Fan*, Pengfei Liu.
    arXiv, 2024
    [PDF] [Abstract] [Bib] [Code] [Page] [HuggingFace Demo]
    GitHub Repo stars
  • Reformatted Alignment
    Run-Ze Fan, Xuefeng Li, Haoyang Zou, Junlong Li, Shwai He, Ethan Chern, Jiewen Hu, Pengfei Liu.
    EMNLP, 2024, Findings
    [PDF] [Abstract] [Bib] [Code] [Page] [Featured by AK] [量子位]
    GitHub Repo stars
  • RIGHT: Retrieval-augmented Generation for Mainstream Hashtag Recommendation
    Run-Ze Fan, Yixing Fan, Jiangui Chen, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng.
    ECIR, 2024
    [PDF] [Abstract] [Bib] [Code]
    GitHub Repo stars
  • Generative Judge for Evaluating Alignment
    Junlong Li, Shichao Sun, Weizhe Yuan, Run-Ze Fan, Hai Zhao, Pengfei Liu.
    ICLR, 2024
    [PDF] [Abstract] [Bib] [Code] [Page] [机器之心]
    GitHub Repo stars
  • Merging Experts into One: Improving Computational Efficiency of Mixture of Experts
    Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao.
    EMNLP, 2023 (Oral)
    [PDF] [Abstract] [Bib] [Code]
    GitHub Repo stars
  • MerA: Merging Pretrained Adapters For Few-Shot Learning
    Shwai He, Run-Ze Fan, Liang Ding, Li Shen, Tianyi Zhou, Dacheng Tao.
    arXiv, 2023
    [PDF] [Abstract] [Bib]

Talks

  • 2024/10: Reformatted Alignment @AITIME, NICE.

Education & Research Experience

Blogs

Selected Honors & Awards

  • 2024: Excellent Master’s Graduation Thesis, Institute of Computing Technology

  • 2021: Excellent Bachelor's Graduation Thesis, Shanghai Maritime University

  • 2021: Excellent Graduate, Shanghai Maritime University

  • 2019, 2020, 2021: First Class Scholarship, Shanghai Maritime University

Academic Service

  • Reviewer:
    • ICLR (2025), NeurIPS (2025)
    • EMNLP (2023), ARR (Feb 2024), EMNLP Industry Track (2023, 2024, 2025), NAACL Industry Track (2025)

Miscellaneous

  • Powerlifting (3yrs+): At a body weight of 70 kg, my prs are: Bench Press (85kg), Squat (110kg), and Deadlift (155kg).