Contact me:

<aside> <img src="/icons/mail_gray.svg" alt="/icons/mail_gray.svg" width="40px" /> Email

</aside>

<aside> <img src="/icons/graduate_gray.svg" alt="/icons/graduate_gray.svg" width="40px" /> Google Scholar

</aside>

<aside> <img src="/icons/git_gray.svg" alt="/icons/git_gray.svg" width="40px" /> GitHub

</aside>

<aside> <img src="/icons/following_gray.svg" alt="/icons/following_gray.svg" width="40px" /> Twitter (X)

</aside>

News:

<aside>

2024.09.26

🎉 One paper on LLM and LMM evaluation was accepted to NeurIPS 2024.

</aside>

I'm Zhen Huang (“黄臻” in Chinese), currently a senior student majoring in Computer Science and Technology at Soochow University. I am also an incoming Ph.D. student at Fudan University, under the supervision of Pengfei Liu (primary advisor, affiliated with the GAIR Lab at SJTU) and Xipeng Qiu (affiliated with FudanNLP).

My research interests are rooted in Artificial Intelligence, particularly Natural Language Processing, Large Language Models, and Generative AI. My research focuses on enhancing the reasoning capabilities of large language (and vision) models and on evaluating them reliably and robustly (e.g., benchmark construction). My passion for AI drives my dedication to advancing this field and contributing meaningful innovations and solutions.

Publications


OlympicArena Medal Ranks: Who Is the Most Intelligent AI So Far?

Zhen Huang, Zengzhi Wang, Shijie Xia, Pengfei Liu†

arXiv preprint. 2024. (technical report)

[Paper][机器之心]

OlympicArena: Benchmarking Multi-discipline Cognitive Reasoning for Superintelligent AI

Zhen Huang, Zengzhi Wang, Shijie Xia, Xuefeng Li, Haoyang Zou, Ruijie Xu, Run-Ze Fan, Lyumanshan Ye, Ethan Chern, Yixin Ye, Yikai Zhang, Yuqing Yang, Ting Wu, Binjie Wang, Shichao Sun, Yang Xiao, Yiyuan Li, Fan Zhou, Steffi Chern, Yiwei Qin, Yan Ma, Jiadi Su, Yixiu Liu, Yuxiang Zheng, Shaoting Zhang†, Dahua Lin†, Yu Qiao†, Pengfei Liu†

NeurIPS 2024 Datasets and Benchmarks Track

[Paper][Code][Homepage][🤗 Datasets][🤗 Competition][Featured by AK][机器之心]

Sequential Recommendation with Diffusion Models

Hanwen Du, Huanhuan Yuan, Zhen Huang, Pengpeng Zhao†, Xiaofang Zhou

arXiv preprint. 2023.

[Paper]

Projects


O1 Journey [Github]

<aside> 💡

An exploration of the “deep thinking” abilities of o1/o3-like AI models.

</aside>

O1 Replication Journey – Part 3: Inference-time Scaling for Medical Reasoning

Zhongzhen Huang*, Gui Geng*, Shengyi Hua*, Zhen Huang*, Haoyang Zou*, Shaoting Zhang†, Pengfei Liu†, Xiaofan Zhang†

arXiv preprint. 2025.

[Paper][Featured by AK][ScienceAI]

O1 Replication Journey – Part 2: Surpassing O1-preview through Simple Distillation (Big Progress or Bitter Lesson?)

Zhen Huang*, Haoyang Zou*, Xuefeng Li*, Yixiu Liu*, Yuxiang Zheng*, Ethan Chern*, Shijie Xia*, Yiwei Qin, Weizhe Yuan, Pengfei Liu†

arXiv preprint. 2024.

[Paper][Featured by AK][机器之心]

O1 Replication Journey: A Strategic Progress Report – Part 1

Yiwei Qin*, Xuefeng Li*, Haoyang Zou*, Yixiu Liu*, Shijie Xia*, Zhen Huang, Yixin Ye, Weizhe Yuan, Hector Liu, Yuanzhi Li, Pengfei Liu†

arXiv preprint. 2024.

[Paper][机器之心]

Education


2021.9-2025.6 (expected)

B.Eng. in Software Engineering, School of Computer Science and Technology, Soochow University

GPA 4.0/4.0, Ranking 1/95

I conduct research on data mining and recommender systems at ADALab under the supervision of Pengpeng Zhao. ADALab was founded by Prof. Xiaofang Zhou (currently head of the Computer Science department at HKUST).