profile image

Jiaheng Liu

Contact Me
Currently, I am an Assistant Professor at Nanjing University. Previously, I was a Research Scientist (Alibaba Star Project) at Alibaba, focusing on Large Language Models (LLMs), including pretraining, long-context modeling, reasoning and alignment.
I am actively looking for research interns and candidate students to work on LLM-related research topics. Please feel free to drop me an email to liujiaheng@nju.edu.cn if you are interested (2025年4月份有两名9月份入学的考研硕士名额).

Moreover, I interned at SenseTime (working with Dr. Yichao Wu, Dr. Ding Liang), Baidu (working with Dr. Tan Yu), and Shanghai AI Lab (working with Dr. Tong He, Prof. Wanli Ouyang). I got my Ph.D. degree from Beihang University, advised by Prof. Ke Xu and Prof. Dong Xu. Earlier, I obtained my bachelor's degree from Beihang University.

I am currently working on Foundation Models, which is a promising direction to AGI.
● Large Language Models

  • Pretraining: MAP-Neo-7B, OpenCoder, Sailor2, Chinese-Tiny-LLM (COLM 2024), D-CPT Law (NeurIPS 2024), E2-LLM (ACL 2024), MuPT (ICLR 2025), etc
  • Alignment: RoleLLM (ACL Findings 2024), Emulated Disalignment (ACL 2024), 2D-DPO (NAACL 2025), DREAM (NAACL 2025), etc
  • LLM Acceleration: DDK (NeurIPS 2024), JSQ (ICML 2024), ACKD (ACL 2023), etc
  • Code Intelligence: UniCoder (ACL 2024), R2C2-Coder, McEval (ICLR 2025), M2rcEval, MdEval, CodeCriticBench, etc
  • Evaluation : ConceptMath (ACL Findings 2024), MT-Bench-101 (ACL 2024), OWL (ICLR 2024), Chinese SimpleQA, KORBench (ICLR 2025), MTU-Bench (ICLR 2025), SuperGPQA, DeltaBench, etc
● Multimodal Large Language Models
  • Pretraining: MIO
  • Evaluation: II-Bench (NeurIPS 2024), MMRA, OmniBench, LIME, etc
● Previously, I worked on Computer Vision.
  • Face Recognition: ICD-Face (ICCV 2023), OneFace (ECCV 2022), CoupleFace (ECCV 2022), AnchorFace (AAAI 2022), DAM (ICCV 2021), etc
  • Point Cloud Understanding: LTA-PCS (CVPR 2024), SLF (ECCV 2024), VRDistill (ACMMM 2024), GD-MAE (CVPR 2023), APSNet (TIP 2022), GMT (TMM 2022), etc
  • Vision Model Acceleration : BPNAS (TIP 2020), LAW (AAAI 2020), CCKD (ICCV 2019), RCO (ICCV 2019), etc

News

  • [1/2025] Four papers are accepted to ICLR 2025.
  • [1/2025] Two papers are accepted to NAACL 2025.
  • [12/2024] We will organize the Open Science for Foundation Models Workshop in conjunction with ICLR 2025.
  • [12/2024] Two papers are accepted to AAAI 2025.
  • [9/2024] Four papers are accepted to NeurIPS 2024.
  • [9/2024] One paper is accepted to EMNLP 2024.
  • [7/2024] Three papers are accepted to ACMMM 2024.
  • [7/2023] One paper is accepted to CIKM 2024.
  • [7/2024] One paper is accepted to COLM 2024.
  • [7/2024] One paper is accepted to ECCV 2024.
  • [5/2024] Seven papers are accepted to ACL 2024.
  • [5/2024] One paper is accepted to ICML 2024.
  • [2/2024] One paper is accepted to CVPR 2024.
  • [2/2024] One paper is accepted to COLING 2024.
  • [11/2023] One paper is accepted to AAAI 2024.
  • [2/2023] One paper is accepted to CVPR 2023.
  • [7/2023] One paper is accepted to ICCV 2023.
  • [9/2023] One paper is accepted to EMNLP 2023.
  • [5/2023] One paper is accepted to ACL 2023.

Publications (Update Soon) [Full List]

(* equal contribution, # corresponding author)
  • PTSBench: A Comprehensive Post-Training Sparsity Benchmark Towards Algorithms and Models
    Zining Wang, Jinyang Guo, Ruihao Gong, Yang Yong, Aishan Liu, Yushi Huang, Jiaheng Liu, Xianglong Liu
    ACM Multimedia (ACM MM), 2024

  • MaterialSeg3D: Segmenting Dense Materials from 2D Priors for 3D Assets
    Zeyu Li, Ruitong Gan, Chuanchen Luo, Yuxi Wang, Jiaheng Liu, Ziwei Zhu, Man Zhang, Qing Li, Zhaoxiang Zhang, Junran Peng, Xu-Cheng Yin
    ACM Multimedia (ACM MM), 2024

  • VRDistill: Vote Refinement Distillation for Efficient Indoor 3D Object Detection
    Ze Yuan, Jinyang Guo, Dakai An, Junran Wu, He Zhu, Jianhao Li, Xueyuan Chen, Ke Xu, Jiaheng Liu#
    ACM Multimedia (ACM MM), 2024

  • Chinese Tiny LLM: Pretraining a Chinese-Centered Large Language Model
    Xinrun Du, Zhouliang Yu, Songyang Gao, Ding Pan, Cheng Yuyang, Ziyang Ma, Ruibin Yuan, Xingwei Qu, Jiaheng Liu, Tianyu Zheng, Xinchen Luo, Guorui Zhou, Binhang Yuan, Wenhu Chen, Jie Fu, Ge Zhang
    Conference on Language Modeling (COLM), 2024

  • Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts
    Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu#, Ping Luo
    European Conference on Computer Vision (ECCV), 2024

  • Emulated Disalignment: Safety Alignment for Large Language Models May Backfire!
    Zhanhui Zhou, Jie Liu, Zhichen Dong, Jiaheng Liu, Chao Yang, Wanli Ouyang, Yu Qiao
    Association for Computational Linguistics (ACL), 2024

  • UniCoder: Scaling Code Large Language Model via Universal Code
    Tao Sun, Linzheng Chai, Jian Yang, Yuwei Yin, Hongcheng Guo, Jiaheng Liu, Bing Wang, Liqun Yang, Zhoujun Li
    Association for Computational Linguistics (ACL), 2024

  • Towards Real-world Scenario: Imbalanced New Intent Discovery
    Shun Zhang, Chaoran Yan, Jian Yang, Jiaheng Liu, Ying Mo, Jiaqi Bai, Tongliang Li, Zhoujun Li
    Association for Computational Linguistics (ACL), 2024

  • MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
    Ge Bai, Jie Liu, Xingyuan Bu, Yancheng He, Jiaheng Liu, Zhanhui Zhou, Zhuoran Lin, Wenbo Su, Tiezheng Ge, Bo Zheng, Wanli Ouyang
    Association for Computational Linguistics (ACL), 2024

  • E2-LLM: Efficient and Extreme Length Extension of Large Language Models
    Jiaheng Liu*, Zhiqi Bai*, Yuanxing Zhang, Chenchen Zhang, Yu Zhang, Ge Zhang, Jiakai Wang, Haoran Que, Yukang Chen, Wenbo Su, Tiezheng Ge, Jie Fu, Wenhu Chen, Bo Zheng
    Findings of Association for Computational Linguistics (ACL), 2024

  • ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models
    Yanan Wu*, Jie Liu*, Xingyuan Bu*,Jiaheng Liu#, Zhanhui Zhou, Yuanxing Zhang, Chenchen Zhang, Zhiqi Bai, Haibin Chen, Tiezheng Ge, Wanli Ouyang, Wenbo Su, Bo Zheng
    Findings of Association for Computational Linguistics (ACL), 2024

  • RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models
    Zekun Wang*, Zongyuan Peng*, Haoran Que*, Jiaheng Liu#, Wangchunshu Zhou, Yuhan Wu, Hongcheng Guo, Ruitong Gan, Zehao Ni, Jian Yang, Man Zhang, Zhaoxiang Zhang, Wanli Ouyang, Ke Xu, Wenhao Huang, Wenhu Chen, Jie Fu, Junran Peng
    Findings of Association for Computational Linguistics (ACL), 2024

  • Compressing Large Language Models by Joint Sparsification and Quantization
    Jinyang Guo, Jianyu Wu, Zining Wang, Jiaheng Liu, Ge Yang, Yifu Ding, Ruihao Gong, Haotong Qin, Xianglong Liu
    International Conference on Machine Learning (ICML), 2024

  • LTA-PCS: Learnable Task-Agnostic Point Cloud Sampling
    Jiaheng Liu*, Jianhao Li*, Jinyang Guo, Kaisiyuan Wang, Hongcheng Guo, Jian Yang, Junran Peng, Xianglong Liu, Ke Xu
    IEEE Computer Vision and Pattern Recognition (CVPR), 2024

  • m3P: Towards Multimodal Multilingual Translation with Multimodal Prompt
    Jian Yang, Hongcheng Guo, Yuwei Yin, Jiaqi Bai, Bing Wang, Jiaheng Liu, Xinnian Liang, LinZheng Chai, Liqun Yang, Zhoujun Li
    Joint International Conference on Computational Linguistics, Language Resources and Evaluation (COLING), 2024

  • OWL: A Large Language Model for IT Operations
    Hongcheng Guo, Jian Yang#, Jiaheng Liu#, Liqun Yang, Linzheng Chai, Jiaqi Bai, Junran Peng, Xiaorong Hu, Chao Chen, Dongfeng Zhang, Xu Shi, Tieqiao Zheng, Liangfan Zheng, Bo Zhang, Ke Xu, Zhoujun Li
    International Conference on Learning Representations (ICLR), 2024

  • LogFormer: A Pre-train and Tuning Pipeline for Log Anomaly Detection
    Hongcheng Guo, Jian Yang, Jiaheng Liu, Jiaqi Bai, Boyang Wang, Zhoujun Li, Tieqiao Zheng, Bo Zhang, Junran Peng, Qi Tian
    AAAI Conference on Artificial Intelligence (AAAI), 2024

  • M2C: Towards Automatic Multimodal Manga Complement
    Hongcheng Guo, Boyang Wang, Jiaqi Bai, Jiaheng Liu, Jian Yang, Zhoujun Li
    Findings of Empirical Methods in Natural Language Processing (EMNLP), 2023

  • ICD-Face: Intra-class Compactness Distillation for Face Recognition
    Zhipeng Yu, Jiaheng Liu#, Haoyu Qin, Yichao Wu, Kun Hu, Jiayi Tian, Ding Liang
    IEEE International Conference on Computer Vision (ICCV), 2023

  • GripRank: Bridging the Gap between Retrieval and Generation via the Generative Knowledge Improved Passage Ranking
    Jiaqi Bai, Hongcheng Guo, Jiaheng Liu, Jian Yang, Xinnian Liang, Zhao Yan, Zhoujun Li
    ACM International Conference on Information and Knowledge Management (CIKM), 2023

  • Adaptive Contrastive Distillation for BERT Compression
    Jinyang Guo*, Jiaheng Liu*, Zining Wang, Yuqing Ma, Ruihao Gong, Ke Xu, Xianglong Liu
    Findings of Association for Computational Linguistics (ACL), 2023

  • GD-MAE: Generative Decoder for MAE Pre-training on LiDAR Point Clouds
    Honghui Yang, Tong He, Jiaheng Liu, Hua Chen, Boxi Wu, Binbin Lin, Xiaofei He, Wanli Ouyang
    IEEE Computer Vision and Pattern Recognition (CVPR), 2023

  • LogLG: Weakly Supervised Log Anomaly Detection via Log-Event Graph Construction
    Hongcheng Guo, Yuhui Guo, Jian Yang, Jiaheng Liu#, Zhoujun Li, Tieqiao Zheng, Liangfan Zheng, Weichao Hou, Bo Zhang
    International Conference on Database Systems for Advanced Applications (DASFAA), 2023

  • LVP-M3: Language-aware Visual Prompt for Multilingual Multimodal Machine Translation
    Hongcheng Guo*, Jiaheng Liu*, Haoyang Huang, Jian Yang, Zhoujun Li, Dongdong Zhang, Zheng Cui
    Empirical Methods in Natural Language Processing (EMNLP), 2022

  • 3D-Pruning: A Model Compression Framework for Efficient 3D Action Recognition
    Jinyang Guo*, Jiaheng Liu*, Dong Xu
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2022

  • GeometryMotion-Transformer: An End-to-End Framework for 3D Action Recognition
    Jiaheng Liu, Jinyang Guo, Dong Xu
    IEEE Transactions on Multimedia (TMM), 2022

  • OneFace: One Threshold for All
    Jiaheng Liu, Zhipeng Yu, Haoyu Qin, Yichao Wu, Ding Liang, Gangming Zhao, Ke Xu
    European Conference on Computer Vision (ECCV), 2022

  • CoupleFace: Relation Matters for Face Recognition Distillation
    Jiaheng Liu, Haoyu Qin, Yichao Wu, Jinyang Guo, Ding Liang, Ke Xu
    European Conference on Computer Vision (ECCV), 2022

  • APSNet: Towards Adaptive Point Sampling for Efficient 3D Action Recognition
    Jiaheng Liu*, Jinyang Guo*, Dong Xu
    IEEE Transactions on Image Processing (TIP), 2022

  • Deep 3D Vessel Segmentation based on Cross Transformer Network
    Chengwei Pan, Baolian Qi, Gangming Zhao, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li
    International Conference on Bioinformatics and Biomedicine (BIBM), 2022

  • Computer-aided Tuberculosis Diagnosis with Attribute Reasoning Assistance
    Chengwei Pan, Gangming Zhao, Junjie Fang, Baolian Qi, Jiaheng Liu, Chaowei Fang, Dingwen Zhang, Jinpeng Li, Yizhou Yu
    Medical Image Computing and Computer Assisted Interventions (MICCAI), 2022

  • Cross-Lingual Cross-Modal Consolidation for Effective Multilingual Video Corpus Moment Retrieval
    Jiaheng Liu, Tan Yu, Hanyu Peng, Mingming Sun, Ping Li
    Findings of North American Chapter of the Association for Computational Linguistics (NAACL), 2022

  • AnchorFace: Boosting TAR@FAR for Practical Face Recognition
    Jiaheng Liu, Haoyu Qin, Yichao Wu, Ding Liang
    AAAI Conference on Artificial Intelligence (AAAI), 2022

  • DAM: Discrepancy Alignment Metric for Face Recognition
    Jiaheng Liu, Yudong Wu, Yichao Wu, Chuming Li, Xiaolin Hu, Ding Liang, Mengyu Wang
    IEEE International Conference on Computer Vision (ICCV), 2021

  • JointPruning: Pruning Networks along Multiple Dimensions for Efficient Point Cloud Processing
    Jinyang Guo, Jiaheng Liu, Dong Xu
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021

  • GeometryMotion-Net: A Strong Two-stream Baseline for 3D Action Recognition
    Jiaheng Liu, Dong Xu
    IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2021

  • Block Proposal Neural Architecture Search
    Jiaheng Liu, Shunfeng Zhou, Yichao Wu, Ken Chen, Wanli Ouyang, Dong Xu
    IEEE Transactions on Image Processing (TIP), 2020

  • Learning to Auto Weight: Entirely Data-driven and Highly Efficient Weighting Framework
    Zhenmao Li, Yichao Wu, Ken Chen, Yudong Wu, Shunfeng Zhou, Jiaheng Liu, Junjie Yan
    AAAI Conference on Artificial Intelligence (AAAI), 2020

  • Correlation Congruence for Knowledge Distillation
    Baoyun Peng, Xiao Jin, Jiaheng Liu, Shunfeng Zhou, Yichao Wu, Yu Liu, Dongsheng Li, Zhaoning Zhang
    IEEE International Conference on Computer Vision (ICCV), 2019

  • Knowledge Distillation via Route Constrained Optimization
    Xiao Jin, Baoyun Peng, Yichao Wu, Yu Liu, Jiaheng Liu, Ding Liang, Junjie Yan, Xiaolin Hu
    IEEE International Conference on Computer Vision (ICCV), 2019

  • To be updated