About me

I am currently a third-year Ph.D. candidate at Tsinghua University, supervised by Prof. Xiu Li. I was fortunate to be a visiting student at MMLAB@NTU, where I worked with Prof. Ziwei Liu on the intersection of MLLM and human motion. Earlier in my Ph.D. journey, I had the opportunity to explore digital human modeling under the kind guidance of Prof. Yebin Liu. My research interests lie in human motion and interaction synthesis, motion modeling, generative models, and embodied AI.


🆕News

  • [Jun 2025] Recognized as outstanding reviewer at CVPR 2025!
  • [May 2025] Our team was awarded a Gold Medal at the International Exhibition of Inventions Geneva!
  • [Feb 2025] One paper about Text-to-Motion Generation with GPT-4Vision Reward is accepted by CVPR 2025!
  • [Dec 2024] One paper about group dance generation accepted by AAAI!
  • [Sep 2024] One paper about speech driven motion generation accepted by NeurIPS!
  • [Feb 2024] One first-author paper about long dance generation accepted by CVPR 2024!
  • [Dec 2023] One first-author paper about controllable dance generation accepted by ICASSP 2024!
  • [Dec 2023] Two papers about 3D object grounding and gestures generation are accepted by AAAI 2024!
  • [Jul 2023] One first-author paper about music driven dance generation accepted by ICCV 2023!
  • </ul>

    [Please visit my google scholar profile for the full publication list.]

    ✨Selected Publications

    Lodge++: High-quality and Long Dance Generation with Vivid Choreography Patterns
    Ronghui Li, Hongwen Zhang, Yachao Zhang, Yuxiang Zhang, Youliang Zhang, Jie Guo, Yan Zhang, Xiu Li, Yebin Liu.
    arxiv
    [Project][Paper][Code]
    Lodge: A Coarse to Fine Diffusion Network for Long Dance Generation Guided by the Characteristic Dance Primitives
    Ronghui Li, Yuxiang Zhang, Yachao Zhang, Yachao Zhang, Hongwen Zhang, Jie Guo, Yan Zhang, Yebin Liu, Xiu Li.
    IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2024)
    [Project][Paper][Code]
    FineDance: A Fine-grained Choreography Dataset for 3D Full Body Dance Generation
    Ronghui Li, Junfan Zhao, Yachao Zhang, Mingyang Su, Zeping Ren, Han Zhang, Yansong Tang, Xiu Li.
    IEEE/CVF Conference on International Conference on Computer Vision (ICCV 2023)
    [Project][Paper][Code]
    Harmonious Group Choreography with Trajectory-Controllable Diffusion
    Yuqin Dai, Wanlu Zhu, Ronghui Li, Zeping Ren, Xiangzheng Zhou, Xiu Li, Jun Li1, Jian Yang.
    Association for the Advance of Artificial Intelligence (AAAI 2025)
    [Project][Paper][Code Comming]
    MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models
    Zunnan Xu, Yukang Lin, Haonan Han, Sicheng Yang, Ronghui Li, Yachao Zhang, Xiu Li.
    Annual Conference on Neural Information Processing Systems (NeurIPS 2024)
    [Project][Paper]
    Exploring Multi-Modal Control in Music-Driven Dance Generation
    Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li.
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
    [Paper]
    Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute
    Chaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Li.
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2024)
    [Paper]
    Cross-Modal Match for Language Conditioned 3D Object Grounding
    Yachao Zhang, Runze Hu, Ronghui Li, Yanyun Qu, Yuan Xie, Xiu Li.
    Association for the Advance of Artificial Intelligence (AAAI 2024)
    [Paper]
    Chain of Generation: Multi-Modal Gesture Synthesis via Cascaded Conditional Control
    Zunnan Xu,Yachao Zhang,Sicheng Yang,Ronghui Li,Xiu Li.
    Association for the Advance of Artificial Intelligence (AAAI 2024)
    [Paper]