Daoyuan Chen

Hi there! I am currently a staff at Data Analytics and Intelligence Lab, Alibaba Tongyi. My interest largely lies in the research, systems, and their practical applications related to efficient Machine Learning, Federated Learning (FL), Large Language Models (LLMs) and Multi-modal Learning.

I’ve published over 30 technical papers, a number of which I’ve led as the first author and were presented at top-tier conferences such as ICML, NeurIPS, ICLR, KDD, SIGMOD, ACL and SIGIR. In addition to this, I’m glad to have the opportunity to be founding/core contributor for several open-source projects, such as Data-Juicer & DJ-SORA (one-stop multi-modal data processing for LLMs), FS-Real (an enhanced system enables scalable cross-device FL on phones and cars), pFL-Bench (a comprehensive benchmark for personalized FL), FederatedScope (an easy-to-use FL platform) and AgentScope (a multi-agent LLM platform).

Contact: daoyuanchen.cdy AT alibaba-inc.com; chendaoyuan AT pku.edu.cn

Working Experiences

  • 2023 - Now, Data Analytics and Intelligence Lab, Alibaba Tongyi
  • July 2019 - 2023, Data Analytics and Intelligence Lab, Alibaba DAMO Academy
  • Research Intern, March 2018 - June 2018, Tencent Medical AI Lab
  • Research Assistant, October 2016 - August 2017, Multimedia Software Engineering Research Center @ City University of Hong Kong

Professional Activities

  • Conference PC/Reviewer: NeurIPS, ICML, ICLR, KDD, ACL, CVPR, EMNLP, NAACL, ICCV, ECCV, IJCAI, CIKM, COLM
  • Journal PC/Reviewer: Expert Systems with Applications, Neurocomputing, Neural Networks, Patterns, IEEE Transactions on Big Data, Artificial Intelligence In Medicine
  • Tutorial Organizer: KDD 2022, KDD 2024
  • Competition Organizer: data leaderboards for LLMs including FT-Data Ranker, BetterMixture and ModelScope-Sora

Education

  • M.S., 2016 - 2019, Computer Application Technology, Peking University. (Supervised by Kai Lei & Ying Shen).
  • B.E., 2012 - 2016, Computer Science and Technology, University of Electronic Science and Technology of China.

Awards

  • KDD Cup, AutoML-Graph Track, 4/149, 2020 (our solution)
  • Excellent Graduates, Peking University, 2019
  • COLING Best Paper Nominations, 2018
  • ACM SIGIR Student Travel Grant, 2018
  • Excellent Graduates, University of Electronic Science and Technology of China, 2016

Selected Works [ Google Scholar | DBLP ]

(# indicates equal contribution to the first author; ^ indicates industrial mentor to the first student author.)

LLM (data, privacy-preserving fine-tuning, systems)

Federated Learning (on-device, personalization, systems)

Efficient Machine Learning (adaptiveness, dynamics, applications)

Misc.

Creativity is intelligence having fun.

When it comes to leisure, I enjoy basketball, photography, playing the guitar, and listening to music - R&B and hip-hop being my genre of choice.