BigCode Team, “BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions”, ICLR 2025. (Among Top 5 Highest-Rated Papers! 🌟 Before→After Rebuttal: 6→8, 8→8, 8→10, 10→10) [Github]
I am an IBM PhD Fellow (2024 - 2025) and a PhD student at CSIRO's Data61 and Monash University.
I currently lead the Computer Intelligence Project, a core project of the AI Alliance,
in collaboration with various entities (e.g., ServiceNow Research, Hugging Face, and IBM Research).
My research focuses on computer-level code intelligence, particularly function calling, code generation, and agentic workflows.
I also do some research in software engineering (e.g., vulnerability detection and mobile applications).
The long-term goal is to build AGI with code/executable languages.
I will be organizing the first tutorial on "NLP+Code: Code Intelligence in Language Models" at EMNLP'25 in Suzhou, China. Please stay tuned for updates and consider attending the conference!
Check out our latest work, SWE Arena: an Open Evaluation Platform for AI Coding and Automated Software Development.
SWE Arena can execute and render any code in real time! The platform is 100% free to use and will soon be open-sourced. Happy coding and voting!
I am open to collaboration. Feel free to contact me if you are interested in working together.
Schedule a meeting with me.
My Working Principles
Why Code Intelligence?
Fun Facts
Recent Publications
BigCode Team, “OctoPack: Instruction Tuning Code Large Language Models”, International Conference on Learning Representations (ICLR), 2024. [Github]
Terry Yue Zhuo, “ICE-Score: Instructing Large Language Models to Evaluate Code”, Findings of European Chapter of the Association for Computational Linguistics (EACL), 2024. [Github]
BigCode Team, “Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models”, Manuscript, 2024.
BigCode Team, “StarCoder: May The Source Be With You!”, Transactions on Machine Learning Research (TMLR), 2023. [Tweet 1] [Tweet 2] [TechCrunch] [StarCoderBase] [StarCoder] [Github]
BigCode Team, “SantaCoder: don't reach for the stars!”, Deep Learning For Code workshop (DL4C) @ ICLR, 2023. [Best Paper Award] [Tweet 1] [Tweet 2] [SantaCoder]