terryyz.github.io

I am an IBM PhD Fellow (2024 - 2025), and a PhD student at CSIRO's Data61 and Monash University.

My research focuses on Code Intelligence + X (e.g., agentic workflow, system efficiency, cybersecurity).

I will be organizing the 1st tutorial on "NLP+Code: Code Intelligence in Language Models" at EMNLP'25 in Suzhou, China. Please stay tuned for the update and consider attending the conference!

I will serve as a Senior Area Chair for EMNLP 2025. Looking forward to your high-quality submissions!

I am open to collaboration on code intelligence. Feel free to contact me if you are interested in working together.
Schedule the meeting with me.

My Working Principles

I typically work at least 10 hours a day, unless I decide to take a break.
Choose wisely and commit wholeheartedly.
Paper is cheap; they will be published eventually.
I prioritize long-term research impact over quick wins in publications.
Any upcoming research I lead will require significant effort and can often be split into multiple papers (though I choose not to).
I am a generalist, always open to new opportunities, depending on my bandwidth.

Why Code Intelligence?

Software is the backbone of all advanced technology, making code the bedrock of innovation. [1, 2, 3]
Code's structured and executable nature allows for precise control and automation. [1, 4]
Code is the most potent tool for AI to interact with and manipulate the world. [1]
Code intelligence can dramatically enhance problem-solving capabilities across various domains. [1]
Advancements in code intelligence lead to more efficient, secure, and scalable solutions. [5, 6, 7]

Fun Facts

I speak Shanghainese, Mandarin, and English.
I started my research in Quantum Computing, not AI.
During my undergraduate years, I explored a wide range of NLP topics, including but not limited to: Commonsense Reasoning, Multimodal Learning, Machine Translation, and Semantic Parsing.
In a previous life, I was a semiprofessional swimmer, training for 10 years.
I'm an amateur pianist, playing for 8 years.
Before high school, I had traveled to most European countries.

Recent Publications

BigCode Team, “BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions ”, ICLR 2025. (Among Top 5 Highest-Rated Papers! 🌟 Before→After Rebuttal: 6→8, 8→8, 8→10, 10→10) [Github]

BigCode Team, “Parameter-Efficient Instruction Tuning Code Large Language Models: An Empirical Study”, DL4Code workshop @ ICLR, 2025.

BigCode Team, “OctoPack: Instruction Tuning Code Large Language Models”, International Conference on Learning Representations (ICLR), 2024. [Github]

Terry Yue Zhuo, “ICE-Score: Instructing Large Language Models to Evaluate Code”, Findings of European Chapter of the Association for Computational Linguistics (EACL), 2024. [Github]

BigCode Team, “StarCoder: May The Source Be With You!”, Transactions on Machine Learning Research (TMLR), 2023. [Tweet 1] [Tweet 2] [TechCrunch] [StarCoderBase] [StarCoder] [Github]

BigCode Team, “SantaCoder: don't reach for the stars!”, Deep Learning For Code workshop (DL4C) @ ICLR, 2023. [Best Paper Award] [Tweet 1] [Tweet 2] [SantaCoder]

卓 ㄓㄨㄛˊ越ㄩㄝˋ ?

Recent Publications

卓ㄓㄨㄛˊ越ㄩㄝˋ