🎯 I am actively looking for students to join my research group @ UCLA CS. Solid coding skills and experience in large language model pre-/post-training, agentic systems, program analysis and verification, or software security are strongly preferred.
🤝 If you are interested in working with me, drop me an email with (1) your CV and (2) a brief introduction to your research interests and background.
|
|
SWE-Spot: Building Small Repo-Experts with Repository-Centric Learning
Jinjun Peng, Magnus Saebo, Tianjun Zhong, Yi-Jie Cheng, Junfeng Yang, Baishakhi Ray, Simin Chen, Yangruibo Ding
Preprint
|
|
Position: To Defend Against Cyber Attacks, We Must Teach AI Agents to Hack
Terry Yue Zhuo, Yangruibo Ding, Wenbo Guo, Ruijie Meng
ICML 2026
|
|
OpenSage: Self-programming Agent Generation Engine
Hongwei Li, Zhun Wang, Qinrun Dai, Yuzhou Nie, Jinjun Peng, Ruitong Liu, Jingyang Zhang, Kaijie Zhu, Jingxuan He, Lun Wang, Yangruibo Ding, Yueqi Chen, Wenbo Guo, Dawn Song
ICML 2026
|
|
DevOps-Gym: Benchmarking AI Agents in Software DevOps Cycle
Yuheng Tang, Kaijie Zhu, Bonan Ruan, Chuqi Zhang, Michael Yang, Hongwei Li, Suyue Guo, Tianneng Shi, Zekun Li, Christopher Kruegel, Giovanni Vigna, Dawn Song, William Yang Wang, Lun Wang, Yangruibo Ding, Zhenkai Liang, Wenbo Guo
ICLR 2026
|
|
Co-PatcheR: Collaborative Software Patching with Component(s)-specific Small Reasoning Models
Yuheng Tang, Hongwei Li, Kaijie Zhu, Michael Yang, Yangruibo Ding, Wenbo Guo
NeurIPS 2025
|
|
Vulnerability Detection with Code Language Models: How Far Are We?
Yangruibo Ding, Yanjun Fu, Omniyyah Ibrahim, Chawin Sitawarin, Xinyun Chen, Basel Alomair, David Wagner, Baishakhi Ray, Yizheng Chen
ICSE 2025
Adopted by Gemini-1.5 and Qwen3-Coder-Next
|
|
SemCoder: Training Code Language Models with Comprehensive Semantics Reasoning
Yangruibo Ding, Jinjun Peng, Marcus J. Min, Gail Kaiser, Junfeng Yang, Baishakhi Ray
NeurIPS 2024
|
|
TRACED: Execution-aware Pre-training for Source Code
Yangruibo Ding, Ben Steenhoek, Kexin Pei, Gail Kaiser, Wei Le, Baishakhi Ray
ICSE 2024
|
|
CrossCodeEval: A Diverse and Multilingual Benchmark for Cross-File Code Completion
Yangruibo Ding*, Zijian Wang*, Wasi Uddin Ahmad*, Hantian Ding, Ming Tan, Nihal Jain, Murali Krishna Ramanathan, Ramesh Nallapati, Parminder Bhatia, Dan Roth, Bing Xiang (* equal contribution)
NeurIPS 2023 (Datasets & Benchmarks)
Adopted by DeepSeek-Coder, Qwen2.5-Coder, StarCoder, and Augment Code
|
Honors and Awards
- IBM Ph.D. Fellowship Award. 2022-2024
- ACM SIGSOFT Distinguished Paper Award. 2023
- IEEE TSE Best Paper Award Runner-up. 2022
- Ph.D. Service Award, Columbia CS. 2025
- NSF Travel Award. 2022, 2023
- ACM SIGSOFT CAPS Travel Grant. 2023
|
Recent Talks
- Feb. - Apr. 2025: "From Code Generation Towards Software Engineering: Advancing Code Intelligence w/ Language Models" @ UW, UMD, CMU, UCLA, UTD, JHU, Georgia Tech, Stony Brook, Dartmouth, NUS.
- Oct. 2024: "Training Code Language Models with Comprehensive Semantics Reasoning" @ UIUC.
- Oct. 2024: "Semantic-aware Source Code Modeling" @ UMD, NCSU, ASE'24.
- Aug. 2024: "Training Code Language Models with Comprehensive Semantics Reasoning" @ Google DeepMind.
- Apr. 2024: "Vulnerability Detection with Code Language Models: How Far Are We?" @ Columbia SSS Seminar.
|
Academic Services
Chair / Co-Chair
Program Committee
Conference Reviewer
Journal Reviewer
|
Last Updated: Jan 2026.
Photo by Lingyi. Website Template by Jon Barron