We scale Cascade RL and multi-domain on-policy distillation to train Nemotron-Cascade-2-30B-A3B, which achieves gold-medal-level results on IMO, IOI, and ICPC World Finals, along with strong performance on ArenaHard and instruction following.
Research Scientist @ NVIDIA
I am a Research Scientist at NVIDIA (ADLR Group). I received my Ph.D. from Georgia Tech in August 2024.
I currently work on scaling reinforcement learning for reasoning LLMs.
We scale Cascade RL to train general-purpose reasoning LLMs spanning RLHF, instruction following, math, code, and SWE. Our 14B Thinking model achieves SOTA on LiveCodeBench, outperforming Gemini-2.5-Pro, o4-mini, Qwen3-235B, and DeepSeek-R1-671B, and reaches silver-medal performance at IOI 2025.
We further scaled SFT, studied its interplay with RL, and released the SOTA 7B model, AceReason-Nemotron-1.1-7B.
We extended the previous work to both the math and code domains and released the SOTA medium-sized model, AceReason-Nemotron-14B.
Our pilot study on scaling RL for competitive math reasoning LLMs. We released the AceMath-RL-Nemotron-7B model.
We developed math reasoning and reward models, releasing AceMath-1.5B, 7B, and 72B along with AceMath-RM-7B and 72B, which surpassed GPT-4o and Qwen2.5-Math at the time.
Research · 2022–2024
Multimodal LLM
Building AI that understands the visual world
Research · 2019–2024
Multilingual LLM
Bridging representation across global languages