Gerald Shen
RL Research @ NVIDIA

I am a Research Scientist working on reinforcement learning at NVIDIA. I completed my HBSc at the University of Toronto, where I studied computer science and evolutionary and human biology. During my undergraduate studies, I was fortunate to work with Marzyeh Ghassemi and Sheldon Huang on unsupervised out-of-distribution detection.
My research interests center on RL for LLMs, from systems to algorithms. I am involved in post-training of Nemotron models and was the core developer of NeMo-Aligner.
Research
-
In First Conference on Language Modeling 2024
-
In Advances in Neural Information Processing Systems 2024
-
In The Exploration in AI Today Workshop at ICML 2025