
On the 2025–26 academic job market, seeking tenure-track positions beginning Fall 2026.
CV Research Statement
I work on AI for society through the science of valid measurement and prediction of AI capabilities and risks, and the development of methods to ensure their reliability under real-world conditions. This enables us to anticipate failures before deployment and ensure reliable behavior under real-world distribution shifts, with translational impact in domains such as healthcare.
Olawale (Wale) Salaudeen is an AI Center Fellow in Residence at Schmidt Sciences, a Postdoctoral Researcher at MIT (Healthy ML Lab, led by Prof. Marzyeh Ghassemi), and a Postdoctoral Scholar at the Eric and Wendy Schmidt Center at the Broad Institute of MIT and Harvard. Before his postdoctoral positions, he earned a Ph.D. in Computer Science from the University of Illinois at Urbana-Champaign and the Stanford Trustworthy AI Research (STAIR) Lab at Stanford University, advised by Prof. Sanmi Koyejo.
He works on AI for society through the science of valid measurement and prediction of AI capabilities and risks, and the development of methods to ensure their reliability under real-world conditions. This enables us to anticipate failures before deployment and ensure reliable behavior under real-world distribution shifts, with translational impact in domains such as healthcare.
He has received a Sloan Scholarship, a Beckman Graduate Research Fellowship, a GEM Associate Fellowship, and an NSF Miniature Brain Machinery Traineeship. He was recognized with a Best Paper Award at the NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle, and was named a Top Area Chair at NeurIPS 2025. He has interned at Sandia National Laboratories (w/ Dr. Eric Goodman), Google Brain (w/ Dr. Alex D’Amour), Cruise LLC, and the Max Planck Institute for Intelligent Systems (w/ Dr. Moritz Hardt).
He received a Bachelor of Science in Mechanical Engineering with minors in Computer Science and Mathematics from Texas A&M University.
Olawale (Wale) Salaudeen is an AI Center Postdoctoral Fellow in Residence at Schmidt Sciences, a Postdoctoral Researcher at MIT in the Healthy ML Lab led by Prof. Marzyeh Ghassemi, and a Postdoctoral Scholar at the Eric and Wendy Schmidt Center at the Broad Institute of MIT and Harvard. He works on AI for society through the science of valid measurement and prediction of AI capabilities and risks, and the development of methods to ensure their reliability under real-world conditions. This enables us to anticipate failures before deployment and ensure reliable behavior under real-world distribution shifts, with translational impact in domains such as healthcare. He completed his Ph.D. in Computer Science at the University of Illinois at Urbana-Champaign and the Stanford STAIR Lab, advised by Prof. Sanmi Koyejo. His work has been supported by several honors, including the Sloan Scholarship, Beckman Graduate Research Fellowship, and GEM Associate Fellowship. He has interned at Google Brain, Cruise, and the Max Planck Institute for Intelligent Systems, and holds a B.S. in Mechanical Engineering from Texas A&M University.
Olawale (Wale) Salaudeen is an AI Center Postdoctoral Fellow in Residence at Schmidt Sciences, a Postdoctoral Researcher at MIT and the Broad Institute of MIT and Harvard. He works on AI for society through the measurement and control of AI capabilities and risks. He earned his Ph.D. in Computer Science from the University of Illinois at Urbana-Champaign and the Stanford STAIR Lab.
Selected Honors
Schmidt Sciences AI Center Fellowship • Best Paper, NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle • NeurIPS 2025 Top Area Chair • NYU Tandon Faculty First-Look Fellow • Georgia Tech FOCUS Fellow • Sloan Scholarship • Beckman Graduate Research Fellowship • GEM Associate Fellowship • NSF Miniature Brain Machinery Traineeship • ICML 2022 Top Reviewer (10%)
Selected Research Directions
My research develops the science of understanding, measuring, and improving the reliability of AI systems under real-world change. I work across three interconnected themes:
Valid Measurement and Prediction of AI Capabilities and Risks
AI systems exhibit jagged intelligence—they excel at some tasks but fail at others that share a common human capability. I develop measurements of AI-specific latent traits, capabilities, and risks to enable less jagged, more predictable behaviors across real-world settings.
- AI Construct Lexis
- Measurement to Meaning: A Validity-Centered Framework for AI Evaluation. In Review (preliminary in NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle)
- Toward an Evaluation Science for Generative AI Systems. The Bridge, NAE 2025
- ImageNot: A Contrast with ImageNet Preserves Model Rankings. TMLR 2026 (To appear)
- On Evaluating Methods vs. Evaluating Models. NeurIPS 2025 Workshop on Evaluating the Evolving LLM Lifecycle (Best Paper)
Characterizing and Intervening on Causal and Spurious Mechanisms of AI Behavior
AI systems often rely on spurious correlations, performing well in training environments but failing when conditions shift. My work identifies and disentangles causal from spurious mechanisms so models rely on stable, causal patterns instead.
- Aggregation Hides OOD Generalization Failures from Spurious Correlations. NeurIPS 2025 (Spotlight)
- Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified? TMLR 2025 (J2C) + ICLR 2026 Journal Track
- Causally Inspired Regularization Enables Domain General Representations. AISTATS 2024
- On Domain Generalization Datasets as Proxy Benchmarks for Causal Representation Learning. NeurIPS 2024 CRL Workshop (Oral)
Inference-Time Adaptation of AI Systems
AI behaviors often become unreliable when deployment conditions change. I develop methods that use proxy variables and concept-based representations to adapt model behavior at inference time, without retraining.
Selected Recent News
Older News
Fun Facts
I was born in Nigeria and moved to Dallas, Texas at a young age. I played basketball and water polo in high school. I'm a loyal (if perpetually disappointed) Dallas sports fan, a cinephile, a social dancer (Latin and swing mostly), and a regular at live standup comedy. I'm full of hot takes, most of which I wouldn't die on, but am always eager to share and defend for fun. I also once won an intramural cornhole championship.