I am a Flatiron Research Fellow, at Flatiron Institute, Center for Computational Mathematics. My general research interest lies in Bayesian computation, Bayesian modeling, machine learning, and causal inference.
Before that, I obtained my PhD in Statistics from Columbia University, where I was advised by Andrew Gelman. My thesis includes a collection of methodology developments Toward a Scalable Bayesian Workflow. Before that, I obtained my undergraduate education from Tsinghua University in Mathematics and in Economics.
• My ultimate goal is to develop a scalable Bayesian workflow for open-ended real data problems. Past applications include the lead fallout in Pairs, arsenic diffusion in groundwater, and Covid-19 mortality in Bangladesh.
• But to do better applied statistics needs better methodology development. To that end, I investigate statistical and machine learning methods, with a focuse on model evaluation and aggregation, meta-learning and causal inference. Some ongoing progresses are on cross-validation, stacking, and covariate imbalance.
• But to facilitate complex methods further needs scalable and diagnosable computing. Hence, I develop algorithms and theoreis for fully Bayesian and approximate computations. Recently at Flatiron, I am interested in combining Monte Carlo methods with sophisticated numerical tricks or quadratures, from which the applications include importance sampling, tempeture in Bayeisan infererce, simulated tempering and annealing, free energy estimation, and metastability in MCMC.
A scalable Bayesian workflow needs the combination of fast but reliable computing, efficient but targeted model evaluation, and extensive but directed model building and expansion.
With the input-varying yet partially-pooled model weights, hierarchical stacking improves average and conditional predictions. Our Bayesian formulation includes constant-weight (complete-pooling) stacking as a special case.
Yuling Yao, Collin Cademartori, Aki Vehtari, Andrew Gelman.  Adaptive Path Sampling in Metastable Posterior Distributions. under review.
[preprint] [Package] [Blog]
From importance sampling to adaptive importance sampling to path sampling to adaptive path sampling, and from Rao–Blackwell to Wang-Landau to Jarzynski-Crook: all about free energy and simulated tempering
Yuling Yao, Aki Vehtari, Andrew Gelman.  Stacking for Non-mixing Bayesian Computations: The Curse and Blessing of
Multimodal Posteriors. under review.
[preprint] [Code] [Blog]
The result from multi-chain stacking is not necessarily equivalent, even asymptotically, to fully Bayesian inference, but it serves many of the same goals. Under misspecified models, stacking can give better predictive performance than full Bayesian inference, hence the multimodality can be considered a blessing rather than a curse.
This does not mean that we think Bayesian inference is a bad idea, but it does mean that there is a tension between Bayesian logic and Bayesian workflow which we believe can only be resolved by considering Bayesian logic as a tool, a way of revealing inevitable misfits and incoherences in our model assumptions, rather than as an end in itself.
How to run importance sampling with effieiciency and reassurance
[Online] [Blog] [Code]
The Pareto-smoothed importance sampling diagnostic gives a goodness of fit measurement for joint variational approximtion, while simultaneously improving the error in the estimate.
[Online] [Code] [R package]
"Remember that using Bayes' Theorem doesn't make you a Bayesian. Quantifying uncertainty with probability makes you a Bayesian."
Prabhat Barnwal, Yuling Yao (equal contribution), Yiqian Wang, Nishat Akter Juy, Shabib Raihan,
Mohammad Ashraful Haque, Alexander van Geen. 
No excess mortality detected in rural Bangladesh in 2020 from repeated surveys of a population of 81000. preprint.
Yuling Yao, Rajib Mozumder, Benjamin Bostick, Brian Mailloux, Charles Harvey, Andrew Gelman, Alexander van Geen. 
Making the most of imprecise measurements: Changing patterns of arsenic concentrations in shallow wells of Bangladesh from laboratory and field data. preprint.
Imprecise but widely-accessible field kit tests in companion with flexible statistical modeling that facilitates this open-ended data gathering can provide a balance between total cost and accuracy in many areas of geoscience research and policy.
Andrew Gelman, Aki Vehtari, Daniel Simpson, Charles Margossian, Bob Carpenter, Yuling Yao, Paul-Christian Bürkner, Lauren Kennedy, Jonah Gabry, Martin Modrák. 
Bayesian workflow. preprint.
Theoretical statistics indeed is the theory of applied statistics.
Alexander van Geen, Yuling Yao, Tyler Ellis, Andrew Gelman.  Fallout of Lead over Paris from the 2019 Notre-Dame Cathedral Fire. Geohealth .
[Online] [Code] [Media coverage (Le Monde)] [Media coverage 2]
How much lead was there after the fire?
running BNN on ImageNet: more expressive than MC-Dropout, more affordable than meanfield VI
Maarten Marsman, Felix D Schönbrodt, Richard D Morey, Yuling Yao, Andrew Gelman, Eric-Jan Wagenmakers  A Bayesian bird's eye view of ‘Replications of important results in social psychology’. Royal Society Open Science,4,160426.