15 References
References
Abadie, Alberto, Alexis Diamond, and Jens Hainmueller. 2010.
“Synthetic Control Methods for Comparative Case Studies:
Estimating the Effect of California’s Tobacco Control Program.”
Journal of the American Statistical Association 105 (490):
493–505.
Abadie, Alberto, and Javier Gardeazabal. 2003. “The Economic Costs
of Conflict: A Case Study of the Basque Country.” American
Economic Review 93 (1): 113–32.
Adèr, Hermanus Johannes. 2008. Advising on Research Methods: A
Consultant’s Companion. Johannes van Kessel Publishing.
Agnihotri, Apoorv, and Nipun Batra. 2020. “Exploring Bayesian
Optimization.” Distill 5 (5): e26.
Albert, Christopher G, and Katharina Rath. 2020. “Gaussian Process
Regression for Data Fulfilling Linear Differential Equations with
Localized Sources.” Entropy 22 (2): 152.
Albert, James H, and Siddhartha Chib. 1993. “Bayesian Analysis of
Binary and Polychotomous Response Data.” Journal of the
American Statistical Association 88 (422): 669–79.
Albert, Jim et al. 2009. Bayesian Computation with r. Vol. 2.
Springer.
Albert, Jim. 2022. A Course in Exploratory Data Analysis. https://bayesball.github.io/EDA/.
Albert, Jim, and Jingchen Hu. 2019. Probability and Bayesian
Modeling. Chapman; Hall/CRC.
Alcantara, Rafael, P Richard Hahn, Carlos Carvalho, and Hedibert Lopes.
2025. “Learning Conditional Average Treatment Effects in
Regression Discontinuity Designs Using Bayesian Additive Regression
Trees.” arXiv Preprint arXiv:2503.00326.
Alcantara, Rafael, Meijia Wang, P Richard Hahn, and Hedibert Lopes.
2024. “Modified BART for Learning Heterogeneous Effects in
Regression Discontinuity Designs.” arXiv Preprint
arXiv:2407.14365.
Angelopoulos, Anastasios N, and Stephen Bates. 2021. “A Gentle
Introduction to Conformal Prediction and Distribution-Free Uncertainty
Quantification.” arXiv Preprint arXiv:2107.07511.
Banks, David L, and Mevin B Hooten. 2021. “Statistical Challenges
in Agent-Based Modeling.” The American Statistician 75
(3): 235–42.
Barrett, Malcolm, Lucy McGowan D’Agostino, and Travis Gerke. 2025.
Causal Inference in r. Bookdown.
Baumer, Benjamin S, Daniel T Kaplan, and Nicholas J Horton. 2017.
Modern Data Science with r. Chapman; Hall/CRC.
Bengio, Yoshua, and Yves Grandvalet. 2003. “No Unbiased Estimator
of the Variance of k-Fold Cross-Validation.” Advances in
Neural Information Processing Systems 16.
Ben-Michael, Eli, David Arbour, Avi Feller, Alexander Franks, and Steven
Raphael. 2023. “Estimating the Effects of a California Gun Control
Program with Multitask Gaussian Processes.” The Annals of
Applied Statistics 17 (2): 985–1016.
Berisha, Visar, Chelsea Krantsevich, P Richard Hahn, Shira Hahn, Gautam
Dasarathy, Pavan Turaga, and Julie Liss. 2021. “Digital Medicine
and the Curse of Dimensionality.” NPJ Digital Medicine 4
(1): 153.
Besginow, Andreas, and Markus Lange-Hegermann. 2022. “Constraining
Gaussian Processes to Systems of Linear Ordinary Differential
Equations.” Advances in Neural Information Processing
Systems 35: 29386–99.
Binois, Mickaël, and Robert B Gramacy. 2021. “Hetgp:
Heteroskedastic Gaussian Process Modeling and Sequential Design in
r.”
Bishop, Christopher M, and Hugh Bishop. 2023. Deep Learning:
Foundations and Concepts. Springer Nature.
Blackwell, David. 1969. Basic Statistics. McGraw-Hill New York.
Bodnar, Cristian, Wessel P Bruinsma, Ana Lucic, Megan Stanley, Anna
Allen, Johannes Brandstetter, Patrick Garvan, et al. 2025. “A
Foundation Model for the Earth System.” Nature, 1–8.
Breiman, Leo. 2001. “Random Forests.” Machine
Learning 45: 5–32.
Breiman, Leo, and Jerome H Friedman. 1997. “Predicting
Multivariate Responses in Multiple Linear Regression.”
Journal of the Royal Statistical Society Series B: Statistical
Methodology 59 (1): 3–54.
Brown, Christopher. 1976. “Principal Axes and Best-Fit Planes,
with Applications.”
Brown, E Richard, Sue Holtby, Elaine Zahnd, and George B Abbott. 2005.
“Peer Reviewed: Community-Based Participatory Research in the
California Health Interview Survey.” Preventing Chronic
Disease 2 (4).
Broyden, Charles George. 1970. “The Convergence of a Class of
Double-Rank Minimization Algorithms 1. General Considerations.”
IMA Journal of Applied Mathematics 6 (1): 76–90.
Brozak, Samantha J, Sophia Peralta, Tin Phan, John D Nagy, and Yang
Kuang. 2024. “Dynamics of an LPAA Model for Tribolium Growth:
Insights into Population Chaos.” SIAM Journal on Applied
Mathematics 84 (6): 2300–2320.
Bukiet, Bruce, Elliotte Rusty Harold, and José Luis Palacios. 1997.
“A Markov Chain Approach to Baseball.” Operations
Research 45 (1): 14–23.
Butts, Kyle. n.d. “Factor Model.” https://www.kylebutts.com/blog/factor-model/.
Card, D., and A. B. Krueger. 1994. “Minimum Wages and Employment:
A Case Study of the Fast-Food Industry in New
Jersey and Pennsylvania.” American
Economic Review 84 (4): 772–93. https://www.jstor.org/stable/2118030.
Carpenter, Bob, Andrew Gelman, Matthew D Hoffman, Daniel Lee, Ben
Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li,
and Allen Riddell. 2017. “Stan: A Probabilistic Programming
Language.” Journal of Statistical Software 76: 1–32.
Carpenter, Christopher, and Carlos Dobkin. 2009. “The Effect of
Alcohol Consumption on Mortality: Regression Discontinuity Evidence from
the Minimum Drinking Age.” American Economic Journal: Applied
Economics 1 (1): 164–82.
Carriero, Alex, Kim Luijken, Anne de Hond, Karel GM Moons, Ben van
Calster, and Maarten van Smeden. 2024. “The Harms of Class
Imbalance Corrections for Machine Learning Based Prediction Models: A
Simulation Study.” arXiv Preprint arXiv:2404.19494.
Carvalho, Carlos M, Edward I George, P Richard Hahn, and Robert E
McCulloch. 2021. “Variable Selection and Interaction Detection
with Bayesian Additive Regression Trees.” In Handbook of
Bayesian Variable Selection, 395–414. Chapman; Hall/CRC.
Casella, George, and Roger Berger. 2002. Statistical Inference:
Second Edition. CRC Press.
Chari, Tara, and Lior Pachter. 2023. “The Specious Art of
Single-Cell Genomics.” PLOS Computational Biology 19
(8): e1011288.
Chase, Elizabeth C, Jeremy MG Taylor, and Philip S Boonstra. 2024.
“Modeling Basal Body Temperature Data Using Horseshoe Process
Regression.” Statistics in Medicine 43 (5): 817–32.
Chatterjee, Sourav. 2021. “A New Coefficient of
Correlation.” Journal of the American Statistical
Association 116 (536): 2009–22.
Chen, Ricky TQ, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud.
2018. “Neural Ordinary Differential Equations.”
Advances in Neural Information Processing Systems 31.
Chernick, Michael R, and Robert A Labudde. 2009. “Revisiting
Qualms about Bootstrap Confidence Intervals.” American
Journal of Mathematical and Management Sciences 29 (3-4): 437–56.
Chiles, Jean-Paul, and Pierre Delfiner. 2012. Geostatistics:
Modeling Spatial Uncertainty. Vol. 713. John Wiley & Sons.
Chipman, Hugh A and, Edward I George, and Robert E McCulloch. 2012.
“BART: Bayesian Additive Regression Trees.” Annals of
Applied Statistics 6 (1): 266–98.
Chipman, Hugh A, Edward I George, Robert E McCulloch, and Thomas S
Shively. 2022. “mBART: Multidimensional Monotone BART.”
Bayesian Analysis 17 (2): 515–44.
Chipman, Hugh, Edward I George, Robert E McCulloch, Dean P Foster, and
Robert A Stine. 2001. “The Practical Implementation of Bayesian
Model Selection.” Lecture Notes-Monograph Series,
65–134.
Chipman, Hugh, Edward George, Richard Hahn, Robert McCulloch, Matthew
Pratola, and Rodney Sparapani. 2014. “Bayesian Additive Regression
Trees, Computational Approaches.” Wiley StatsRef: Statistics
Reference Online, 1–23.
Chipman, Hugh, Pritam Ranjan, and Weiwei Wang. 2012. “Sequential
Design for Computer Experiments with a Flexible Bayesian Additive
Model.” Canadian Journal of Statistics 40 (4): 663–78.
Cinelli, Carlos, Andrew Forney, and Judea Pearl. 2021. “A Crash
Course in Good and Bad Controls.” Sociological Methods &
Research, 00491241221099552.
Cleveland, William S. 1993. Visualizing Data. Hobart press.
Clifford, P., and A. Sudbury. 1973. “A Model for Spatial
Conflict.” Biometrika 60: 581–88.
Cox, David R. 1972. “Regression Models and Life-Tables.”
Journal of the Royal Statistical Society: Series B
(Methodological) 34 (2): 187–202.
Cunningham, Scott. 2021. Causal Inference: The Mixtape. Yale
university press.
Dahl, Benjamin K, Matthew J Heaton, Richard L Warr, Jared D Fisher, and
Grant G Schultz. 2024. “Modeling Crash Risk on Roadway Networks
Using Bayesian Regression Trees.” Technometrics, no.
just-accepted: 1–17.
Dayaratna, Kevin D, and Steven J Miller. 2012. “The Pythagorean
Won-Loss Formula and Hockey: A Statistical Justification for Using the
Classic Baseball Formula as an Evaluative Tool in Hockey.”
arXiv Preprint arXiv:1208.1725.
Deshpande, Sameer K. 2024. “flexBART: Flexible Bayesian Regression
Trees with Categorical Predictors.” Journal of Computational
and Graphical Statistics, no. just-accepted: 1–18.
Deshpande, Sameer K, Ray Bai, Cecilia Balocchi, Jennifer E Starling, and
Jordan Weiss. 2020. “VCBART: Bayesian Trees for Varying
Coefficients.” arXiv Preprint arXiv:2003.06416.
Diaconis, Persi, and Sandy L Zabell. 1982. “Updating Subjective
Probability.” Journal of the American Statistical
Association 77 (380): 822–30.
Diamond, Jared. 2020. “Swing Kings: The Inside Story of Baseball’s
Home Run Revolution.” (No Title).
Doss, Hani, and Antonio Linero. 2024. “Scalable Empirical Bayes
Inference and Bayesian Sensitivity Analysis.” Statistical
Science 39 (4): 601–22.
Driscoll, Michael F. 1973. “The Reproducing Kernel Hilbert Space
Structure of the Sample Paths of a Gaussian Process.”
Zeitschrift für Wahrscheinlichkeitstheorie Und
Verwandte Gebiete 26: 309–16.
Duane, Simon, Anthony D Kennedy, Brian J Pendleton, and Duncan Roweth.
1987. “Hybrid Monte Carlo.” Physics Letters B 195
(2): 216–22.
Dukic, Vanja, Hedibert F Lopes, and Nicholas G Polson. 2012.
“Tracking Epidemics with Google Flu Trends Data and a State-Space
SEIR Model.” Journal of the American Statistical
Association 107 (500): 1410–26.
Duvenaud, David. 2014. “Automatic Model Construction with Gaussian
Processes.” PhD thesis.
Dyer, Eva L, and Konrad Kording. 2023. “Why the Simplest
Explanation Isn’t Always the Best.” Proceedings of the
National Academy of Sciences 120 (52): e2319169120.
EFRON, B. 1979. “Bootstrap Method: Another Look at the Jackknife
Method.” Annals of Statistics 7 (1): 1–26.
Egri, Gokhan, and Xinran Nicole Han. n.d. “Attention Is Kernel
Trick Reloaded.”
Elhage, Nelson, Tristan Hume, Catherine Olsson, Nicholas Schiefer, Tom
Henighan, Shauna Kravec, Zac Hatfield-Dodds, et al. 2022. “Toy
Models of Superposition.” arXiv Preprint
arXiv:2209.10652.
Elhaik, Eran. 2022. “Principal Component Analyses (PCA)-Based
Findings in Population Genetic Studies Are Highly Biased and Must Be
Reevaluated.” Scientific Reports 12 (1): 14683.
Fletcher, Roger. 1970. “A New Approach to Variable Metric
Algorithms.” The Computer Journal 13 (3): 317–22.
Fong, Edwin, Chris Holmes, and Stephen G Walker. 2023. “Martingale
Posterior Distributions.” Journal of the Royal Statistical
Society Series B: Statistical Methodology 85 (5): 1357–91.
Frisch, R. 1934. “Statistical Confluence Analysis by Means of
Complete Regression Systems (1934).” The Foundations of
Econometric Analysis, 271.
Gaffney, Jim A, Lin Yang, and Suzanne Ali. 2022. “Constraining
Model Uncertainty in Plasma Equation-of-State Models with a
Physics-Constrained Gaussian Process.” arXiv Preprint
arXiv:2207.00668.
Gamerman, Dani, and Hedibert F Lopes. 2006. Markov Chain Monte
Carlo: Stochastic Simulation for Bayesian Inference. Chapman;
Hall/CRC.
Gareth, James, Witten Daniela, Hastie Trevor, and Tibshirani Robert.
2013. An Introduction to Statistical Learning: With Applications in
r. Spinger.
Garnett, Roman. 2023. Bayesian Optimization. Cambridge
University Press.
Gelman, Andrew, John B Carlin, Aki Vehtari Stern Hal S, and Donald B
Rubin. 2013. Bayesian Data Analysis. Chapman; Hall/CRC.
Gelman, Andrew, and Cosma Rohilla Shalizi. 2013. “Philosophy and
the Practice of Bayesian Statistics.” British Journal of
Mathematical and Statistical Psychology 66 (1): 8–38.
Gelman, Andrew, Daniel Simpson, and Michael Betancourt. 2017. “The
Prior Can Often Only Be Understood in the Context of the
Likelihood.” Entropy 19 (10): 555.
George, Edward I, and Robert E McCulloch. 1993. “Variable
Selection via Gibbs Sampling.” Journal of the American
Statistical Association 88 (423): 881–89.
———. 1997. “Approaches for Bayesian Variable Selection.”
Statistica Sinica, 339–73.
Geweke, John. 1991. “Efficient Simulation from the Multivariate
Normal and Student-t Distributions Subject to Linear Constraints and the
Evaluation of Constraint Probabilities.” In Computing Science
and Statistics: Proceedings of the 23rd Symposium on the Interface,
571:578. Fairfax, Virginia: Interface Foundation of North America, Inc.
Gibson, Graham C, Nicholas G Reich, and Daniel Sheldon. 2023.
“Real-Time Mechanistic Bayesian Forecasts of COVID-19
Mortality.” The Annals of Applied Statistics 17 (3):
1801.
Gigerenzer, Gerd. 2004. “Mindless Statistics.” The
Journal of Socio-Economics 33 (5): 587–606.
Giordano, Ryan. 2025. “The Biggest Empirical Bayes Estimator in
History.” August 25, 2025. https://doi.org/10.59350/rgwda-0tv16.
Goldfarb, Donald. 1970. “A Family of Variable-Metric Methods
Derived by Variational Means.” Mathematics of
Computation 24 (109): 23–26.
Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David
Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2020.
“Generative Adversarial Networks.” Communications of
the ACM 63 (11): 139–44.
Goorbergh, Ruben van den, Maarten van Smeden, Dirk Timmerman, and Ben
Van Calster. 2022. “The Harm of Class Imbalance Corrections for
Risk Prediction Models: Illustration and Simulation Using Logistic
Regression.” Journal of the American Medical Informatics
Association 29 (9): 1525–34.
Görtler, Jochen, Rebecca Kehlbeck, and Oliver Deussen. 2019. “A
Visual Exploration of Gaussian Processes.” Distill 4
(4): e17.
Gramacy, Robert B. 2020. Surrogates: Gaussian Process Modeling,
Design, and Optimization for the Applied Sciences. Chapman;
Hall/CRC.
Gramacy, Robert B, and Herbert K H Lee. 2008. “Bayesian Treed
Gaussian Process Models with an Application to Computer
Modeling.” Journal of the American Statistical
Association 103 (483): 1119–30.
Green, Peter J. 1995. “Reversible Jump Markov Chain Monte Carlo
Computation and Bayesian Model Determination.”
Biometrika 82 (4): 711–32.
Grinsztajn, Léo, Edouard Oyallon, and Gaël Varoquaux. 2022. “Why
Do Tree-Based Models Still Outperform Deep Learning on Typical Tabular
Data?” Advances in Neural Information Processing Systems
35: 507–20.
Gu, Youyang. 2020. “COVID-19 Projections Using Machine
Learning.” Retrieved May 29: 2020.
Gubernatis, James E. 2005. “Marshall Rosenbluth and the Metropolis
Algorithm.” Physics of Plasmas 12 (5).
Hahn, P Richard, and Carlos M Carvalho. 2015. “Decoupling
Shrinkage and Selection in Bayesian Linear Models: A Posterior Summary
Perspective.” Journal of the American Statistical
Association 110 (509): 435–48.
Hahn, P Richard, Carlos M Carvalho, and Sayan Mukherjee. 2013.
“Partial Factor Modeling: Predictor-Dependent Shrinkage for Linear
Regression.” Journal of the American Statistical
Association 108 (503): 999–1008.
Hahn, P Richard, Indranil Goswami, and Carl F Mela. 2015. “A
Bayesian Hierarchical Model for Inferring Player Strategy Types in a
Number Guessing Game.”
Hahn, P Richard, Jingyu He, and Hedibert Lopes. 2018. “Bayesian
Factor Model Shrinkage for Linear IV Regression with Many
Instruments.” Journal of Business & Economic
Statistics 36 (2): 278–87.
Hahn, P Richard, and Andrew Herren. 2022. “Feature Selection in
Stratification Estimators of Causal Effects: Lessons from Potential
Outcomes, Causal Diagrams, and Structural Equations.” arXiv
Preprint arXiv:2209.11400.
Hahn, P Richard, Ryan Martin, and Stephen G Walker. 2018. “On
Recursive Bayesian Predictive Distributions.” Journal of the
American Statistical Association 113 (523): 1085–93.
Hahn, P Richard, Jared S Murray, and Carlos M Carvalho. 2020.
“Bayesian Regression Tree Models for Causal Inference:
Regularization, Confounding, and Heterogeneous Effects (with
Discussion).” Bayesian Analysis 15 (3): 965–1056.
Hahn, P. R., D. Puelz, J. He, and C. M. Carvalho. 2016.
“Regularization and Confounding in Linear Regression for Treatment
Effect Estimation.” Bayesian Analysis. https://doi.org/10.1214/16-BA1044.
Hallas, Laura, Ariq Hatibie, Rachelle Koch, Saptarshi Majumdar, Monika
Pyarali, Andrew Wood, and Thomas Hale. 2021. “Variation in US
States’ COVID-19 Policy Responses.” Blavatnik School of
Government, 2021–05.
Hamelryck, Thomas, and Kanti V Mardia. 2025. “Unfolding
AlphaFold’s Bayesian Roots in Probability Kinematics.” arXiv
Preprint arXiv:2505.19763.
Han, Lifeng, Changhan He, Huy Dinh, John Fricks, and Yang Kuang. 2022.
“Learning Biological Dynamics from Spatio-Temporal Data by
Gaussian Processes.” Bulletin of Mathematical Biology 84
(7): 69.
Hanck, Christoph, Martin Arnold, Alexander Gerber, and Martin Schmelzer.
2021. Introduction to Econometrics with r.
Universität Duisburg-Essen.
He, Jingyu, and P Richard Hahn. 2023a. “Stochastic Tree Ensembles
for Regularized Nonlinear Regression.” Journal of the
American Statistical Association 118 (541): 551–70.
———. 2023b. “Stochastic Tree Ensembles for Regularized Nonlinear
Regression.” Journal of the American Statistical
Association 118 (541): 551–70.
Heiss, Andrew. 2023. “How to Make Fancy Road Trip Maps with
R and OpenStreetMap.” June 1, 2023. https://doi.org/10.59350/rgwda-0tv16.
———. 2025. “How to Use a Histogram as a Legend in
{Ggplot2}.” February 19, 2025. https://doi.org/10.59350/gt0nr-wct91.
Herren, Andrew, and P Richard Hahn. 2020. “Semi-Supervised
Learning and the Question of True Versus Estimated Propensity
Scores.” arXiv Preprint arXiv:2009.06183.
———. 2022. “Statistical Aspects of Shap: Functional Anova for
Model Interpretation.” arXiv Preprint arXiv:2208.09970.
Herren, Drew, Richard Hahn, Jared Murray, Carlos Carvalho, and Jingyu
He. 2025. Stochtree: Stochastic Tree Ensembles (XBART and BART) for
Supervised Learning and Causal Inference. https://stochtree.ai/.
Hicks, Michael Townsen, James Humphries, and Joe Slater. 2024.
“ChatGPT Is Bullshit.” Ethics and Information
Technology 26 (2): 38.
Hill, Jennifer L. 2011. “Bayesian Nonparametric Modeling for
Causal Inference.” Journal of Computational and Graphical
Statistics 20 (1): 217–40.
Hoeting, Jennifer A, David Madigan, Adrian E Raftery, and Chris T
Volinsky. 1999. “Bayesian Model Averaging: A Tutorial (with
Comments by m. Clyde, David Draper and EI George, and a Rejoinder by the
Authors.” Statistical Science 14 (4): 382–417.
Hoff, Peter D. 2009. A First Course in Bayesian Statistical
Methods. Vol. 580. Springer.
Hogg, Robert V, Joseph W McKean, Allen T Craig, et al. 2013.
Introduction to Mathematical Statistics. Pearson Education
India.
Holland, Paul W. 1986. “Statistics and Causal Inference.”
Journal of the American Statistical Association 81 (396):
945–60.
Holley, R. A., and T. M. Liggett. 1975. “Ergodic Theorems for
Weakly Interacting Infinite Systems and the Voter Model.”
Ann. Prob. 3 (4): 643–63.
Hollmann, Noah, Samuel Müller, Lennart Purucker, Arjun Krishnakumar, Max
Körfer, Shi Bin Hoo, Robin Tibor Schirrmeister, and Frank Hutter. 2025.
“Accurate Predictions on Small Data with a Tabular Foundation
Model.” Nature 637 (8045): 319–26.
Hopp, Steven L, Michael J Owren, and Christopher S Evans. 2012.
Animal Acoustic Communication: Sound Analysis and Research
Methods. Springer Science & Business Media.
Hotelling, Harold. 1957. “The Relations of the Newer Multivariate
Statistical Methods to Factor Analysis.” British Journal of
Statistical Psychology 10 (2): 69–79.
Hvitfeldt, Emil, and Julia Silge. 2021. Supervised Machine Learning
for Text Analysis in r. Chapman; Hall/CRC.
Jaynes, Edwin T. 2003. Probability Theory: The Logic of
Science. Cambridge university press.
Jeffrey, Richard. 2004. Subjective Probability: The Real Thing.
Cambridge University Press.
Jeffrey, Richard C. 1957. Contributions to the Theory of Inductive
Probability. Princeton University.
Jidling, Carl, Niklas Wahlström, Adrian Wills, and Thomas B Schön. 2017.
“Linearly Constrained Gaussian Processes.” Advances in
Neural Information Processing Systems 30.
Johnson, Alicia A, Miles Q Ott, and Mine Dogucu. 2022. Bayes Rules!:
An Introduction to Applied Bayesian Modeling. Chapman; Hall/CRC.
Jolliffe, Ian T. 1982. “A Note on the Use of Principal Components
in Regression.” Journal of the Royal Statistical Society
Series C: Applied Statistics 31 (3): 300–303.
Kalai, Adam Tauman, Ofir Nachum, Santosh S Vempala, and Edwin Zhang.
2025. “Why Language Models Hallucinate.” arXiv Preprint
arXiv:2509.04664.
Kalbfleisch, JD, and RL Prentice. 1980. “Failure Time
Models.” In The Statistical Analysis of Failure Time
Data, 21–38. John Wiley, New York.
Kaplan, Edward L, and Paul Meier. 1958. “Nonparametric Estimation
from Incomplete Observations.” Journal of the American
Statistical Association 53 (282): 457–81.
Kohavi, Ron et al. 1995. “A Study of Cross-Validation and
Bootstrap for Accuracy Estimation and Model Selection.” In
Ijcai, 14:1137–45. 2. Montreal, Canada.
Krantsevich, Chelsea, P Richard Hahn, Yi Zheng, and Charles Katz. 2023.
“Bayesian Decision Theory for Tree-Based Adaptive Screening Tests
with an Application to Youth Delinquency.” The Annals of
Applied Statistics 17 (2): 1038–63.
Krantsevich, Nikolay. 2023. “Tree Ensemble Algorithms for Causal
Machine Learning.” PhD thesis, Arizona State University.
Krantsevich, Nikolay, Jingyu He, and P Richard Hahn. 2023.
“Stochastic Tree Ensembles for Estimating Heterogeneous
Effects.” In International Conference on Artificial
Intelligence and Statistics, 6120–31. PMLR.
Kuhn, Max, and Kjell Johnson. 2019. Feature Engineering and
Selection: A Practical Approach for Predictive Models. Chapman;
Hall/CRC.
Lanchier, Nicolas. 2017. Stochastic Modeling. Springer.
Lange-Hegermann, Markus. 2018. “Algorithmic Linearly Constrained
Gaussian Processes.” Advances in Neural Information
Processing Systems 31.
Lantz, Brett. 2019. Machine Learning with r: Expert Techniques for
Predictive Modeling. Packt publishing ltd.
Lee, Abigail J, Grace E Chesmore, Kyle A Rocha, Amanda Farah, Maryum
Sayeed, and Justin Myles. 2022. arXiv Preprint
arXiv:2203.16648.
Lei, Bowen, Tanner Quinn Kirk, Anirban Bhattacharya, Debdeep Pati,
Xiaoning Qian, Raymundo Arroyave, and Bani K Mallick. 2021.
“Bayesian Optimization with Adaptive Surrogate Models for
Automated Experimental Design.” Npj Computational
Materials 7 (1): 194.
Li, Yuelin, Elizabeth Schofield, and Mithat Gönen. 2019. “A
Tutorial on Dirichlet Process Mixture Modeling.” Journal of
Mathematical Psychology 91: 128–44.
Linero, Antonio R. 2022. “SoftBart: Soft Bayesian Additive
Regression Trees.” arXiv Preprint arXiv:2210.16375.
Linero, Antonio R, and Yun Yang. 2018. “Bayesian Regression Tree
Ensembles That Adapt to Smoothness and Sparsity.” Journal of
the Royal Statistical Society Series B: Statistical Methodology 80
(5): 1087–1110.
Llaudet, Elena, and Kosuke Imai. 2022. Data Analysis for Social
Science: A Friendly and Practical Introduction. Princeton
University Press.
Lu, Xuetao, and Robert E McCulloch. 2023. “Gaussian Processes
Correlated Bayesian Additive Regression Trees.” arXiv
Preprint arXiv:2311.18699.
Maia, Mateus, Keefe Murphy, and Andrew C Parnell. 2024. “GP-BART:
A Novel Bayesian Additive Regression Trees Approach Using Gaussian
Processes.” Computational Statistics & Data Analysis
190: 107858.
Manski, Charles F. 2009. Identification for Prediction and
Decision. Harvard University Press.
McCartan, Cory, and Kosuke Imai. 2023. “Sequential Monte Carlo for
Sampling Balanced and Compact Redistricting Plans.” The
Annals of Applied Statistics 17 (4): 3300–3323.
McCulloch, Robert E, Rodney A Sparapani, Brent R Logan, and Purushottam
W Laud. 2021. “Causal Inference with the Instrumental Variable
Approach and Bayesian Nonparametric Machine Learning.” arXiv
Preprint arXiv:2102.01199.
McElreath, Richard. 2018. Statistical Rethinking: A Bayesian Course
with Examples in r and Stan. Chapman; Hall/CRC.
Miller, Joshua B, and Adam Sanjurjo. 2018. “Surprised by the Hot
Hand Fallacy? A Truth in the Law of Small Numbers.”
Econometrica 86 (6): 2019–47.
Miller, Steven J. 2007. “A Derivation of the Pythagorean Won-Loss
Formula in Baseball.” Chance 20 (1): 40–48.
Mirzadeh, Iman, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy
Bengio, and Mehrdad Farajtabar. 2024. “Gsm-Symbolic: Understanding
the Limitations of Mathematical Reasoning in Large Language
Models.” arXiv Preprint arXiv:2410.05229.
Mohan, Arvind, Ashesh Chattopadhyay, and Jonah Miller. 2024. “What
You See Is Not What You Get: Neural Partial Differential Equations and
the Illusion of Learning.” arXiv Preprint
arXiv:2411.15101.
Molnar, Christoph. 2020. Interpretable Machine Learning. Lulu.
com.
Molnar, Christoph, and Timo Freiesleben. 2024. Supervised Machine
Learning for Science: How to Stop Worrying and Love Your Black Box.
Christoph Molnar.
Monti, Corrado, Marco Pangallo, Gianmarco De Francisci Morales, and
Francesco Bonchi. 2023. “On Learning Agent-Based Models from
Data.” Scientific Reports 13 (1): 9268.
Morgan, SL. 2015. Counterfactuals and Causal Inference.
Cambridge University Press.
Murphy, Kevin P. 2012. Machine Learning: A Probabilistic
Perspective. MIT press.
Murray, Jared S. 2021. “Log-Linear Bayesian Additive Regression
Trees for Multinomial Logistic and Count Regression Models.”
Journal of the American Statistical Association 116 (534):
756–69.
Neal, Radford M. 1996. “Monte Carlo Implementation.” In
Bayesian Learning for Neural Networks, 55–98. Springer.
Nelder, John A, and Roger Mead. 1965. “A Simplex Method for
Function Minimization.” The Computer Journal 7 (4):
308–13.
Newman, Mark. 2013. “Computational Physics.” (No
Title).
Nguyen, Mike. 2020. A Guide on Data Analysis: From Basics to Causal
Inference. Bookdown.
Nikolaou, Michael. 2022. “Revisiting the Standard for Modeling the
Spread of Infectious Diseases.” Scientific Reports 12
(1): 7077.
Nuzzo, Regina. 2014. “Scientific Method: Statistical
Errors.” Nature 506 (7487).
Onyper, Serge V, Pamela V Thacher, Jack W Gilbert, and Samuel G Gradess.
2012. “Class Start Times, Sleep, and Academic Performance in
College: A Path Analysis.” Chronobiology International
29 (3): 318–35.
Orlandi, Vittorio, Jared Murray, Antonio Linero, and Alexander
Volfovsky. 2021. “Density Regression with Bayesian Additive
Regression Trees.” arXiv Preprint arXiv:2112.12259.
Papakostas, Demetrios, P Richard Hahn, Jared Murray, Frank Zhou, and
Joseph Gerakos. 2023. “Do Forecasts of Bankruptcy Cause
Bankruptcy? A Machine Learning Sensitivity Analysis.” The
Annals of Applied Statistics 17 (1): 711–39.
Pearl, Judea. 2009. Causality. Cambridge university press.
———. 2022. “Causal Diagrams for Empirical Research (with
Discussions).” In Probabilistic and Causal Inference: The
Works of Judea Pearl, 255–316.
Pell, Bruce, Samantha Brozak, Tin Phan, Fuqing Wu, and Yang Kuang. 2023.
“The Emergence of a Virus Variant: Dynamics of a Competition Model
with Cross-Immunity Time-Delay Validated by Wastewater Surveillance Data
for COVID-19.” Journal of Mathematical Biology 86 (5):
63.
Postman, Marc, John Peter Huchra, and Margaret J Geller. 1986.
“Probes of Large-Scale Structure in the Corona Borealis
Region.” Astronomical Journal (ISSN 0004-6256), Vol. 92, Dec.
1986, p. 1238-1247. 92: 1238–47.
Powers, Scott. 2025. sabRmetrics: Query statsapi,baseballsavant.mlb.com and Fit
Fundamental Sabermetric Models. https://github.com/saberpowers/sabRmetrics.
Pratola, Matthew T, Hugh A Chipman, Edward I George, and Robert E
McCulloch. 2020. “Heteroscedastic BART via Multiplicative
Regression Trees.” Journal of Computational and Graphical
Statistics 29 (2): 405–17.
Quiroga, Miriana, Pablo G Garay, Juan M Alonso, Juan Martin Loyola, and
Osvaldo A Martin. 2022. “Bayesian Additive Regression Trees for
Probabilistic Programming.” arXiv Preprint
arXiv:2206.03619.
Raissi, Maziar, Paris Perdikaris, and George Em Karniadakis. 2017.
“Machine Learning of Linear Differential Equations Using Gaussian
Processes.” Journal of Computational Physics 348:
683–93.
———. 2018. “Numerical Gaussian Processes for Time-Dependent and
Nonlinear Partial Differential Equations.” SIAM Journal on
Scientific Computing 40 (1): 172–98.
Rappold, Ana Grohovac, Michael Lavine, and Susan Lozier. 2007.
“Subjective Likelihood for the Assessment of Trends in the Ocean’s
Mixed-Layer Depth.” Journal of the American Statistical
Association 102 (479): 771–80.
Rencher, Alvin C, and G Bruce Schaalje. 2008. Linear Models in
Statistics. John Wiley & Sons.
Ressel, S, JJ Ruby, GW Collins, and JR Rygg. 2022. “Density
Reconstruction in Convergent High-Energy-Density Systems Using x-Ray
Radiography and Bayesian Inference.” Physics of Plasmas
29 (7).
Roeder, Kathryn. 1990. “Density Estimation with Confidence Sets
Exemplified by Superclusters and Voids in the Galaxies.”
Journal of the American Statistical Association 85 (411):
617–24.
Roeder, Kathryn, and Larry Wasserman. 1997. “Practical Bayesian
Density Estimation Using Mixtures of Normals.” Journal of the
American Statistical Association 92 (439): 894–902.
Rosenbaum, P. R., and D. B. Rubin. 1983. “The Central Role of the
Propensity Score in Observational Studies for Causal Effects.”
Biometrika 70: 41–55.
Rossi, Peter E, Greg M Allenby, and Robert E McCulloch. 2003.
“Bayesian Statistics and Marketing.” Marketing
Science 22 (3): 304–28.
Sauer, Annie, Robert B Gramacy, and David Higdon. 2023. “Active
Learning for Deep Gaussian Process Surrogates.”
Technometrics 65 (1): 4–18.
Schaeffer, Rylan. 2023. “Pretraining on the Test Set Is All You
Need.” arXiv Preprint arXiv:2309.08632.
Schenker, Nathaniel. 1985. “Qualms about Bootstrap Confidence
Intervals.” Journal of the American Statistical
Association 80 (390): 360–61.
Senn, Stephen, Erika Graf, and Angelika Caputo. 2007.
“Stratification for the Propensity Score Compared with Linear
Regression Techniques to Assess the Effect of Treatment or
Exposure.” Statistics in Medicine 26 (30): 5529–44.
Shah, Amar, Andrew Wilson, and Zoubin Ghahramani. 2014. “Student-t
Processes as Alternatives to Gaussian Processes.” In
Artificial Intelligence and Statistics, 877–85. PMLR.
Shalizi, Cosma. 2013. “Advanced Data Analysis from an Elementary
Point of View.”
Shalizi, Cosma Rohilla. 2021. “A Note on Simulation-Based
Inference by Matching Random Features.” arXiv Preprint
arXiv:2111.09220.
Shalizi, Cosma Rohilla, and Andrew C Thomas. 2011. “Homophily and
Contagion Are Generically Confounded in Observational Social Network
Studies.” Sociological Methods & Research 40 (2):
211–39.
Shanno, David F. 1970. “Conditioning of Quasi-Newton Methods for
Function Minimization.” Mathematics of Computation 24
(111): 647–56.
Shi, Yuge. 2019. “Gaussian Processes, Not Quite for
Dummies.” The Gradient.
Silver, Nate. 2012. The Signal and the Noise: Why so Many
Predictions Fail-but Some Don’t. Penguin.
Simpson, Dan. 2021. “Yes but What Is a Gaussian
Process? Or, Once, Twice, Three Times a Definition; or
A Descent into Madness.” November 3, 2021. https://dansblog.netlify.app/yes-but-what-is-a-gaussian-process-or-once-twice-three-times-a-definition-or-a-descent-into-madness.
Solak, Ercan, Roderick Murray-Smith, WE Leithead, Douglas Leith, and
Carl Rasmussen. 2002. “Derivative Observations in Gaussian Process
Models of Dynamic Systems.” Advances in Neural Information
Processing Systems 15.
Sparapani, Rodney A, Brent R Logan, Martin J Maiers, Purushottam W Laud,
and Robert E McCulloch. 2023. “Nonparametric Failure Time:
Time-to-Event Machine Learning with Heteroskedastic Bayesian Additive
Regression Trees and Low Information Omnibus Dirichlet Process
Mixtures.” Biometrics 79 (4): 3023–37.
Starling, Jennifer E, Jared S Murray, Carlos M Carvalho, Radek K
Bukowski, and James G Scott. 2020. “BART with Targeted Smoothing:
An Analysis of Patient-Specific Stillbirth Risk.”
Sudhakar, Tarini, Ashna Bhansali, John Walkington, and David Puelz.
2024. “The Disutility of Compartmental Model Forecasts During the
COVID-19 Pandemic.” Frontiers in Epidemiology 4:
1389617.
Sueur, Jérôme, Thierry Aubin, and Caroline Simonis. 2008.
“Seewave, a Free Modular Tool for Sound Analysis and
Synthesis.” Bioacoustics 18 (2): 213–26.
Swiler, Laura P, Mamikon Gulian, Ari L Frankel, Cosmin Safta, and John D
Jakeman. 2020. “A Survey of Constrained Gaussian Process
Regression: Approaches and Implementation Challenges.”
Journal of Machine Learning for Modeling and Computing 1 (2).
Tan, Yaoyuan Vincent, Carol AC Flannagan, and Michael R Elliott. 2018.
“Predicting Human-Driving Behavior to Help Driverless Vehicles
Drive: Random Intercept Bayesian Additive Regression Trees.”
Statistics and Its Interface 11 (4): 557–72.
Tan, Yaoyuan Vincent, and Jason Roy. 2019. “Bayesian Additive
Regression Trees and the General BART Model.” Statistics in
Medicine 38 (25): 5048–69.
Tarone, Robert E. 1982. “The Use of Historical Control Information
in Testing for a Trend in Proportions.” Biometrics,
215–20.
Tarzanagh, Davoud Ataee, Yingcong Li, Christos Thrampoulidis, and Samet
Oymak. 2023. “Transformers as Support Vector Machines.”
arXiv Preprint arXiv:2308.16898.
Terfloth, Lothar, and Johann Gasteiger. 2001. “Neural Networks and
Genetic Algorithms in Drug Design.” Drug Discovery Today
6: 102–8.
Thal, Dan RC, and Mariel M Finucane. 2023. “Causal Methods
Madness: Lessons Learned from the 2022 ACIC Competition to Estimate
Health Policy Impacts.” Observational Studies 9 (3):
3–27.
Thomay, Cal D. 2014. “Markov Chain Theory with Applications to
Baseball.”
Vaswani, A. 2017. “Attention Is All You Need.” Advances
in Neural Information Processing Systems.
Wachter, Sandra, Brent Mittelstadt, and Chris Russell. 2017.
“Counterfactual Explanations Without Opening the Black Box:
Automated Decisions and the GDPR.” Harv. JL & Tech.
31: 841.
Wahba, Grace. 1973. “A Class of Approximate Solutions to Linear
Operator Equations.” Journal of Approximation Theory 9
(1): 61–77.
Wang, Meijia, Jingyu He, and P Richard Hahn. 2024. “Local Gaussian
Process Extrapolation for BART Models with Applications to Causal
Inference.” Journal of Computational and Graphical
Statistics 33 (2): 724–35.
Wang, Meijia, Ignacio Martinez, and P Richard Hahn. 2024.
“LongBet: Heterogeneous Treatment Effect Estimation in Panel
Data.” arXiv Preprint arXiv:2406.02530.
Wasserman, Larry. 2004. All of Statistics: A Concise Course in
Statistical Inference. Springer Science & Business Media.
———. 2006. All of Nonparametric Statistics. Springer Science
& Business Media.
Whitehead, Thomas M. 2025. “Beyond What’s Normal: Bimodal and
Heaviside Alternatives to Gaussian Process Regression.”
Machine Learning 114 (12): 286.
Williams, Christopher KI, and Carl Edward Rasmussen. 2006. Gaussian
Processes for Machine Learning. Vol. 2. 3. MIT press Cambridge, MA.
Woody, C., S. Carvalho, P. R. Hahn, and J. Murray. 2020.
“Estimating Heterogeneous Effects of Continuous Exposures Using
Bayesian Tree Ensembles: Revisiting the Impact of Abortion Rates on
Crime.” Arxiv Preprint.
Woody, Spencer, Carlos M Carvalho, and Jared S Murray. 2021.
“Model Interpretation Through Lower-Dimensional Posterior
Summarization.” Journal of Computational and Graphical
Statistics 30 (1): 144–61.
Woolridge, J. 2010. Econometric Analysis of Cross Section and Panel
Data. Cambridge, Massachusetts: Massachusetts Institute of
Technology.
Xin, Xi, Fei Huang, and Giles Hooker. 2024. “Why You Should Not
Trust Interpretations in Machine Learning: Adversarial Attacks on
Partial Dependence Plots.” arXiv Preprint
arXiv:2404.18702.
Yang, Xiu, David Barajas-Solano, Guzel Tartakovsky, and Alexandre M
Tartakovsky. 2019. “Physics-Informed CoKriging: A
Gaussian-Process-Regression-Based Multifidelity Method for Data-Model
Convergence.” Journal of Computational Physics 395:
410–31.
Yang, Xiu, Guzel Tartakovsky, and Alexandre Tartakovsky. 2018.
“Physics-Informed Kriging: A Physics-Informed Gaussian Process
Regression Method for Data-Model Convergence.” arXiv Preprint
arXiv:1809.03461.
Zeitler, Jakob, Athanasios Vlontzos, and Ciarán Mark Gilligan-Lee. 2023.
“Non-Parametric Identifiability and Sensitivity Analysis of
Synthetic Control Models.” In Conference on Causal Learning
and Reasoning, 850–65. PMLR.
Zellner, Arnold. 1962. “An Efficient Method of Estimating
Seemingly Unrelated Regressions and Tests for Aggregation Bias.”
Journal of the American Statistical Association 57 (298):
348–68.
Zhang, Lawrence J. 2021. “A Friendly Introduction to Compressed
Sensing.”
Zhou, Shuang, P Giulani, J Piekarewicz, Anirban Bhattacharya, and
Debdeep Pati. 2019. “Reexamining the Proton-Radius Problem Using
Constrained Gaussian Processes.” Physical Review C 99
(5): 055202.
