15  References

References

Abadie, Alberto, Alexis Diamond, and Jens Hainmueller. 2010. “Synthetic Control Methods for Comparative Case Studies: Estimating the Effect of California’s Tobacco Control Program.” Journal of the American Statistical Association 105 (490): 493–505.
Abadie, Alberto, and Javier Gardeazabal. 2003. “The Economic Costs of Conflict: A Case Study of the Basque Country.” American Economic Review 93 (1): 113–32.
Adèr, Hermanus Johannes. 2008. Advising on Research Methods: A Consultant’s Companion. Johannes van Kessel Publishing.
Agnihotri, Apoorv, and Nipun Batra. 2020. “Exploring Bayesian Optimization.” Distill 5 (5): e26.
Albert, Christopher G, and Katharina Rath. 2020. “Gaussian Process Regression for Data Fulfilling Linear Differential Equations with Localized Sources.” Entropy 22 (2): 152.
Albert, James H, and Siddhartha Chib. 1993. “Bayesian Analysis of Binary and Polychotomous Response Data.” Journal of the American Statistical Association 88 (422): 669–79.
Albert, Jim et al. 2009. Bayesian Computation with r. Vol. 2. Springer.
Albert, Jim. 2022. A Course in Exploratory Data Analysis. https://bayesball.github.io/EDA/.
Albert, Jim, and Jingchen Hu. 2019. Probability and Bayesian Modeling. Chapman; Hall/CRC.
Alcantara, Rafael, P Richard Hahn, Carlos Carvalho, and Hedibert Lopes. 2025. “Learning Conditional Average Treatment Effects in Regression Discontinuity Designs Using Bayesian Additive Regression Trees.” arXiv Preprint arXiv:2503.00326.
Alcantara, Rafael, Meijia Wang, P Richard Hahn, and Hedibert Lopes. 2024. “Modified BART for Learning Heterogeneous Effects in Regression Discontinuity Designs.” arXiv Preprint arXiv:2407.14365.
Angelopoulos, Anastasios N, and Stephen Bates. 2021. “A Gentle Introduction to Conformal Prediction and Distribution-Free Uncertainty Quantification.” arXiv Preprint arXiv:2107.07511.
Banks, David L, and Mevin B Hooten. 2021. “Statistical Challenges in Agent-Based Modeling.” The American Statistician 75 (3): 235–42.
Barrett, Malcolm, Lucy McGowan D’Agostino, and Travis Gerke. 2025. Causal Inference in r. Bookdown.
Baumer, Benjamin S, Daniel T Kaplan, and Nicholas J Horton. 2017. Modern Data Science with r. Chapman; Hall/CRC.
Bengio, Yoshua, and Yves Grandvalet. 2003. “No Unbiased Estimator of the Variance of k-Fold Cross-Validation.” Advances in Neural Information Processing Systems 16.
Ben-Michael, Eli, David Arbour, Avi Feller, Alexander Franks, and Steven Raphael. 2023. “Estimating the Effects of a California Gun Control Program with Multitask Gaussian Processes.” The Annals of Applied Statistics 17 (2): 985–1016.
Berisha, Visar, Chelsea Krantsevich, P Richard Hahn, Shira Hahn, Gautam Dasarathy, Pavan Turaga, and Julie Liss. 2021. “Digital Medicine and the Curse of Dimensionality.” NPJ Digital Medicine 4 (1): 153.
Besginow, Andreas, and Markus Lange-Hegermann. 2022. “Constraining Gaussian Processes to Systems of Linear Ordinary Differential Equations.” Advances in Neural Information Processing Systems 35: 29386–99.
Binois, Mickaël, and Robert B Gramacy. 2021. “Hetgp: Heteroskedastic Gaussian Process Modeling and Sequential Design in r.”
Bishop, Christopher M, and Hugh Bishop. 2023. Deep Learning: Foundations and Concepts. Springer Nature.
Blackwell, David. 1969. Basic Statistics. McGraw-Hill New York.
Bodnar, Cristian, Wessel P Bruinsma, Ana Lucic, Megan Stanley, Anna Allen, Johannes Brandstetter, Patrick Garvan, et al. 2025. “A Foundation Model for the Earth System.” Nature, 1–8.
Breiman, Leo. 2001. “Random Forests.” Machine Learning 45: 5–32.
Breiman, Leo, and Jerome H Friedman. 1997. “Predicting Multivariate Responses in Multiple Linear Regression.” Journal of the Royal Statistical Society Series B: Statistical Methodology 59 (1): 3–54.
Brown, Christopher. 1976. “Principal Axes and Best-Fit Planes, with Applications.”
Brown, E Richard, Sue Holtby, Elaine Zahnd, and George B Abbott. 2005. “Peer Reviewed: Community-Based Participatory Research in the California Health Interview Survey.” Preventing Chronic Disease 2 (4).
Broyden, Charles George. 1970. “The Convergence of a Class of Double-Rank Minimization Algorithms 1. General Considerations.” IMA Journal of Applied Mathematics 6 (1): 76–90.
Brozak, Samantha J, Sophia Peralta, Tin Phan, John D Nagy, and Yang Kuang. 2024. “Dynamics of an LPAA Model for Tribolium Growth: Insights into Population Chaos.” SIAM Journal on Applied Mathematics 84 (6): 2300–2320.
Bukiet, Bruce, Elliotte Rusty Harold, and José Luis Palacios. 1997. “A Markov Chain Approach to Baseball.” Operations Research 45 (1): 14–23.
Butts, Kyle. n.d. “Factor Model.” https://www.kylebutts.com/blog/factor-model/.
Card, D., and A. B. Krueger. 1994. “Minimum Wages and Employment: A Case Study of the Fast-Food Industry in New Jersey and Pennsylvania.” American Economic Review 84 (4): 772–93. https://www.jstor.org/stable/2118030.
Carpenter, Bob, Andrew Gelman, Matthew D Hoffman, Daniel Lee, Ben Goodrich, Michael Betancourt, Marcus Brubaker, Jiqiang Guo, Peter Li, and Allen Riddell. 2017. “Stan: A Probabilistic Programming Language.” Journal of Statistical Software 76: 1–32.
Carpenter, Christopher, and Carlos Dobkin. 2009. “The Effect of Alcohol Consumption on Mortality: Regression Discontinuity Evidence from the Minimum Drinking Age.” American Economic Journal: Applied Economics 1 (1): 164–82.
Carriero, Alex, Kim Luijken, Anne de Hond, Karel GM Moons, Ben van Calster, and Maarten van Smeden. 2024. “The Harms of Class Imbalance Corrections for Machine Learning Based Prediction Models: A Simulation Study.” arXiv Preprint arXiv:2404.19494.
Carvalho, Carlos M, Edward I George, P Richard Hahn, and Robert E McCulloch. 2021. “Variable Selection and Interaction Detection with Bayesian Additive Regression Trees.” In Handbook of Bayesian Variable Selection, 395–414. Chapman; Hall/CRC.
Casella, George, and Roger Berger. 2002. Statistical Inference: Second Edition. CRC Press.
Chari, Tara, and Lior Pachter. 2023. “The Specious Art of Single-Cell Genomics.” PLOS Computational Biology 19 (8): e1011288.
Chase, Elizabeth C, Jeremy MG Taylor, and Philip S Boonstra. 2024. “Modeling Basal Body Temperature Data Using Horseshoe Process Regression.” Statistics in Medicine 43 (5): 817–32.
Chatterjee, Sourav. 2021. “A New Coefficient of Correlation.” Journal of the American Statistical Association 116 (536): 2009–22.
Chen, Ricky TQ, Yulia Rubanova, Jesse Bettencourt, and David K Duvenaud. 2018. “Neural Ordinary Differential Equations.” Advances in Neural Information Processing Systems 31.
Chernick, Michael R, and Robert A Labudde. 2009. “Revisiting Qualms about Bootstrap Confidence Intervals.” American Journal of Mathematical and Management Sciences 29 (3-4): 437–56.
Chiles, Jean-Paul, and Pierre Delfiner. 2012. Geostatistics: Modeling Spatial Uncertainty. Vol. 713. John Wiley & Sons.
Chipman, Hugh A and, Edward I George, and Robert E McCulloch. 2012. “BART: Bayesian Additive Regression Trees.” Annals of Applied Statistics 6 (1): 266–98.
Chipman, Hugh A, Edward I George, Robert E McCulloch, and Thomas S Shively. 2022. “mBART: Multidimensional Monotone BART.” Bayesian Analysis 17 (2): 515–44.
Chipman, Hugh, Edward I George, Robert E McCulloch, Dean P Foster, and Robert A Stine. 2001. “The Practical Implementation of Bayesian Model Selection.” Lecture Notes-Monograph Series, 65–134.
Chipman, Hugh, Edward George, Richard Hahn, Robert McCulloch, Matthew Pratola, and Rodney Sparapani. 2014. “Bayesian Additive Regression Trees, Computational Approaches.” Wiley StatsRef: Statistics Reference Online, 1–23.
Chipman, Hugh, Pritam Ranjan, and Weiwei Wang. 2012. “Sequential Design for Computer Experiments with a Flexible Bayesian Additive Model.” Canadian Journal of Statistics 40 (4): 663–78.
Cinelli, Carlos, Andrew Forney, and Judea Pearl. 2021. “A Crash Course in Good and Bad Controls.” Sociological Methods & Research, 00491241221099552.
Cleveland, William S. 1993. Visualizing Data. Hobart press.
Clifford, P., and A. Sudbury. 1973. “A Model for Spatial Conflict.” Biometrika 60: 581–88.
Cox, David R. 1972. “Regression Models and Life-Tables.” Journal of the Royal Statistical Society: Series B (Methodological) 34 (2): 187–202.
Cunningham, Scott. 2021. Causal Inference: The Mixtape. Yale university press.
Dahl, Benjamin K, Matthew J Heaton, Richard L Warr, Jared D Fisher, and Grant G Schultz. 2024. “Modeling Crash Risk on Roadway Networks Using Bayesian Regression Trees.” Technometrics, no. just-accepted: 1–17.
Dayaratna, Kevin D, and Steven J Miller. 2012. “The Pythagorean Won-Loss Formula and Hockey: A Statistical Justification for Using the Classic Baseball Formula as an Evaluative Tool in Hockey.” arXiv Preprint arXiv:1208.1725.
Deshpande, Sameer K. 2024. “flexBART: Flexible Bayesian Regression Trees with Categorical Predictors.” Journal of Computational and Graphical Statistics, no. just-accepted: 1–18.
Deshpande, Sameer K, Ray Bai, Cecilia Balocchi, Jennifer E Starling, and Jordan Weiss. 2020. “VCBART: Bayesian Trees for Varying Coefficients.” arXiv Preprint arXiv:2003.06416.
Diaconis, Persi, and Sandy L Zabell. 1982. “Updating Subjective Probability.” Journal of the American Statistical Association 77 (380): 822–30.
Diamond, Jared. 2020. “Swing Kings: The Inside Story of Baseball’s Home Run Revolution.” (No Title).
Doss, Hani, and Antonio Linero. 2024. “Scalable Empirical Bayes Inference and Bayesian Sensitivity Analysis.” Statistical Science 39 (4): 601–22.
Driscoll, Michael F. 1973. “The Reproducing Kernel Hilbert Space Structure of the Sample Paths of a Gaussian Process.” Zeitschrift für Wahrscheinlichkeitstheorie Und Verwandte Gebiete 26: 309–16.
Duane, Simon, Anthony D Kennedy, Brian J Pendleton, and Duncan Roweth. 1987. “Hybrid Monte Carlo.” Physics Letters B 195 (2): 216–22.
Dukic, Vanja, Hedibert F Lopes, and Nicholas G Polson. 2012. “Tracking Epidemics with Google Flu Trends Data and a State-Space SEIR Model.” Journal of the American Statistical Association 107 (500): 1410–26.
Duvenaud, David. 2014. “Automatic Model Construction with Gaussian Processes.” PhD thesis.
Dyer, Eva L, and Konrad Kording. 2023. “Why the Simplest Explanation Isn’t Always the Best.” Proceedings of the National Academy of Sciences 120 (52): e2319169120.
EFRON, B. 1979. “Bootstrap Method: Another Look at the Jackknife Method.” Annals of Statistics 7 (1): 1–26.
Egri, Gokhan, and Xinran Nicole Han. n.d. “Attention Is Kernel Trick Reloaded.”
Elhage, Nelson, Tristan Hume, Catherine Olsson, Nicholas Schiefer, Tom Henighan, Shauna Kravec, Zac Hatfield-Dodds, et al. 2022. “Toy Models of Superposition.” arXiv Preprint arXiv:2209.10652.
Elhaik, Eran. 2022. “Principal Component Analyses (PCA)-Based Findings in Population Genetic Studies Are Highly Biased and Must Be Reevaluated.” Scientific Reports 12 (1): 14683.
Fletcher, Roger. 1970. “A New Approach to Variable Metric Algorithms.” The Computer Journal 13 (3): 317–22.
Fong, Edwin, Chris Holmes, and Stephen G Walker. 2023. “Martingale Posterior Distributions.” Journal of the Royal Statistical Society Series B: Statistical Methodology 85 (5): 1357–91.
Frisch, R. 1934. “Statistical Confluence Analysis by Means of Complete Regression Systems (1934).” The Foundations of Econometric Analysis, 271.
Gaffney, Jim A, Lin Yang, and Suzanne Ali. 2022. “Constraining Model Uncertainty in Plasma Equation-of-State Models with a Physics-Constrained Gaussian Process.” arXiv Preprint arXiv:2207.00668.
Gamerman, Dani, and Hedibert F Lopes. 2006. Markov Chain Monte Carlo: Stochastic Simulation for Bayesian Inference. Chapman; Hall/CRC.
Gareth, James, Witten Daniela, Hastie Trevor, and Tibshirani Robert. 2013. An Introduction to Statistical Learning: With Applications in r. Spinger.
Garnett, Roman. 2023. Bayesian Optimization. Cambridge University Press.
Gelman, Andrew, John B Carlin, Aki Vehtari Stern Hal S, and Donald B Rubin. 2013. Bayesian Data Analysis. Chapman; Hall/CRC.
Gelman, Andrew, and Cosma Rohilla Shalizi. 2013. “Philosophy and the Practice of Bayesian Statistics.” British Journal of Mathematical and Statistical Psychology 66 (1): 8–38.
Gelman, Andrew, Daniel Simpson, and Michael Betancourt. 2017. “The Prior Can Often Only Be Understood in the Context of the Likelihood.” Entropy 19 (10): 555.
George, Edward I, and Robert E McCulloch. 1993. “Variable Selection via Gibbs Sampling.” Journal of the American Statistical Association 88 (423): 881–89.
———. 1997. “Approaches for Bayesian Variable Selection.” Statistica Sinica, 339–73.
Geweke, John. 1991. “Efficient Simulation from the Multivariate Normal and Student-t Distributions Subject to Linear Constraints and the Evaluation of Constraint Probabilities.” In Computing Science and Statistics: Proceedings of the 23rd Symposium on the Interface, 571:578. Fairfax, Virginia: Interface Foundation of North America, Inc.
Gibson, Graham C, Nicholas G Reich, and Daniel Sheldon. 2023. “Real-Time Mechanistic Bayesian Forecasts of COVID-19 Mortality.” The Annals of Applied Statistics 17 (3): 1801.
Gigerenzer, Gerd. 2004. “Mindless Statistics.” The Journal of Socio-Economics 33 (5): 587–606.
Giordano, Ryan. 2025. “The Biggest Empirical Bayes Estimator in History.” August 25, 2025. https://doi.org/10.59350/rgwda-0tv16.
Goldfarb, Donald. 1970. “A Family of Variable-Metric Methods Derived by Variational Means.” Mathematics of Computation 24 (109): 23–26.
Goodfellow, Ian, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. 2020. “Generative Adversarial Networks.” Communications of the ACM 63 (11): 139–44.
Goorbergh, Ruben van den, Maarten van Smeden, Dirk Timmerman, and Ben Van Calster. 2022. “The Harm of Class Imbalance Corrections for Risk Prediction Models: Illustration and Simulation Using Logistic Regression.” Journal of the American Medical Informatics Association 29 (9): 1525–34.
Görtler, Jochen, Rebecca Kehlbeck, and Oliver Deussen. 2019. “A Visual Exploration of Gaussian Processes.” Distill 4 (4): e17.
Gramacy, Robert B. 2020. Surrogates: Gaussian Process Modeling, Design, and Optimization for the Applied Sciences. Chapman; Hall/CRC.
Gramacy, Robert B, and Herbert K H Lee. 2008. “Bayesian Treed Gaussian Process Models with an Application to Computer Modeling.” Journal of the American Statistical Association 103 (483): 1119–30.
Green, Peter J. 1995. “Reversible Jump Markov Chain Monte Carlo Computation and Bayesian Model Determination.” Biometrika 82 (4): 711–32.
Grinsztajn, Léo, Edouard Oyallon, and Gaël Varoquaux. 2022. “Why Do Tree-Based Models Still Outperform Deep Learning on Typical Tabular Data?” Advances in Neural Information Processing Systems 35: 507–20.
Gu, Youyang. 2020. “COVID-19 Projections Using Machine Learning.” Retrieved May 29: 2020.
Gubernatis, James E. 2005. “Marshall Rosenbluth and the Metropolis Algorithm.” Physics of Plasmas 12 (5).
Hahn, P Richard, and Carlos M Carvalho. 2015. “Decoupling Shrinkage and Selection in Bayesian Linear Models: A Posterior Summary Perspective.” Journal of the American Statistical Association 110 (509): 435–48.
Hahn, P Richard, Carlos M Carvalho, and Sayan Mukherjee. 2013. “Partial Factor Modeling: Predictor-Dependent Shrinkage for Linear Regression.” Journal of the American Statistical Association 108 (503): 999–1008.
Hahn, P Richard, Indranil Goswami, and Carl F Mela. 2015. “A Bayesian Hierarchical Model for Inferring Player Strategy Types in a Number Guessing Game.”
Hahn, P Richard, Jingyu He, and Hedibert Lopes. 2018. “Bayesian Factor Model Shrinkage for Linear IV Regression with Many Instruments.” Journal of Business & Economic Statistics 36 (2): 278–87.
Hahn, P Richard, and Andrew Herren. 2022. “Feature Selection in Stratification Estimators of Causal Effects: Lessons from Potential Outcomes, Causal Diagrams, and Structural Equations.” arXiv Preprint arXiv:2209.11400.
Hahn, P Richard, Ryan Martin, and Stephen G Walker. 2018. “On Recursive Bayesian Predictive Distributions.” Journal of the American Statistical Association 113 (523): 1085–93.
Hahn, P Richard, Jared S Murray, and Carlos M Carvalho. 2020. “Bayesian Regression Tree Models for Causal Inference: Regularization, Confounding, and Heterogeneous Effects (with Discussion).” Bayesian Analysis 15 (3): 965–1056.
Hahn, P. R., D. Puelz, J. He, and C. M. Carvalho. 2016. “Regularization and Confounding in Linear Regression for Treatment Effect Estimation.” Bayesian Analysis. https://doi.org/10.1214/16-BA1044.
Hallas, Laura, Ariq Hatibie, Rachelle Koch, Saptarshi Majumdar, Monika Pyarali, Andrew Wood, and Thomas Hale. 2021. “Variation in US States’ COVID-19 Policy Responses.” Blavatnik School of Government, 2021–05.
Hamelryck, Thomas, and Kanti V Mardia. 2025. “Unfolding AlphaFold’s Bayesian Roots in Probability Kinematics.” arXiv Preprint arXiv:2505.19763.
Han, Lifeng, Changhan He, Huy Dinh, John Fricks, and Yang Kuang. 2022. “Learning Biological Dynamics from Spatio-Temporal Data by Gaussian Processes.” Bulletin of Mathematical Biology 84 (7): 69.
Hanck, Christoph, Martin Arnold, Alexander Gerber, and Martin Schmelzer. 2021. Introduction to Econometrics with r. Universität Duisburg-Essen.
He, Jingyu, and P Richard Hahn. 2023a. “Stochastic Tree Ensembles for Regularized Nonlinear Regression.” Journal of the American Statistical Association 118 (541): 551–70.
———. 2023b. “Stochastic Tree Ensembles for Regularized Nonlinear Regression.” Journal of the American Statistical Association 118 (541): 551–70.
Heiss, Andrew. 2023. “How to Make Fancy Road Trip Maps with R and OpenStreetMap.” June 1, 2023. https://doi.org/10.59350/rgwda-0tv16.
———. 2025. “How to Use a Histogram as a Legend in {Ggplot2}.” February 19, 2025. https://doi.org/10.59350/gt0nr-wct91.
Herren, Andrew, and P Richard Hahn. 2020. “Semi-Supervised Learning and the Question of True Versus Estimated Propensity Scores.” arXiv Preprint arXiv:2009.06183.
———. 2022. “Statistical Aspects of Shap: Functional Anova for Model Interpretation.” arXiv Preprint arXiv:2208.09970.
Herren, Drew, Richard Hahn, Jared Murray, Carlos Carvalho, and Jingyu He. 2025. Stochtree: Stochastic Tree Ensembles (XBART and BART) for Supervised Learning and Causal Inference. https://stochtree.ai/.
Hicks, Michael Townsen, James Humphries, and Joe Slater. 2024. “ChatGPT Is Bullshit.” Ethics and Information Technology 26 (2): 38.
Hill, Jennifer L. 2011. “Bayesian Nonparametric Modeling for Causal Inference.” Journal of Computational and Graphical Statistics 20 (1): 217–40.
Hoeting, Jennifer A, David Madigan, Adrian E Raftery, and Chris T Volinsky. 1999. “Bayesian Model Averaging: A Tutorial (with Comments by m. Clyde, David Draper and EI George, and a Rejoinder by the Authors.” Statistical Science 14 (4): 382–417.
Hoff, Peter D. 2009. A First Course in Bayesian Statistical Methods. Vol. 580. Springer.
Hogg, Robert V, Joseph W McKean, Allen T Craig, et al. 2013. Introduction to Mathematical Statistics. Pearson Education India.
Holland, Paul W. 1986. “Statistics and Causal Inference.” Journal of the American Statistical Association 81 (396): 945–60.
Holley, R. A., and T. M. Liggett. 1975. “Ergodic Theorems for Weakly Interacting Infinite Systems and the Voter Model.” Ann. Prob. 3 (4): 643–63.
Hollmann, Noah, Samuel Müller, Lennart Purucker, Arjun Krishnakumar, Max Körfer, Shi Bin Hoo, Robin Tibor Schirrmeister, and Frank Hutter. 2025. “Accurate Predictions on Small Data with a Tabular Foundation Model.” Nature 637 (8045): 319–26.
Hopp, Steven L, Michael J Owren, and Christopher S Evans. 2012. Animal Acoustic Communication: Sound Analysis and Research Methods. Springer Science & Business Media.
Hotelling, Harold. 1957. “The Relations of the Newer Multivariate Statistical Methods to Factor Analysis.” British Journal of Statistical Psychology 10 (2): 69–79.
Hvitfeldt, Emil, and Julia Silge. 2021. Supervised Machine Learning for Text Analysis in r. Chapman; Hall/CRC.
Jaynes, Edwin T. 2003. Probability Theory: The Logic of Science. Cambridge university press.
Jeffrey, Richard. 2004. Subjective Probability: The Real Thing. Cambridge University Press.
Jeffrey, Richard C. 1957. Contributions to the Theory of Inductive Probability. Princeton University.
Jidling, Carl, Niklas Wahlström, Adrian Wills, and Thomas B Schön. 2017. “Linearly Constrained Gaussian Processes.” Advances in Neural Information Processing Systems 30.
Johnson, Alicia A, Miles Q Ott, and Mine Dogucu. 2022. Bayes Rules!: An Introduction to Applied Bayesian Modeling. Chapman; Hall/CRC.
Jolliffe, Ian T. 1982. “A Note on the Use of Principal Components in Regression.” Journal of the Royal Statistical Society Series C: Applied Statistics 31 (3): 300–303.
Kalai, Adam Tauman, Ofir Nachum, Santosh S Vempala, and Edwin Zhang. 2025. “Why Language Models Hallucinate.” arXiv Preprint arXiv:2509.04664.
Kalbfleisch, JD, and RL Prentice. 1980. “Failure Time Models.” In The Statistical Analysis of Failure Time Data, 21–38. John Wiley, New York.
Kaplan, Edward L, and Paul Meier. 1958. “Nonparametric Estimation from Incomplete Observations.” Journal of the American Statistical Association 53 (282): 457–81.
Kohavi, Ron et al. 1995. “A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection.” In Ijcai, 14:1137–45. 2. Montreal, Canada.
Krantsevich, Chelsea, P Richard Hahn, Yi Zheng, and Charles Katz. 2023. “Bayesian Decision Theory for Tree-Based Adaptive Screening Tests with an Application to Youth Delinquency.” The Annals of Applied Statistics 17 (2): 1038–63.
Krantsevich, Nikolay. 2023. “Tree Ensemble Algorithms for Causal Machine Learning.” PhD thesis, Arizona State University.
Krantsevich, Nikolay, Jingyu He, and P Richard Hahn. 2023. “Stochastic Tree Ensembles for Estimating Heterogeneous Effects.” In International Conference on Artificial Intelligence and Statistics, 6120–31. PMLR.
Kuhn, Max, and Kjell Johnson. 2019. Feature Engineering and Selection: A Practical Approach for Predictive Models. Chapman; Hall/CRC.
Lanchier, Nicolas. 2017. Stochastic Modeling. Springer.
Lange-Hegermann, Markus. 2018. “Algorithmic Linearly Constrained Gaussian Processes.” Advances in Neural Information Processing Systems 31.
Lantz, Brett. 2019. Machine Learning with r: Expert Techniques for Predictive Modeling. Packt publishing ltd.
Lee, Abigail J, Grace E Chesmore, Kyle A Rocha, Amanda Farah, Maryum Sayeed, and Justin Myles. 2022. arXiv Preprint arXiv:2203.16648.
Lei, Bowen, Tanner Quinn Kirk, Anirban Bhattacharya, Debdeep Pati, Xiaoning Qian, Raymundo Arroyave, and Bani K Mallick. 2021. “Bayesian Optimization with Adaptive Surrogate Models for Automated Experimental Design.” Npj Computational Materials 7 (1): 194.
Li, Yuelin, Elizabeth Schofield, and Mithat Gönen. 2019. “A Tutorial on Dirichlet Process Mixture Modeling.” Journal of Mathematical Psychology 91: 128–44.
Linero, Antonio R. 2022. “SoftBart: Soft Bayesian Additive Regression Trees.” arXiv Preprint arXiv:2210.16375.
Linero, Antonio R, and Yun Yang. 2018. “Bayesian Regression Tree Ensembles That Adapt to Smoothness and Sparsity.” Journal of the Royal Statistical Society Series B: Statistical Methodology 80 (5): 1087–1110.
Llaudet, Elena, and Kosuke Imai. 2022. Data Analysis for Social Science: A Friendly and Practical Introduction. Princeton University Press.
Lu, Xuetao, and Robert E McCulloch. 2023. “Gaussian Processes Correlated Bayesian Additive Regression Trees.” arXiv Preprint arXiv:2311.18699.
Maia, Mateus, Keefe Murphy, and Andrew C Parnell. 2024. “GP-BART: A Novel Bayesian Additive Regression Trees Approach Using Gaussian Processes.” Computational Statistics & Data Analysis 190: 107858.
Manski, Charles F. 2009. Identification for Prediction and Decision. Harvard University Press.
McCartan, Cory, and Kosuke Imai. 2023. “Sequential Monte Carlo for Sampling Balanced and Compact Redistricting Plans.” The Annals of Applied Statistics 17 (4): 3300–3323.
McCulloch, Robert E, Rodney A Sparapani, Brent R Logan, and Purushottam W Laud. 2021. “Causal Inference with the Instrumental Variable Approach and Bayesian Nonparametric Machine Learning.” arXiv Preprint arXiv:2102.01199.
McElreath, Richard. 2018. Statistical Rethinking: A Bayesian Course with Examples in r and Stan. Chapman; Hall/CRC.
Miller, Joshua B, and Adam Sanjurjo. 2018. “Surprised by the Hot Hand Fallacy? A Truth in the Law of Small Numbers.” Econometrica 86 (6): 2019–47.
Miller, Steven J. 2007. “A Derivation of the Pythagorean Won-Loss Formula in Baseball.” Chance 20 (1): 40–48.
Mirzadeh, Iman, Keivan Alizadeh, Hooman Shahrokhi, Oncel Tuzel, Samy Bengio, and Mehrdad Farajtabar. 2024. “Gsm-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models.” arXiv Preprint arXiv:2410.05229.
Mohan, Arvind, Ashesh Chattopadhyay, and Jonah Miller. 2024. “What You See Is Not What You Get: Neural Partial Differential Equations and the Illusion of Learning.” arXiv Preprint arXiv:2411.15101.
Molnar, Christoph. 2020. Interpretable Machine Learning. Lulu. com.
Molnar, Christoph, and Timo Freiesleben. 2024. Supervised Machine Learning for Science: How to Stop Worrying and Love Your Black Box. Christoph Molnar.
Monti, Corrado, Marco Pangallo, Gianmarco De Francisci Morales, and Francesco Bonchi. 2023. “On Learning Agent-Based Models from Data.” Scientific Reports 13 (1): 9268.
Morgan, SL. 2015. Counterfactuals and Causal Inference. Cambridge University Press.
Murphy, Kevin P. 2012. Machine Learning: A Probabilistic Perspective. MIT press.
Murray, Jared S. 2021. “Log-Linear Bayesian Additive Regression Trees for Multinomial Logistic and Count Regression Models.” Journal of the American Statistical Association 116 (534): 756–69.
Neal, Radford M. 1996. “Monte Carlo Implementation.” In Bayesian Learning for Neural Networks, 55–98. Springer.
Nelder, John A, and Roger Mead. 1965. “A Simplex Method for Function Minimization.” The Computer Journal 7 (4): 308–13.
Newman, Mark. 2013. “Computational Physics.” (No Title).
Nguyen, Mike. 2020. A Guide on Data Analysis: From Basics to Causal Inference. Bookdown.
Nikolaou, Michael. 2022. “Revisiting the Standard for Modeling the Spread of Infectious Diseases.” Scientific Reports 12 (1): 7077.
Nuzzo, Regina. 2014. “Scientific Method: Statistical Errors.” Nature 506 (7487).
Onyper, Serge V, Pamela V Thacher, Jack W Gilbert, and Samuel G Gradess. 2012. “Class Start Times, Sleep, and Academic Performance in College: A Path Analysis.” Chronobiology International 29 (3): 318–35.
Orlandi, Vittorio, Jared Murray, Antonio Linero, and Alexander Volfovsky. 2021. “Density Regression with Bayesian Additive Regression Trees.” arXiv Preprint arXiv:2112.12259.
Papakostas, Demetrios, P Richard Hahn, Jared Murray, Frank Zhou, and Joseph Gerakos. 2023. “Do Forecasts of Bankruptcy Cause Bankruptcy? A Machine Learning Sensitivity Analysis.” The Annals of Applied Statistics 17 (1): 711–39.
Pearl, Judea. 2009. Causality. Cambridge university press.
———. 2022. “Causal Diagrams for Empirical Research (with Discussions).” In Probabilistic and Causal Inference: The Works of Judea Pearl, 255–316.
Pell, Bruce, Samantha Brozak, Tin Phan, Fuqing Wu, and Yang Kuang. 2023. “The Emergence of a Virus Variant: Dynamics of a Competition Model with Cross-Immunity Time-Delay Validated by Wastewater Surveillance Data for COVID-19.” Journal of Mathematical Biology 86 (5): 63.
Postman, Marc, John Peter Huchra, and Margaret J Geller. 1986. “Probes of Large-Scale Structure in the Corona Borealis Region.” Astronomical Journal (ISSN 0004-6256), Vol. 92, Dec. 1986, p. 1238-1247. 92: 1238–47.
Powers, Scott. 2025. sabRmetrics: Query statsapi,baseballsavant.mlb.com and Fit Fundamental Sabermetric Models. https://github.com/saberpowers/sabRmetrics.
Pratola, Matthew T, Hugh A Chipman, Edward I George, and Robert E McCulloch. 2020. “Heteroscedastic BART via Multiplicative Regression Trees.” Journal of Computational and Graphical Statistics 29 (2): 405–17.
Quiroga, Miriana, Pablo G Garay, Juan M Alonso, Juan Martin Loyola, and Osvaldo A Martin. 2022. “Bayesian Additive Regression Trees for Probabilistic Programming.” arXiv Preprint arXiv:2206.03619.
Raissi, Maziar, and George Em Karniadakis. 2018. “Hidden Physics Models: Machine Learning of Nonlinear Partial Differential Equations.” Journal of Computational Physics 357: 125–41.
Raissi, Maziar, Paris Perdikaris, and George Em Karniadakis. 2017. “Machine Learning of Linear Differential Equations Using Gaussian Processes.” Journal of Computational Physics 348: 683–93.
———. 2018. “Numerical Gaussian Processes for Time-Dependent and Nonlinear Partial Differential Equations.” SIAM Journal on Scientific Computing 40 (1): 172–98.
Rappold, Ana Grohovac, Michael Lavine, and Susan Lozier. 2007. “Subjective Likelihood for the Assessment of Trends in the Ocean’s Mixed-Layer Depth.” Journal of the American Statistical Association 102 (479): 771–80.
Rencher, Alvin C, and G Bruce Schaalje. 2008. Linear Models in Statistics. John Wiley & Sons.
Ressel, S, JJ Ruby, GW Collins, and JR Rygg. 2022. “Density Reconstruction in Convergent High-Energy-Density Systems Using x-Ray Radiography and Bayesian Inference.” Physics of Plasmas 29 (7).
Roeder, Kathryn. 1990. “Density Estimation with Confidence Sets Exemplified by Superclusters and Voids in the Galaxies.” Journal of the American Statistical Association 85 (411): 617–24.
Roeder, Kathryn, and Larry Wasserman. 1997. “Practical Bayesian Density Estimation Using Mixtures of Normals.” Journal of the American Statistical Association 92 (439): 894–902.
Rosenbaum, P. R., and D. B. Rubin. 1983. “The Central Role of the Propensity Score in Observational Studies for Causal Effects.” Biometrika 70: 41–55.
Rossi, Peter E, Greg M Allenby, and Robert E McCulloch. 2003. “Bayesian Statistics and Marketing.” Marketing Science 22 (3): 304–28.
Sauer, Annie, Robert B Gramacy, and David Higdon. 2023. “Active Learning for Deep Gaussian Process Surrogates.” Technometrics 65 (1): 4–18.
Schaeffer, Rylan. 2023. “Pretraining on the Test Set Is All You Need.” arXiv Preprint arXiv:2309.08632.
Schenker, Nathaniel. 1985. “Qualms about Bootstrap Confidence Intervals.” Journal of the American Statistical Association 80 (390): 360–61.
Senn, Stephen, Erika Graf, and Angelika Caputo. 2007. “Stratification for the Propensity Score Compared with Linear Regression Techniques to Assess the Effect of Treatment or Exposure.” Statistics in Medicine 26 (30): 5529–44.
Shah, Amar, Andrew Wilson, and Zoubin Ghahramani. 2014. “Student-t Processes as Alternatives to Gaussian Processes.” In Artificial Intelligence and Statistics, 877–85. PMLR.
Shalizi, Cosma. 2013. “Advanced Data Analysis from an Elementary Point of View.”
Shalizi, Cosma Rohilla. 2021. “A Note on Simulation-Based Inference by Matching Random Features.” arXiv Preprint arXiv:2111.09220.
Shalizi, Cosma Rohilla, and Andrew C Thomas. 2011. “Homophily and Contagion Are Generically Confounded in Observational Social Network Studies.” Sociological Methods & Research 40 (2): 211–39.
Shanno, David F. 1970. “Conditioning of Quasi-Newton Methods for Function Minimization.” Mathematics of Computation 24 (111): 647–56.
Shi, Yuge. 2019. “Gaussian Processes, Not Quite for Dummies.” The Gradient.
Silver, Nate. 2012. The Signal and the Noise: Why so Many Predictions Fail-but Some Don’t. Penguin.
Simpson, Dan. 2021. “Yes but What Is a Gaussian Process? Or, Once, Twice, Three Times a Definition; or A Descent into Madness.” November 3, 2021. https://dansblog.netlify.app/yes-but-what-is-a-gaussian-process-or-once-twice-three-times-a-definition-or-a-descent-into-madness.
Solak, Ercan, Roderick Murray-Smith, WE Leithead, Douglas Leith, and Carl Rasmussen. 2002. “Derivative Observations in Gaussian Process Models of Dynamic Systems.” Advances in Neural Information Processing Systems 15.
Sparapani, Rodney A, Brent R Logan, Martin J Maiers, Purushottam W Laud, and Robert E McCulloch. 2023. “Nonparametric Failure Time: Time-to-Event Machine Learning with Heteroskedastic Bayesian Additive Regression Trees and Low Information Omnibus Dirichlet Process Mixtures.” Biometrics 79 (4): 3023–37.
Starling, Jennifer E, Jared S Murray, Carlos M Carvalho, Radek K Bukowski, and James G Scott. 2020. “BART with Targeted Smoothing: An Analysis of Patient-Specific Stillbirth Risk.”
Sudhakar, Tarini, Ashna Bhansali, John Walkington, and David Puelz. 2024. “The Disutility of Compartmental Model Forecasts During the COVID-19 Pandemic.” Frontiers in Epidemiology 4: 1389617.
Sueur, Jérôme, Thierry Aubin, and Caroline Simonis. 2008. “Seewave, a Free Modular Tool for Sound Analysis and Synthesis.” Bioacoustics 18 (2): 213–26.
Swiler, Laura P, Mamikon Gulian, Ari L Frankel, Cosmin Safta, and John D Jakeman. 2020. “A Survey of Constrained Gaussian Process Regression: Approaches and Implementation Challenges.” Journal of Machine Learning for Modeling and Computing 1 (2).
Tan, Yaoyuan Vincent, Carol AC Flannagan, and Michael R Elliott. 2018. “Predicting Human-Driving Behavior to Help Driverless Vehicles Drive: Random Intercept Bayesian Additive Regression Trees.” Statistics and Its Interface 11 (4): 557–72.
Tan, Yaoyuan Vincent, and Jason Roy. 2019. “Bayesian Additive Regression Trees and the General BART Model.” Statistics in Medicine 38 (25): 5048–69.
Tarone, Robert E. 1982. “The Use of Historical Control Information in Testing for a Trend in Proportions.” Biometrics, 215–20.
Tarzanagh, Davoud Ataee, Yingcong Li, Christos Thrampoulidis, and Samet Oymak. 2023. “Transformers as Support Vector Machines.” arXiv Preprint arXiv:2308.16898.
Terfloth, Lothar, and Johann Gasteiger. 2001. “Neural Networks and Genetic Algorithms in Drug Design.” Drug Discovery Today 6: 102–8.
Thal, Dan RC, and Mariel M Finucane. 2023. “Causal Methods Madness: Lessons Learned from the 2022 ACIC Competition to Estimate Health Policy Impacts.” Observational Studies 9 (3): 3–27.
Thomay, Cal D. 2014. “Markov Chain Theory with Applications to Baseball.”
Vaswani, A. 2017. “Attention Is All You Need.” Advances in Neural Information Processing Systems.
Wachter, Sandra, Brent Mittelstadt, and Chris Russell. 2017. “Counterfactual Explanations Without Opening the Black Box: Automated Decisions and the GDPR.” Harv. JL & Tech. 31: 841.
Wahba, Grace. 1973. “A Class of Approximate Solutions to Linear Operator Equations.” Journal of Approximation Theory 9 (1): 61–77.
Wang, Meijia, Jingyu He, and P Richard Hahn. 2024. “Local Gaussian Process Extrapolation for BART Models with Applications to Causal Inference.” Journal of Computational and Graphical Statistics 33 (2): 724–35.
Wang, Meijia, Ignacio Martinez, and P Richard Hahn. 2024. “LongBet: Heterogeneous Treatment Effect Estimation in Panel Data.” arXiv Preprint arXiv:2406.02530.
Wasserman, Larry. 2004. All of Statistics: A Concise Course in Statistical Inference. Springer Science & Business Media.
———. 2006. All of Nonparametric Statistics. Springer Science & Business Media.
Whitehead, Thomas M. 2025. “Beyond What’s Normal: Bimodal and Heaviside Alternatives to Gaussian Process Regression.” Machine Learning 114 (12): 286.
Williams, Christopher KI, and Carl Edward Rasmussen. 2006. Gaussian Processes for Machine Learning. Vol. 2. 3. MIT press Cambridge, MA.
Woody, C., S. Carvalho, P. R. Hahn, and J. Murray. 2020. “Estimating Heterogeneous Effects of Continuous Exposures Using Bayesian Tree Ensembles: Revisiting the Impact of Abortion Rates on Crime.” Arxiv Preprint.
Woody, Spencer, Carlos M Carvalho, and Jared S Murray. 2021. “Model Interpretation Through Lower-Dimensional Posterior Summarization.” Journal of Computational and Graphical Statistics 30 (1): 144–61.
Woolridge, J. 2010. Econometric Analysis of Cross Section and Panel Data. Cambridge, Massachusetts: Massachusetts Institute of Technology.
Xin, Xi, Fei Huang, and Giles Hooker. 2024. “Why You Should Not Trust Interpretations in Machine Learning: Adversarial Attacks on Partial Dependence Plots.” arXiv Preprint arXiv:2404.18702.
Yang, Xiu, David Barajas-Solano, Guzel Tartakovsky, and Alexandre M Tartakovsky. 2019. “Physics-Informed CoKriging: A Gaussian-Process-Regression-Based Multifidelity Method for Data-Model Convergence.” Journal of Computational Physics 395: 410–31.
Yang, Xiu, Guzel Tartakovsky, and Alexandre Tartakovsky. 2018. “Physics-Informed Kriging: A Physics-Informed Gaussian Process Regression Method for Data-Model Convergence.” arXiv Preprint arXiv:1809.03461.
Zeitler, Jakob, Athanasios Vlontzos, and Ciarán Mark Gilligan-Lee. 2023. “Non-Parametric Identifiability and Sensitivity Analysis of Synthetic Control Models.” In Conference on Causal Learning and Reasoning, 850–65. PMLR.
Zellner, Arnold. 1962. “An Efficient Method of Estimating Seemingly Unrelated Regressions and Tests for Aggregation Bias.” Journal of the American Statistical Association 57 (298): 348–68.
Zhang, Lawrence J. 2021. “A Friendly Introduction to Compressed Sensing.”
Zhou, Shuang, P Giulani, J Piekarewicz, Anirban Bhattacharya, and Debdeep Pati. 2019. “Reexamining the Proton-Radius Problem Using Constrained Gaussian Processes.” Physical Review C 99 (5): 055202.