Paper Stack

This file serves as a list of all papers I have come across that sound interesting enough to potentially read them eventually. Please note that I started the list in October 2020. As I only fill it up as I go, it does not reflect everything I have read, alas.

Inspiration: Cosma Shalizi’s notebooks.

format: authors, year, title, source, link, added_date, modified_date, tags

Have Read

Steven L. Scott, Hal Varian (2013). Predicting the Present with Bayesian Structural Time Series. Google. 2020-10-04. 2020-10-04. statistics time_series bayesian forecasting paper

Hagai Attias (). Planning by Probabilistic Inference. Microsoft. 2020-10-04. 2020-10-04. statistics reinforcement_learning optimal_control bayesian probabilistic_programming paper

John Winn, Christopher M. Bishop, Thomas Diethe, John Guiver, Yordan Zaykov (2019). Model-based Machine Learning. http://mbmlbook.com. statistics probabilistic_programming machine_learning book

Seung-Jean Kim, Kwangmoo Koh, Stephen Boyd, Dimitry Gorinevsky (). l1 Trend Filtering. SIAM Review. 2020-10-04. 2020-10-04. statistics time_series paper

This paper has strong relation to Sean J. Taylor and Benjamin Letham’s trend model implemented in the Prophet package and described in the accompanying paper Forecasting at Scale.

Sean J. Taylor, Benjamin Letham (2017). Forecasting at Scale. https://peerj.com/preprints/3190/. 2020-10-04. 2020-10-04. statistics time_series forecasting bayesian probabilistic programming stan r python paper

Souhaib Ben Taieb, James W. Taylor, Rob J. Hyndman (2017). Coherent Probabilistic Forecasts for Hierarchical Time Series. International Conference on Machine Learning. 2020-10-04. 2020-10-04. statistics time_series forecasting copula hierarchical probabilistic paper

The authors use copulas to enable the estimation of a joint distribution from the marginal distributions available from bottom-up forecasts.

Marta Banbura, Domenico Giannone, Michele Lenza (2014). Conditional Forecasts and Scenario Analysis with Vector Autoregressions for Large Cross-Sections. European Central Bank Working Paper. 2020-10-07. 2020-10-07. time_series statistics econometric optimal_control paper

This one got me interested in the idea of applying optimal control or reinforcement learning methods to monetary policy decisions, especially in the way suggested by Attias’ “Planning by Probabilistic Inference”. It’s all about finding optimal policies.

Marcelo Hartmann, Georgi Agiashvili, Paul Bürkner & Arto Klami (2020). Flexible Prior Elicitation via the Prior Predictive Distribution. arXiv:2002.09868. 2020-10-08. 2020-10-08. bayesian prior_elicitation probabilistic_programming statistics paper

Ruben Crevits, Christophe Croux (2016). Forecasting using robust exponential smoothing with damped trend and seasonal components. KU Leuven Working Paper. 2020-10-08. 2020-10-08. time_series forecast robust statistics paper

Dhruv Madeka, Lucas Swiniarski, Dean Foster, Leo Razoumov, Kari Torkkola, Ruofeng Wen (2018). Sample Path Generation for Probabilistic Demand Forecasting. MiLeTS 2018. 2020-10-08. 2020-10-08. time_series probabilistic demand_forecasting forecasting paper

Jessica Hullman, Andrew Gelman (2020). Interactive Analysis Needs Theories of Inference. http://www.stat.columbia.edu/~gelman/research/unpublished/EDA_theories_of_inference.pdf. 2020-10-25. 2020-10-25. statistics data_visualization data_analysis prior_elicitation model_checking multiple_comparisons paper

To Read

Veronica J. Berrocal, Adrian E. Raftery, Tilmann Gneiting, and Richard C. Steed (2010). Probabilistic Weather Forecasting for Winter Road Maintenance. Journal of the American Statistical Association. 2020-10-04. 2020-10-04. statistics probabilistic forecasting time_series application copula optimal_control paper

Zad Rafi, Sander Greenland (2020). Semantic and cognitive tools to aid statistical science: replace confidence and significance by compatibility and surprise. BMC Medical Research Methodology. 2020-10-04. 2020-10-04. statistics frequentist paper

Alexander Dokumentov, Rob J. Hyndman (2013). Two-dimensional smoothing of mortality rates. Monash University Working Paper. 2020-10-04. 2020-10-04. statistics time_series forecasting paper

Alexander Dokumentov, Rob J. Hyndman (2014). Low-dimensional decomposition, smoothing and forecasting of sparse functional data. Monash University Working Paper. 2020-10-04. 2020-10-04. statistics time_series functional forecasting paper

William R. Bell, Donald E. K. Martin. Modeling Time-Varying Trading-Day Effects in Monthly Time Series. U.S. Census Bureau. 2020-10-04. 2020-10-04. statistics time_series paper

Drew A. Linzer (2013). Dynamic Bayesian Forecasting of Presidential Elections in the States. Journal of the American Statistical Association. 2020-10-04. 2020-10-04. statistics forecasting bayesian paper

This paper is the foundation for Slate’s presidential election forecast model.

Anindya Roy, Tucker S. McElroy, Peter Linton (2014). Estimation of Causal Invertible VARMA Models. U.S. Census Bureau. arXiv:1406.4584. 2020-10-04. 2020-10-04. statistics time_series paper

R.E. Kalman (1960). A New Approach to Linear Filtering and Prediction Problems. Transactions of the ASME, Journal of Basic Engineering. time_series statistics forecasting paper

Brendan O’Donoghue, Ian Osband, Catalin Ionescu (2020). Making Sense of Reinforcement Learning and Probabilistic Infernece. International Confernce on Learning Representations. arXiv:2001.00805. 2020-10-04. 2020-10-04. reinforcement_learning optimal_control probabilistic statistics paper

If I remember Osband’s Twitter thread correctly, this paper is a critical view of, among others, Levine’s “Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review”.

Sergey Levine (2018). Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review. UC Berkeley. arXiv:1805.00909. 2020-10-04. 2020-10-04. reinforcement_learning optimal_control probabilistic statistics paper

Jan-Willem van de Meent, Brooks Paige, Hongseok Yang, Frank Wood (2018). An Introduction to Probabilistic Programming. arXiv:1809.10756. 2020-10-04. 2020-10-04. statistics computer_science probabilistic_programming bayesian paper

Andrew Gelman, Christian Hennig (2017). Beyond subjective and objective in statistics. The Royal Statistical Society. 2020-10-04. 2020-10-04. statistics bayesian frequentist paper

Yea-Seul Kim, Paula Kayongo, Madeleine Grunde-McLaughlin, Jessica Hullman (2020). Bayesian-Assisted Inference from Visualized Data. arXiv:2008.00142. 2020-10-04. 2020-10-04. bayesian uncertainty_visualization data_visualization uncertainty_communication statistics paper

Tilmann Gneiting, Matthias Katzfuss (2014). Probabilistic Forecasting. Annual Review of Statistics and Its Application. 2020-10-04. 2020-10-04. probabilistic time_series forecasting statistics paper

Tilmann Gneiting, Adrian E. Raftery (2007). Strictly Proper Scoring Rules, Prediction, and Estimation. Journal of the American Statistical Association. 2020-10-04. 2020-10-04. probabilistic forecasting evaluation statistics paper

Roman Schefzik, Thordis L. Thorarinsdottir, Tilmann Gneiting (2013). Uncertainty Quantification in Complex Simulation Models Using Ensemble Copula Coupling. Statistical Science. arXiv:1302.7149. 2020-10-04. 2020-10-04. copula statistics simulation time_series optimal_control paper

This paper applies its methods to the problem of weather and climate predictions.

Tilmann Gneiting (2009). Making and Evaluating Point Forecasts. arXiv:0912.0902. 2020-10-04. 2020-10-04. time_series forecasting evaluation paper

Claire Vernade, Olivier Cappé, Vianney Perchet (2017). Stochastic Bandit Models for Delayed Conversions. arXiv:1706.09186. 2020-10-04. 2020-10-04. multi_armed_bandits statistics machine_learning paper

David J. Spiegelhalter (1986). Probabilistic prediction in patient management and clinical trials. Statistics in Medicine. https://doi.org/10.1002/sim.4780050506. 2020-10-04. 2020-10-04. statistics healthcare clinical_trials probabilistic bayesian paper

Anne Marthe van der Bles, Sander van der Linden, Alexandra L. J. Freeman, James Mitchell, Ana B. Galvao, Lisa Zaval, David J. Spiegelhalter (2019). Communicating uncertainty about facts, numbers and science. Royal Society Open Science. https://doi.org/10.1098/rsos.181870. 2020-10-04. 2020-10-04. statistics uncertainty_communication paper

Bo Peng, Jiayu Li, Selahattin Akkas, Fugang Wang, Takuya Araki, Ohno Yoshiyuki, Judy Qiu (2020). Rank Position Forecasting in Car Racing. arXiv:2010.01707. 2020-10-07. 2020-10-07. forecasting formula1 machine_learning paper

Zhijie Deng, Xiao Yang, Hao Zhang, Yinpeng Dong, Jun Zhu (2020). BayesAdapter: Being Bayesian, Inexpensively and Robustly via Bayesian Fine-Tuning. arXiv:2010.01979. 2020-10-07. 2020-10-07. bayesian neural_networks machine_learning variational_inference paper

Jinwen Qiu, S. Rao Jammalamadaka, Ning Ning (2018). Multivariate Bayesian Structural Time Series Model. Journal of Machine Learning Research. 2020-10-07. 2020-10-07. bayesian time_series statistics paper.

Ning Ning (2020). Multivariate Quantile Bayesian Structural Time Series (MQBSTS) Model. arXiv:2010.01654. 2020-10-07. 2020-10-07. bayesian forecasting time_series quantile_forecasts statistics paper

Nadja Klein, Michael Stanley Smith, David J. Nott (2020). Deep Distributional Time Series Models and the Probabilistic Forecasting of Intraday Electricity Prices. arXiv:2010.01844. 2020-10-07. 2020-10-07. forecasting electricity_forecasting neural_networks machine_learning probabilistic paper

Hédi Hadiji, Sébastien Gerchinovitz, Jean-Michel Loubes, Gilles Stoltz (2020). Diversity-Preserving K–Armed Bandits, Revisited. arXiv:2010.01874. 2020-10-07. 2020-10-07. multi_armed_bandits statistics machine_learning paper

SeungKee Jeon (2020). 1st Place Solution to Google Landmark Retrieval 2020. arXiv:2009.05132. 2020-10-07. 2020-10-07. google kaggle machine_learning computer_vision transfer_learning paper

Alexander K. Lew, Monica Agrawal, David Sontag, Vikash K. Mansinghka (2020). PClean: Bayesian Data Cleaning at Scale with Domain-Specific Probabilistic Programming. arXiv:2007.11838. 2020-10-07. 2020-10-07. probabilistic_programming bayesian statistics computer_science paper

Sayani Gupta, Rob J Hyndman, Dianne Cook, Antony Unwin (2020). Visualizing probability distributions across bivariate cyclic temporal granularities. arXiv:2010.0079. 2020-10-07. 2020-10-07. time_series data_visualization paper

Urvashi Khandelwal, Angela Fan, Dan Jurafsky, Luke Zettlemoyer, Mike Lewis (2020). Nearest Neighbor Machine Translation. arXiv:2010.00710. 2020-10-07. 2020-10-07. natural_language_processing machine_learning paper

Thomas G. Dietterich (1997). Approximate Statistical Tests for Comparing Supervised Classification Learning Algorithms. ?. 2020-10-07. 2020-10-07. machine_learning statistics evaluation paper

Mike West (2019). Bayesian Forecasting of Multivariate Time Series: Scalability, Structure Uncertainty and Decisions. arXiv:1911.09656. 2020-10-07. 2020-10-07. bayesian statistics time_series paper

James H. Stock, Mark W. Watson (2010). Dynamic Factor Models. Oxford Handbook of Economic Forecasting. 2020-10-07. 2020-10-07. time_series statistics econometrics paper

Robert B. Litterman (1984). Foreasting and Policy Analysis With Bayesian Vector Autoregression Models. Federal Reserve Bank of Minneapolis Quaterly Review. ?. 2020-10-07. 2020-10-07. time_series econometrics macroeconomics monetary_policy paper

Christopher A. Sims (1993). A Nine-Variable Probabilistic Macroeconomic Forecasting Model. http://www.nber.org/chapters/c7192. 2020-10-07. 2020-10-07. time_series econometrics macroeconomics paper

Stock and Watson are the editors of this NBER volume.

Tor Jacobson, Per Jansson, Anders Vredin, Anders Warne (1999). A VAR Model for Monetary Policy Analysis in a Small Open Economy. ?. 2020-10-07. 2020-10-07. time_series macroeconomics econometrics paper

Andrew Levin, Volker Wieland, John C. Williams (2001). The Performance of Forecast-Based Monetary Policy Rules under Model Uncertainty. ?. 2020-10-07. 2020-10-07. time_series macroeconomics econometrics optimal_control monetary_policy paper

A. Hakan Kara (2004). Optimal Monetary Policy, Commitment, and Imperfect Credibility. Central Bank Review, Central Bank of the Republic of Turkey. 2020-10-07. 2020-10-07. monetary_policy optimal_control macroeconomics paper

Vitor Gaspar, Frank Smets, and David Vestin (2011). Inflation Expectations, Adaptive Learning and Optimal Monetary Policy. Handbook of Monetary Economics, Volume 3B. 2020-10-07. 2020-10-07. monetary_policy optimal_control macroeconomics paper

Ben S. Bernanke, Jean Boivin, Piotr Eliasz (2005). Measuring the Effects of Monetary Policy - A Factor-Augmented Vector Autoregressive (FAVAR) Approach. The Quarterly Journal of Economics. 2020-07-01. 2020-07-01. monetary_policy time_series econometrics macroeconomics paper

Athanasios Orphanides, John C. Williams (2008). Learning, Expectations Formation, and the Pitfalls of Optimal Control Monetary Policy. ?. 2020-10-07. 2020-10-07. optimal_control monetary_policy macroeconomics paper

Juan F. Rubio-Ramírez, Daniel F. Waggoner, Tao Zha (2008). Structural Vector Autoregressions: Theory of Identification and Algorithms for Inference. Federal Reserve Bank of Atlanta Working Paper. 2020-07-01. 2020-07-01. time_series econometrics paper.

Gary Koop, Dimitris Korobilis (2009). Bayesian Multivariate Time Series Methods for Empirical Macroeconomics. Foundations and Trends in Econometrics. 2020-10-07. 2020-10-07. time_series econometrics macroeconomics paper

Todd E. Clark, Michael W. McCracken (2015). Evaluating Conditional Forecasts from Vector Autoregressions. http://research.stlouisfed.org/wp/2014/2014-025.pdf. 2020-10-07. 2020-10-07. time_series econometrics paper

Daniel F. Waggoner, Tao Zha (1998). Conditional Forecasts in Dynamic Multivariate Models. Federal Reserve Bank of Atlanta Working Paper. 2020-10-07. 2020-10-07. time_series econometrics paper

Helmut Lütkepohl (2005). New Introduction to Multiple Time Series Analysis. Springer. 2020-10-07. 2020-10-07. time_series statistics econometrics book

Sarah E. Heaps (2020). Enforcing stationarity through the prior in vector autoregressions. arXiv:2004.09455. 2020-07-01. 2020-07-01. time_series statistics bayesian stan probabilistic_programming paper

Heaps presented her paper at StanCon 2020.

Benjamin Recht (2018). A Tour of Reinforcement Learning: The View from Continuous Control. arXiv:1806.09460. 2020-10-07. 2020-10-07. reinforcement_learning optimal_control review paper

Attended Recht’s related tutorial at ICML 2018.

Berk Ustun, Cynthia Rudin (2019). Learning Optimized Risk Scores. Journal of Machine Learning Research. 2020-10-07. 2020-10-07. machine_learning constrained_optimization mixed_integer_nonlinear_program interpretable paper

“We developed methods that let domain experts to specify constraints on model form and predictions, and that inform customization by telling them how their constraints affect performance”

Konstantin Mishchenko, Mallory Montgomery, Federico Vaggi (2019). A Self-supervised Approach to Hierarchical Forecasting with Applications to Groupwise Synthetic Controls. arXiv:1906.10586. 2020-10-07. 2020-10-07. synthetic_control causal_inference time_series hierarchical forecasting statistics paper

Zhengfan Wang, Miranda J. Fix, Lucia Hug, Anu Mishra, Danzhen You, Hannah Blencowe, Jon Wakefield, Leontine Alkema (2020). Estimating the Stillbirth Rate for 195 Countries Using a Bayesian Sparse Regression Model with Temporal Smoothing. arXiv:2010.0355. 2020-10-08. 2020-10-08. bayesian hierarchical time_series stillbirth horseshoe application statistics paper

Amanda Gentzel, Justin Clarke, David Jensen (2020). Using Experimental Data to Evaluate Methods for Observational Causal Inference. arXiv:2010.0305. 2020-10-08. 2020-10-08. statistics causal_inference observational_studies

Zachary C. Lipton (2017). The Mythos of Model Interpretability. arXiv:1606.03490. 2020-10-12. 2020-10-12. machine_learning interpretability explainable_ai paper

Tony Duan, Anand Avati, Daisy Yi Ding, Sanjay Basu, Andrew Ng, Alejandro Schuler (2020). NGBoost: Natural Gradient Boosting for Probabilistic Prediction. arXiv:1910.03225. 2020-10-12. 2020-10-12. machine_learning gradient_boosting probabilistic ngboost paper

Eric Zelikman, Sharon Zhou, Jeremy Irvin, Cooper Raterink, Hao Sheng, Jack Kelly, Ram Rajagopal, Andrew Y. Ng, David Gagne (2020). Short-Term Solar Irradiance Forecasting Using Calibrated Probabilistic Models. arXiv:2010.04715. 2020-10-12. 2020-10-12. machine_learning forecasting solar_irridiance probabilistic ngboost gradient_boosting application paper

Ramu Ramanathan, Robert Engle, Clive W.J.Granger, Farshid Vahid-Araghi, Casey Brace (1997). Short-run forecasts of electricity loads and peaks. International Journal of Forecasting. https://doi.org/10.1016/S0169-2070(97)00015-0. 2020-10-17. 2020-10-17. forecast time_series granger engle electricity paper

V. Dordonnata, S.J. Koopman, M. Ooms, A. Dessertaine, J. Collet (2008). An Hourly Periodic State Space Model for Modelling French National Electricity Load. Tinbergen Institute Discussion Paper. https://papers.tinbergen.nl/08008.pdf. 2020-10-17. 2020-10-17. forecast time_series state_space_model electricity paper

A.E. Clements, A.S. Hurn, Z. Li (2014). Forecasting day-ahead electricity load using a multiple equation time series approach. NCER Working Paper Series. http://www.ncer.edu.au/papers/documents/WP103R.pdf. 2020-10-17. 2020-10-17. forecast time_series electricity seemingly_unrelated_regression paper

Souhaib Ben Taieb, Rob J Hyndman (2013). A gradient boosting approach to the Kaggle load forecasting competition. Preprint submitted to International Journal of Forecasting. https://robjhyndman.com/papers/kaggle-competition.pdf. 2020-10-17. 2020-10-17. forecast time_series kaggle gradient_boosting electricity paper

Souhaib Ben Taieb, Rob J Hyndman (2014). Boosting multi-step autoregressive forecasts. Proceedings of the 31st International Conference on Machine Learning. http://proceedings.mlr.press/v32/taieb14.pdf. 2020-10-17. 2020-10-17. forecast time_series gradient_boosting paper

Alon Jacovi, Ana Marasović, Tim Miller, Yoav Goldberg (2020). Formalizing Trust in Artificial Intelligence: Prerequisites, Causes and Goals of Human Trust in AI. arXiv:2010.07487. 2020-10-17. 2020-10-17.

Mitsuru Igami (2020). Artificial intelligence as structural estimation: Deep Blue, Bonanza, and AlphaGo. Econometrics Journal, doi: 10.1093/ectj/utaa005. 2020-10-19. 2020-10-19. ai econometrics structural_estimation dynamic_structural_estimation reinforcement_learning dynamic_programming paper

Abhraneel Sarma, Matthew Kay (2020). Prior Setting in Practice: Strategies and Rationales Used in Choosing Prior Distributions for Bayesian Analysis. CHI’20. 2020-10-25. 2020-10-25. bayesian_inference prior_distribution paper

Xiaoying Pu, Matthew Kay (2018). The Garden of Forking Paths in Visualization: A Design Space for Reliable Exploratory Visual Analytics. 2020-10-25. 2020-10-25. bayesian_inference multiple_comparisons statistics data_analysis paper

Andrew Gelman, Xiao-Li Meng, Hal Stern (1996). Posterior Predictive Assessment of Model Fitness via Realized Discrepancies. Statistica Sinica 6(1996), 733-807. 2020-10-25. 2020-10-25. bayesian_inference model_checking statistics paper

Andrew Gelman, Erik Loken (). The Statistical Crisis in Science. American Scientist, Volume 102. 2020-10-25. 2020-10-25. multiple_comparisons statistics paper

Andrew Gelman, Erik Loken (2013). The garden of forking paths: Why multiple comparisons can be a problem, even when there is no “fishing expedition” or “p-hacking” and the research hypothesis was posited ahead of time. 2020-10-25. 2020-10-25. multiple_comparisons statistics paper

Andrew Gelman, Jennifer Hill, Masanao Yajima (2012). Why We (Usually) Don’t Have to Worry About Multiple Comparisons. Journal of Research on Educational Effectiveness, 5: 189–211, 2012. 2020-10-25. 2020-10-25. bayesian_inference multiple_comparisons statistics paper

Andrew Gelman, Guido Imbens (2013). Why ask why? Forward causal inference and reverse causal questions. 2020-10-25. 2020-10-25. causal_inference statistics econometrics paper

Andrew Gelman, Thomas Basbøll (2014). When do stories work? Evidence and illustration in the social sciences. Sociological Methods and Research. 2020-10-25. 2020-10-25.

Andrew Gelman (2003). A Bayesian Formulation of Exploratory Data Analysis and Goodness-of-Fit Testing.

Andrew Gelman (2004). Exploratory Data Analysis for Complex Models. Journal of Computational and Graphical Statistics, Volume 13, Number 4, Pages 755–779. 2020-10-25. 2020-10-25. data_analysis statistics paper

Andrew Gelman, Yuling Yao (2020). Holes in Bayesian Statistics. 2020-10-25. 2020-10-25. bayesian_statistics statistics paper

Emanuel Zgraggen, Zheguang Zhao, Robert Zeleznik, Tim Kraska (2018). Investigating the Effect of the Multiple Comparisons Problem in Visual Analysis. CHI’18. 2020-10-25. 2020-10-25.

Macartan Humphreys, Raul Sanchez de la Sierra, Peter van der Windt (2013). Fishing, Commitment, and Communication: A Proposal for Comprehensive Nonbinding Research Registration. Political Analysis 21:1–20 doi:10.1093/pan/mps021. 2020-10-23. 2020-10-23.