Home on Minimize Regret

Home on Minimize Regret / Recent content in Home on Minimize Regret Hugo -- gohugo.io en-us Sun, 21 Apr 2024 00:00:00 +0000 Debug Forecasts with Animated Plots /note/2024/04/21/animated-forecasts/ Sun, 21 Apr 2024 00:00:00 +0000 /note/2024/04/21/animated-forecasts/ Speaking of GIFs, animated visualizations of rolling forecasts are eye-opening to the impact of individual observations, the number of observations, and default settings on a model’s forecasts. In the example below, the default forecast::auto.arima() transitions between poor model specifications until it can finally pick up the seasonality after 24 observations, only to generate a negative point forecast despite purely non-negative observations. Fantastic way to understand forecast methods’ edge-case behavior. Reliably Forecasting Time-Series in Real Time /linked/2024/04/15/masson-pydata/ Mon, 15 Apr 2024 00:00:00 +0000 /linked/2024/04/15/masson-pydata/ Straight from my YouTube recommendations, a PyData London 2018 (!) presentation by Charles Masson of Datadog. To predict whether server metrics cross a threshold, he builds a method model that focuses on being robust to all the usual issues of anomalies and structural breaks. He keeps it simple, interpretable, and–for the sake of real-time forecasting–fast. Good stuff all around. The GIFs are the cherry on top. Chronos: Learning the Language of Time Series /linked/2024/03/27/chronos-forecasting/ Wed, 27 Mar 2024 00:00:00 +0000 /linked/2024/03/27/chronos-forecasting/ Ansari et al. (2024) introduce their Chronos model family on Github: Chronos is a family of pretrained time series forecasting models based on language model architectures. A time series is transformed into a sequence of tokens via scaling and quantization, and a language model is trained on these tokens using the cross-entropy loss. Once trained, probabilistic forecasts are obtained by sampling multiple future trajectories given the historical context. The whole thing is very neat. Average Temperatures by Month Instead of Year /linked/2024/03/25/average-temperatures-by-month-not-year/ Mon, 25 Mar 2024 00:00:00 +0000 /linked/2024/03/25/average-temperatures-by-month-not-year/ This tweet is a prime example for why it’s hard to analyze one signal in a time series (here, its trend) without simultaneously adjusting for its other signal components (here, its seasonality). If the tweet gets taken down, perhaps this screenshot on Mastodon remains. AI Act Article 17 - Quality Management System /note/2024/03/24/ai-act-article-17/ Sun, 24 Mar 2024 00:00:00 +0000 /note/2024/03/24/ai-act-article-17/ Article 17 of the AI Act adopted by the EU Parliament is the ideal jump-off point into other parts of the legislation. While article 16 lists the “Obligations of providers of high-risk AI systems”, Article 17 describes the main measure by which providers can ensure compliance: the quality management system. That system shall be documented in a systematic and orderly manner in the form of written policies, procedures and instructions, and shall include at least the following aspects […] Demystifying the Draft EU AI Act /linked/2024/03/16/demystifying-the-draft-eu-ai-act/ Sat, 16 Mar 2024 00:00:00 +0000 /linked/2024/03/16/demystifying-the-draft-eu-ai-act/ Speaking of AI Act details, the paper “Demystifying the Draft EU AI Act” (Veale and Borgesius, 2021) has been a real eye-opener and fundamental to my understanding of the regulation.1 Different than most coverage of the regulation, the two law researchers highlight the path by which EU law eventually impacts practice: Via standards and company-internal self-assessments. This explains why you will be left wondering what human oversight and technical robustness mean after reading the AI Act. AI Act Approved by EU Parliament /note/2024/03/14/ai-act-adopted-by-eu-parliament/ Thu, 14 Mar 2024 00:00:00 +0000 /note/2024/03/14/ai-act-adopted-by-eu-parliament/ The EU AI Act has finally come to pass, and pass it did with 523 of 618 votes of the EU Parliament in favor. The adopted text (available as of writing as PDF or Word document—the latter is much easier to work with!) has seen a number of changes since the original proposal by the EU Commission in 2021. For example, the current text reduces the set of systems considered high-risk somewhat by excluding those that are “not materially influencing the outcome of decision making” (Chapter III, Section 1, Article 6, Paragraph 3) except for those already covered by EU regulation such as medical devices and elevators. Stop Using Dynamic Time Warping for Business Time Series /note/2023/12/12/stop-using-dtw/ Tue, 12 Dec 2023 00:00:00 +0000 /note/2023/12/12/stop-using-dtw/ Dynamic Time Warping (DTW) is designed to reveal inherent similarity between two time series of similar scale that was obscured because the time series were shifted in time or sampled at different speeds. This makes DTW useful for time series of natural phenomena like electrocardiogram measurements or recordings of human movements, but less so for business time series such as product sales. To see why that is, let’s first refresh our intuition of DTW, to then check why DTW is not the right tool for business time series. Comes with Anomaly Detection Included /note/2023/12/03/anomaly-detection-included/ Sun, 03 Dec 2023 00:00:00 +0000 /note/2023/12/03/anomaly-detection-included/ A powerful pattern in forecasting is that of model-based anomaly detection during model training. It exploits the inherently iterative nature of forecasting models and goes something like this: Train your model up to time step t based on data [1,t-1] Predict the forecast distibution at time step t Compare the observed value against the predicted distribution at step t; flag the observation as anomaly if it is in the very tail of the distribution Don’t update the model’s state based on the anomalous observation For another description of this idea, see, for example, Alexandrov et al. Code Responsibly /note/2023/11/12/code-responsibly/ Sun, 12 Nov 2023 00:00:00 +0000 /note/2023/11/12/code-responsibly/ There exists this comparison of software before and software after machine learning. Before machine learning, code was deterministic: Software engineers wrote code, the code included conditions with fixed thresholds, and at least in theory the program was entirely understandable. After machine learning, code is no longer deterministic. Instead of software engineers instantiating it, the program’s logic is determined by a model and its parameters. Those parameters are not artisinally chosen by a software engineer but learned from data. A Flexible Model Family of Simple Forecast Methods /post/2023/10/19/threedx/ Thu, 19 Oct 2023 00:00:00 +0000 /post/2023/10/19/threedx/ Introducing a flexible model family that interpolates between simple forecast methods to produce interpretable probabilistic forecasts of seasonal data by weighting past observations. In business forecasting applications for operational decisions, simple approaches are hard-to-beat and provide robust expectations that can be relied upon for short- to medium-term decisions. They’re often better at recovering from structural breaks or modeling seasonal peaks than more complicated models, and they don’t overfit unrealistic trends. Video: Tim Januschowski, ISF 2023 Practitioner Speaker /linked/2023/10/05/ecb-forecasts/ Thu, 05 Oct 2023 00:00:00 +0000 /linked/2023/10/05/ecb-forecasts/ We don’t have enough presentations of industry practitioners discussing the detailed business problems they’re addressing and what solutions and trade-offs they were able to implement. Tim Januschoswki did just that, though, in his presentation at the International Symposium on Forecasting 2023. He discusses demand forecasting for optimal pricing at Zalando. Presentations such as this one are rare opportunites to peak at the design of real world solutions. My favorite quote: 'ECB Must Accept Forecasting Limitations to Restore Trust' /linked/2023/09/11/ecb-forecasts/ Mon, 11 Sep 2023 00:00:00 +0000 /linked/2023/09/11/ecb-forecasts/ Christine Lagarde, president of the European Central Bank, declared her intent to communicate the shortcomings of the ECB’s forecasts better—and in doing so, provides applied data science lessons for the rest of us. As quoted by the Financial Times: “Even if these [forecast] errors were to deplete trust, we can mitigate this if we talk about forecasts in a way that is both more contingent and more accessible, and if we provide better explanations for those errors,” Lagarde said. In Search of Verifiability: Explanations Rarely Enable Complementary Performance in AI-Advised Decision Making /linked/2023/05/31/in-search-of-verifiability/ Wed, 31 May 2023 00:00:00 +0000 /linked/2023/05/31/in-search-of-verifiability/ Raymond Fok and Daniel S. Weld in a recent Arxiv preprint: We argue explanations are only useful to the extent that they allow a human decision maker to verify the correctness of an AI’s prediction, in contrast to other desiderata, e.g., interpretability or spelling out the AI’s reasoning process. This does ring true to me: Put yourself into the position of an employee of Big Company Inc. whose task it is to allocate marketing budgets, to purchase product inventory, or to perform any other monetary decision as part of a business process. Explainability Washing /linked/2023/05/29/explainability-washing/ Mon, 29 May 2023 00:00:00 +0000 /linked/2023/05/29/explainability-washing/ Upol Ehsan ponders on Mastodon: Explainable AI suffers from an epidemic. I call it Explainability Washing. Think of it as window dressing–techniques, tools, or processes created to provide the illusion of explainability but not delivering it. Ah yes, slapping feature importance values onto a prediction and asking your users “Are you not entertained?”. This thread pairs well with Rick Saporta’s presentation. Both urge you to focus solely on your user’s decision when deciding what to build. A Framework for Data Product Management for Increasing Adoption & User Love /linked/2023/05/29/saporta-data-product-success/ Mon, 29 May 2023 00:00:00 +0000 /linked/2023/05/29/saporta-data-product-success/ You might have heard this one before: To build successful data products, focus on the decisions your customers make. But when was the last time you considered “how your work get[s] converted into action”? At Data Council 2023, Rick Saporta lays out a framework of what data products to build and how to make them successful with customers. He goes beyond the platitudes, his advice sounds hard-earned. Slides are good, talk is great. The 2-by-2 of Forecasting /note/2023/05/20/two-by-two-of-forecasting/ Sat, 20 May 2023 00:00:00 +0000 /note/2023/05/20/two-by-two-of-forecasting/ False Positives and False Negatives are traditionally a topic in classification problems only. Which makes sense: There is no such thing as a binary target in forecasting, only a continuous range. There is no true and false, only a continuous scale of wrong. But there lives an MBA student in me who really likes 2-by-2 charts, so let’s come up with one for forecasting. The {True,False}x{Positive,Negative} confusion matrix is the one opportunity for university professors to discuss the stakeholders of machine learning systems. Bayesian Intermittent Demand Forecasting at NeurIPS 2016 /linked/2023/03/25/seeger-neurips-2016/ Sat, 25 Mar 2023 00:00:00 +0000 /linked/2023/03/25/seeger-neurips-2016/ Oldie but a goodie: A recording of Matthias Seeger’s presentation of “Bayesian Intermittent Demand Forecasting for Large Inventories” at NeurIPS 2016. The corresponding paper is a favorite of mine, but I only now stumbled over the presentation. It sparked an entire catalogue of work on time series forecasting by Amazon, and like few others called out the usefulness of sample paths. On the Factory Floor /linked/2023/02/26/on-the-factory-floor/ Sun, 26 Feb 2023 00:00:00 +0000 /linked/2023/02/26/on-the-factory-floor/ What works at Google-scale is not the pattern most data scientists need to employ at their work. But the paper “On the Factory Floor: ML Engineering for Industrial-Scale Ads Recommendation Models” is the kind of paper that we need more of: Thrilling reports of what works in practice. Also, the authors do provide abstract lessons anyone can use, such as considering the constraints of your problem rather than using whatever is state-of-the-art: SAP Design Guidelines for Intelligent Systems /linked/2023/01/08/sap-design-guidelines/ Sun, 08 Jan 2023 00:00:00 +0000 /linked/2023/01/08/sap-design-guidelines/ From SAP’s Design Guidelines for Intelligent Systems: High–stakes decisions are more common in a professional software environment than in everyday consumer apps, where the consequences of an action are usually easy to anticipate and revert. While the implications of recommending unsuitable educational content to an employee are likely to be minimal, recommendations around critical business decisions can potentially cause irreversible damage (for example, recommending an unreliable supplier or business partner, leading to the failure or premature termination of a project or contract). Skillful Image Fast-Forwarding /linked/2023/01/02/darksky-weather-prediction/ Mon, 02 Jan 2023 00:00:00 +0000 /linked/2023/01/02/darksky-weather-prediction/ Russel Jacobs for Slate on the discontinued Dark Sky weather app, via Daring Fireball: Indeed, Dark Sky’s big innovation wasn’t simply that its map was gorgeous and user-friendly: The radar map was the forecast. Instead of pulling information about air pressure and humidity and temperature and calculating all of the messy variables that contribute to the weather–a multi-hundred-billion-dollars-a-year international enterprise of satellites, weather stations, balloons, buoys, and an army of scientists working in tandem around the world (see Blum’s book)–Dark Sky simply monitored changes to the shape, size, speed, and direction of shapes on a radar map and fast-forwarded those images. ChatGPT and ML Product Management /post/2022/12/09/chatgpt-and-product-management/ Fri, 09 Dec 2022 00:00:00 +0000 /post/2022/12/09/chatgpt-and-product-management/ Huh, look at that, OpenAI’s ChatGPT portrays absolute confidence while giving plain wrong answers. But ChatGPT also does provide helpful responses a large number of times. So one kind of does want to use it. Sounds an awful lot like every other machine learning model deployed in 2022. But really, how do we turn fallible machine learning models into products to be used by humans? Not by injecting its answers straight into StackOverflow. GluonTS Workshop at Amazon Berlin on September 29 /linked/2022/09/28/gluonts-workshop/ Wed, 28 Sep 2022 00:00:00 +0000 /linked/2022/09/28/gluonts-workshop/ The workshop will revolve around tools that automatically transform your data, in particular time series, into high-quality predictions based on AutoML and deep learning models. The event will be hosted by the team at AWS that develops AutoGluon, Syne Tune and GluonTS, and consist of a mix of tutorial-style presentation on the tools, discussion, and contributions from external partners on their applications. Unique opportunity to hear from industry practitioners and GluonTS developers in person or by joining online. Design a System, not an "AI" /linked/2022/09/14/design-a-system-not-an-ai/ Wed, 14 Sep 2022 00:00:00 +0000 /linked/2022/09/14/design-a-system-not-an-ai/ Ryxcommar on Twitter: I think one of the bigger mistakes people make when designing AI powered systems is seeing them as an AI first and foremost, and not as a system first and foremost. Once you have your API contracts in place, the AI parts can be seen as function calls inside the system. Maybe your first version of these functions just return an unconditional expected value. But the system is the bulk of the work, the algorithm is a small piece. Berlin Bayesians Meetup on September 27 /linked/2022/09/06/berlin-bayesians-meetup/ Tue, 06 Sep 2022 00:00:00 +0000 /linked/2022/09/06/berlin-bayesians-meetup/ The Berlin Bayesians meetup is happening again in-person. Juan Orduz is going to present Buy ‘Til You Die models implemented in PyMC: In this talk, we introduce a certain type of customer lifetime models for the non-contractual setting commonly known as BTYD (Buy Till You Die) models. We focus on two sub-model components: the frequency BG/NBD model and the monetary gamma-gamma model. We begin by introducing the model assumptions and parameterizations. Legible Forecasts, and Design for Contestability /post/2022/08/31/legible-forecasts/ Wed, 31 Aug 2022 00:00:00 +0000 /post/2022/08/31/legible-forecasts/ Some models are inherently interpretable because one can read their decision boundary right off them. In fact, you could call them interpreted as there is nothing left for you to interpret: The entire model is written out for you to read. For example, assume it’s July and we need to predict how many scoops of ice cream we’ll sell next month. The Seasonal Naive method tells us: AS 12 MONTHS AGO IN August, PREDICT sales = 3021. Where Is the Seasonal Naive Benchmark? /post/2022/08/30/where-is-the-seasonal-naive-benchmark/ Tue, 30 Aug 2022 00:00:00 +0000 /post/2022/08/30/where-is-the-seasonal-naive-benchmark/ Yesterday morning, I retweeted this tweet by sklearn_inria that promotes a scikit-learn tutorial notebook on time-related feature engineering. It’s a neat notebook that shows off some fantastic ways of creating features to predict time series within a scikit-learn pipeline. There are, however, two things that irk me: All features of the dataset including the hourly weather are passed to the model. I don’t know the details of this dataset, but skimming what I believe to be its description on the OpenML repository, I suspect this might introduce data leakage as in reality we can’t know the exact hourly humidity and temperature days in advance. When Quantiles Do Not Suffice, Use Sample Paths Instead /post/2022/07/25/use-sample-paths/ Mon, 25 Jul 2022 00:00:00 +0000 /post/2022/07/25/use-sample-paths/ I don’t need to convince you that you should absolutely, to one hundred percent, quantify your forecast uncertainty—right? We agree about the advantages of using probabilistic measures to answer questions and to automate decision making—correct? Great. Then let’s dive a bit deeper. So you’re forecasting not just to fill some numbers in a spreadsheet, you are trying to solve a problem, possibly aiming to make optimal decisions in a process concerned with the future. Be Skeptical of the t-SNE Bunny /linked/2022/07/11/tsne-bunny/ Mon, 11 Jul 2022 00:00:00 +0000 /linked/2022/07/11/tsne-bunny/ Matt Henderson on Twitter (click through for the animation): Be skeptical of the clusters shown in t-SNE plots! Here we run t-SNE on a 3d shape - it quickly invents some odd clusters and structures that aren’t really present in the original bunny. What would happen if every machine learning method would come with a built-in visualization of the spurious results that it found? Never mind the the answer to that question. Failure Modes of State Space Models /post/2022/07/05/state-space-model-intricacies/ Tue, 05 Jul 2022 00:00:00 +0000 /post/2022/07/05/state-space-model-intricacies/ State space models are great, but they will fail in predictable ways. Well, claiming that they “fail” is a bit unfair. They actually behave exactly as they should given the input data. But if the input data fails to adhere to the Normal assumption or lacks stationarity, then this will affect the prediction derived from the state space models in perhaps unexpected yet deterministic ways. This article ensures that none of us is surprised by these “failure modes”. Approach to Estimate Uncertainty Distributions of Walmart Sales /linked/2021/12/29/uncertainty-distributions-of-walmart-sales/ Wed, 29 Dec 2021 00:00:00 +0000 /linked/2021/12/29/uncertainty-distributions-of-walmart-sales/ We present our solution for the M5 Forecasting - Uncertainty competition. Our solution ranked 6th out of 909 submissions across all hierarchical levels and ranked first for prediction at the finest level of granularity (product-store sales, i.e. SKUs). The model combines a multi-stage state-space model and Monte Carlo simulations to generate the forecasting scenarios (trajectories). Observed sales are modelled with negative binomial distributions to represent discrete over-dispersed sales. Seasonal factors are hand-crafted and modelled with linear coefficients that are calculated at the store-department level. On Google Maps Directions /post/2021/10/03/on-google-maps-directions/ Sun, 03 Oct 2021 00:00:00 +0000 /post/2021/10/03/on-google-maps-directions/ Google Maps and its Directions feature are the kind of data science product everyone wished they’d be building. It augments the user, enabling decision-making while driving. Directions exemplifies the difference between prediction and prescription. Google Maps doesn’t just expose data, and it doesn’t provide a raw analysis by-product like SHAP values. It processes historical and live data to predict the future and to optimize my route based on it, returning only the refined recommendations. Forecasting Uncertainty Is Never Too Large /note/2021/09/01/forecasting-uncertainty-is-never-too-large/ Wed, 01 Sep 2021 00:00:00 +0000 /note/2021/09/01/forecasting-uncertainty-is-never-too-large/ Rob J. Hyndman gave a presentation titled “Uncertain futures: what can we forecast and when should we give up?” as part of the ACEMS public lecture series with recording available on Youtube. He makes an often underappreciated point around minute 50 of the talk: When the forecast uncertainty is too large to assist decision making? I don’t think that’s ever the case. Forecasting uncertainty being too large does assist decision making by telling the decision makers that the future is very uncertain and they should be planning for lots of different possible outcomes and not assuming just one outcome or another. What Needs to Prove True for This to Work? /post/2021/08/12/what-needs-to-prove-true-for-this-to-work/ Thu, 12 Aug 2021 00:00:00 +0000 /post/2021/08/12/what-needs-to-prove-true-for-this-to-work/ Data science projects are a tricky bunch. They entice you with challenging problems and promise a huge return if successful. In contrast to more traditional software engineering projects, however, data science projects entail more upfront uncertainty: You’ll not know until you tried whether the technology is good enough to solve the problem. Consequently, a data science endeavor fails more often, or doesn’t turn out to be the smash hit you and your stakeholders expected it to be. Everything is an AI Technique /note/2021/05/02/everything-is-an-ai-technique/ Sun, 02 May 2021 00:00:00 +0000 /note/2021/05/02/everything-is-an-ai-technique/ Along with their proposal for regulation of artificial intelligence, the EU published a definition of AI techniques. It includes everything, and that’s great! From the proposal’s Annex I: ARTIFICIAL INTELLIGENCE TECHNIQUES AND APPROACHES referred to in Article 3, point 1 (a) Machine learning approaches, including supervised, unsupervised and reinforcement learning, using a wide variety of methods including deep learning; (b) Logic- and knowledge-based approaches, including knowledge representation, inductive (logic) programming, knowledge bases, inference and deductive engines, (symbolic) reasoning and expert systems; (c) Statistical approaches, Bayesian estimation, search and optimization methods. Resilience, Chaos Engineering and Anti-Fragile Machine Learning /note/2021/01/01/resilience-chaos-engineering-and-anti-fragile-machine-learning/ Fri, 01 Jan 2021 00:00:00 +0000 /note/2021/01/01/resilience-chaos-engineering-and-anti-fragile-machine-learning/ In his interview with The Observer Effect, Tobi Lütke, CEO of Shopify, describes how Shopify benefits from resilient systems: Most interesting things come from non-deterministic behaviors. People have a love for the predictable, but there is value in being able to build systems that can absorb whatever is being thrown at them and still have good outcomes. So, I love Antifragile, and I make everyone read it. Paper Stack /paper-stack/ Sun, 08 Nov 2020 00:00:00 +0000 /paper-stack/ This file serves as a list of all papers I have come across that sound interesting enough to potentially read them eventually. Please note that I started the list in October 2020. As I only fill it up as I go, it does not reflect everything I have read, alas. Inspiration: Cosma Shalizi’s notebooks. format: authors, year, title, source, link, added_date, modified_date, tags Have Read Steven L. Scott, Hal Varian (2013). Embedding Many Time Series via Recurrence Plots /post/2020/06/14/embedding-many-time-series-via-recurrence-plots/ Sun, 14 Jun 2020 00:00:00 +0000 /post/2020/06/14/embedding-many-time-series-via-recurrence-plots/ We demonstrate how recurrence plots can be used to embed a large set of time series via UMAP and HDBSCAN to quickly identify groups of series with unique characteristics such as seasonality or outliers. The approach supports exploratory analysis of time series via visualization that scales poorly when combined with large sets of related time series. We show how it works using a Walmart dataset of sales and a Citi Bike dataset of bike rides. Rediscovering Bayesian Structural Time Series /post/2020/06/07/rediscovering-bayesian-structural-time-series/ Sun, 07 Jun 2020 00:00:00 +0000 /post/2020/06/07/rediscovering-bayesian-structural-time-series/ This article derives the Local-Linear Trend specification of the Bayesian Structural Time Series model family from scratch, implements it in Stan and visualizes its components via tidybayes. To provide context, links to GAMs and the prophet package are highlighted. The code is available here. I tried to come up with a simple way to detect “outliers” in time series. Nothing special, no anomaly detection via variational auto-encoders, just finding values of low probability in a univariate time series. Are You Sure This Embedding Is Good Enough? /post/2020/04/26/are-you-sure-this-embedding-is-good-enough/ Sun, 26 Apr 2020 00:00:00 +0000 /post/2020/04/26/are-you-sure-this-embedding-is-good-enough/ Suppose you are given a data set of five images to train on, and then have to classify new images with your trained model. Five training samples are in general not sufficient to train a state-of-the-art image classification model, thus this problem is hard and earned it’s own name: few-shot image classification. A lot has been written on few-shot image classification and complex approaches have been suggested.1 Tian et al. The Causal Effect of New Year's Resolutions /post/2020/01/18/the-causal-effect-of-new-years-resolutions/ Sat, 18 Jan 2020 00:00:00 +0000 /post/2020/01/18/the-causal-effect-of-new-years-resolutions/ We treat the turn of the year as an intervention to infer the causal effect of New Year’s resolutions on McFit’s Google Trend index. By comparing the observed values from the treatment period against predicted values from a counterfactual model, we are able to derive the overall lift induced by the intervention. Throughout the year, people’s interest in a McFit gym membership appears quite stable.1 The following graph shows the Google Trend for the search term “McFit” in Germany for April 2017 to until the week of December 17, 2017. satRday Berlin Presentation /post/2019/06/16/satrday-berlin-presentation/ Sun, 16 Jun 2019 00:00:00 +0000 /post/2019/06/16/satrday-berlin-presentation/ My satRday Berlin slides on “Modeling Short Time Series” are available here. This saturday, June 15, Berlin had its first satRday conference. I eagerly followed the hashtags of satRday Amsterdam last year and satRday Capetown the year before that on Twitter. Thanks to Noa Tamir, Jakob Graff, Steve Cunningham, and many others, we got a conference in Berlin as well. When I saw the call for papers, I jumped at the opportunity to present, trying what it feels like to be on the other side of the microphone; being in the hashtag instead of following it. Modeling Short Time Series with Prior Knowledge /post/2019/04/16/modeling-short-time-series-with-prior-knowledge/ Tue, 16 Apr 2019 00:00:00 +0000 /post/2019/04/16/modeling-short-time-series-with-prior-knowledge/ I just published a longer case study, Modeling Short Time Series with Prior Knowledge: What ‘Including Prior Information’ really looks like. It is generally difficult to model time series when there is insuffient data to model a (suspected) long seasonality. We show how this difficulty can be overcome by learning a seasonality on a different, long related time series and transferring the posterior as a prior distribution to the model of the short time series. The Probabilistic Programming Workflow /post/2019/03/23/the-probabilistic-programming-workflow/ Sat, 23 Mar 2019 00:00:00 +0000 /post/2019/03/23/the-probabilistic-programming-workflow/ Last week, I gave a presentation about the concept of and intuition behind probabilistic programming and model-based machine learning in front of a general audience. You can read my extended notes here. Drawing on ideas from Winn and Bishop’s “Model-Based Machine Learning” and van de Meent et al.’s “An Introduction to Probabilistic Programming”, I try to show why the combination of a data-generating process with an abstracted inference is a powerful concept by walking through the example of a simple survival model. Problem Representations and Model-Based Machine Learning /post/2019/02/24/problem-representations-and-model-based-machine-learning/ Sun, 24 Feb 2019 00:00:00 +0000 /post/2019/02/24/problem-representations-and-model-based-machine-learning/ Back in 2003, Paul Graham, of Viaweb and Y Combinator fame, published an article entitled “Better Bayesian Filtering”. I was scrolling chronologically through his essays archive the other day when this article stuck out to me (well, the “Bayesian” keyword). After reading the first few paragraphs, I was a little disappointed to realize the topic was Naive Bayes rather than Bayesian methods. But it turned out to be a tale of implementing a machine learning solution for a real world application before anyone dared to mention AI in the same sentence. Videos from PROBPROG 2018 Conference /note/2018/11/11/probprog-videos/ Sun, 11 Nov 2018 00:00:00 +0000 /note/2018/11/11/probprog-videos/ Videos of the talks given at the International Conference on Probabilistic Programming (PROBPROG 2018) back in October were published a few days ago and are now available on Youtube. I have not watched all presentations yet, but a lot of big names attended the conference so there should be something for everyone. In particular the talks by Brooks Paige (“Semi-Interpretable Probabilistic Models”) and Michael Tingley (“Probabilistic Programming at Facebook”) made me curious to explore their topics more. Videos from Exploration in RL Workshop at ICML /note/2018/09/30/videos-from-exploration-in-rl-icml-workshop/ Sun, 30 Sep 2018 00:00:00 +0000 /note/2018/09/30/videos-from-exploration-in-rl-icml-workshop/ One of the many fantastic workshops at ICML this year was the Exploration in Reinforcement Learning workshop. All talks were recorded and are now available on Youtube. Highlights include presentations by Ian Osband, Emma Brunskill, and Csaba Szepesvari, among others. You can find the workshop’s homepage here with more information and the accepted papers. SVD for a Low-Dimensional Embedding of Instacart Products /post/2018/07/25/svd-instacart-product-embedding/ Wed, 25 Jul 2018 00:00:00 +0000 /post/2018/07/25/svd-instacart-product-embedding/ Building on the Instacart product recommendations based on Pointwise Mutual Information (PMI) in the previous article, we use Singular Value Decomposition to factorize the PMI matrix into a matrix of lower dimension (“embedding”). This allows us to identify groups of related products easily. We finished the previous article with a long table where every row measured how surprisingly often two products were bought together according to the Instacart Online Grocery Shopping dataset. Pointwise Mutual Information for Instacart Product Recommendations /post/2018/06/17/instacart-products-bought-together/ Sun, 17 Jun 2018 00:00:00 +0000 /post/2018/06/17/instacart-products-bought-together/ Using pointwise mutual information, we create highly efficient “customers who bought this item also bought” style product recommendations for more than 8000 Instacart products. The method can be implemented in a few lines of SQL yet produces high quality product suggestions. Check them out in this Shiny app. Back in school, I was a big fan of the Detective Conan anime. For whatever reason, one of the episodes stuck with me. Pokémon Recommendation Engine /post/2017/07/01/pok%C3%A9mon-recommendation-engine/ Sat, 01 Jul 2017 00:00:00 +0000 /post/2017/07/01/pok%C3%A9mon-recommendation-engine/ Using t-SNE, I wrote a Shiny app that recommends similar Pokémon. Try it out here. Needless to say, I was and still am a big fan of the Pokémon games. So I was very excited to see that a lot of the meta data used in Pokémon games is available on Github due to the Pokémon API project. Data on Pokémon’s names, types, moves, special abilities, strengths and weaknesses is all cleanly organized in a few dozen csv files. Look At All These Links /post/2017/01/25/look-at-all-these-links/ Wed, 25 Jan 2017 00:00:00 +0000 /post/2017/01/25/look-at-all-these-links/ By now, some time has passed since NIPS 2016. Consequently, several recaps can be found on blogs. One of them is this one by Eric Jang. If you want to make your first steps in putting some of the theory presented at NIPS into practice, why not take a look at this slide deck about reinforcement learning in R? The RStudio Conference also took place, and apparently has been a blast. Multi-Armed Bandits at Tinder /note/2016/12/14/multi-armed-bandits-at-tinder/ Wed, 14 Dec 2016 00:00:00 +0000 /note/2016/12/14/multi-armed-bandits-at-tinder/ In a post on Tinder’s tech blog, Mike Hall presents a new application for multi-armed bandits. At Tinder, they started to use multi-armed bandits to optimize the photo of users that is shown first: While a user can have multiple photos in his profile, only one of them is shown first when another user swipes through the deck of user profiles. By employing an adapted epsilon-greedy algorithm, Tinder optimizes this photo for the “Swipe-Right-Rate”. Look At All These Links /post/2016/11/06/look-at-all-these-links/ Sun, 06 Nov 2016 00:00:00 +0000 /post/2016/11/06/look-at-all-these-links/ At Airbnb, the data science team has written their own R packages to scale with the company’s growth. The most basic achievement of the packages is the standardization of the work (ggplot and RMarkdown templates) and reduction of duplicate effort (importing data). New employees are introduced to the infrastructure with extensive workshops. This reminded me of a presentation by Hilary Parker in April at the New York R Conference on Scaling Analysis Responsibly. Three Types of Cluster Reproducibility /post/2016/06/14/three-types-of-cluster-reproducibility/ Tue, 14 Jun 2016 00:00:00 +0000 /post/2016/06/14/three-types-of-cluster-reproducibility/ Christian Hennig provides a function called clusterboot() in his R package fpc which I mentioned before when talking about assessing the quality of a clustering. The function runs the same cluster algorithm on several bootstrapped samples of the data to make sure that clusters are reproduced in different samples; it validates the cluster stability. In a similar vein, the reproducibility of clusterings with subsequent use for marketing segmentation is discussed in this paper by Dolnicar and Leisch. Assessing the Quality of a Clustering Solution /post/2016/05/30/assessing-the-quality-of-a-clustering-solution/ Mon, 30 May 2016 00:00:00 +0000 /post/2016/05/30/assessing-the-quality-of-a-clustering-solution/ During one of the talks at PyData Berlin, a presenter quickly mentioned a k-means clustering used to group similar clothing brands. She commented that it wasn’t perfect, but good enough and the result you would expect from a k-means clustering. There remains the question, however, how one can assess whether a clustering is “good enough”. In above case, the number of brands is rather small, and simply by looking at the groups one is able to assess whether the combination of Tommy Hilfiger and Marc O’Polo is sensible. Taxi Pulse of New York City /post/2015/09/21/taxi-pulse-of-new-york-city/ Mon, 21 Sep 2015 00:00:00 +0000 /post/2015/09/21/taxi-pulse-of-new-york-city/ I don’t know about you, but I think taxi data is fascinating. There is a lot you can do with the data sets as they usually contain observations on geolocation as well as time stamps besides other information, which makes them unique. Geolocation and timestamps alone, as well as the large number of observations in cities like New York enable you to create stunning visualizations that aren’t possible with any other set of data. Analyzing Taxi Data to Create a Map of New York City /post/2015/08/26/analyzing-taxi-data-to-create-a-map-of-new-york-city/ Wed, 26 Aug 2015 00:00:00 +0000 /post/2015/08/26/analyzing-taxi-data-to-create-a-map-of-new-york-city/ Yet another day was spent working on the taxi data provided by the NYC Taxi and Limousine Commission (TLC). My goal in working with the data was to create a plot that maps the streets of New York using the geolocation data that is provided for the taxis’ pickup and dropoff locations as longitude and latitude values. So far, I had only used the dataset for January of 2015 to plot the locations; also, I hadn’t used the more than 12 million observations in January alone but a smaller sample (100000 to 500000 observations). About Minimize Regret /about/ Mon, 01 Jan 0001 00:00:00 +0000 /about/ Hi, my name is Tim. You’re reading Minimize Regret, the site where I write about things I’ve recently learned. I live in Berlin, where I work as a data scientist. I’m interested in quantifying uncertainty, and optimal decision making under uncertainty. Fittingly, topics such as probabilistic programming, reinforcement learning and stochastic optimal control, as well as time series theory and applications are near and dear to my heart. I recently finished my master’s in statistics and graduated with honors. Blogroll /blogroll/ Mon, 01 Jan 0001 00:00:00 +0000 /blogroll/ Statistical Modeling, Causal Inference, and Social Science by Andrew Gelman and others Hyndsight by Rob J. Hyndman Three-Toed Sloth by Cosma Shalizi Worry Dream by Bret Victor Unofficial Google Data Science I’m a Bandit by Sébastian Bubeck Michael Betancourt arg min (now Substack), previously arg min (a blog) by Benjamin Recht David Stutz by David Stutz The Learning Theory Alliance Blog