Sergiu Hart / papers / "Calibeating": Beating Forecasters at Their Own Game

"Calibeating": Beating Forecasters at Their Own Game

Dean P. Foster and Sergiu Hart



   
   
(Acrobat PDF files)

(*) Corrections
Multi-Calibeating and Stronger Experts

Since the refinement score is the minimal Brier score over all relabelings of the bins (see (2) and the last paragraph of Section 2), By comparison, when all forecasts are probability distributions on \(A,\) i.e., all \(B^{n}\) are subsets of \(\Delta(A)\), Foster (1991) defines (see (#) below): Thus: calibeating is stronger than being the "stronger expert."


(#) In the expansive literature on experts, these notions are referred to as "prediction with no regret"; see Appendix A.9 of the full paper for the parallel results with the logarithmic scoring rule.

Abstract

In order to identify expertise, forecasters should not be tested by their calibration score, which can always be made arbitrarily small, but rather by their Brier score. The Brier score is the sum of the calibration score and the refinement score; the latter measures how good the sorting into bins with the same forecast is, and thus attests to "expertise." This raises the question of whether one can gain calibration without losing expertise, which we refer to as "calibeating." We provide an easy way to calibeat any forecast, by a deterministic online procedure. We moreover show that calibeating can be achieved by a stochastic procedure that is itself calibrated, and then extend the results to simultaneously calibeating multiple procedures, and to deterministic procedures that are continuously calibrated.







   


Last modified:
© Sergiu Hart