Bayesian statistics is an alternative to classical statistics. Classical stats is the one you're probably familiar with - confidence intervals, significance levels, p-values, estimators, and overfitting. Bayesian learning is more theoretically unified and optimal, and automatically builds in a preference for model simplicity i.e. doesn't overfit. Before computers and sampling methods for marginalizing probability distributions (evaluating integrals), Bayesian learning was usually intractable except in some special cases. Bayesian learning models have still been adopted slowly because the math can look hard- in Bayesian models you usually add more variables (i.e. greek letters) than the ones you start with.
The following are some of the papers I read last week (the good ones).
This Bayesian Vector Autoregression paper is just a straightforward example from the economic prediction literature. It has very positive results, another algorithm to keep in mind even though it's from 1986.
These last two, on stacking, are really interesting if you've thought about the problem of combining multiple signals/systems, especially ones that are likely somewhat correlated. Stacking is sort of like crossvalidation, but for optimizing ensembles of models instead of a single model. The literature on ensembles/combining multiple learners has some really interesting unexplained results - especially the obvious one - why does it even improve the overall accuracy of the individual models? These papers on stacking shed some light on it. Tibshirani & LeBlanc 1993; Breiman 1996.
I read Ralph Vince's new book, The Leverage Space Trading Model, this evening. It was released very recently on May 26th '09. Previously I read one of his older books, The Handbook of Portfolio Mathematics. Vince writes about money management, i.e. position sizing, which tries to answer the question, "How much of my capital should I bet on any given trade in order to maximize my wealth over time".
