I read the Ph.D thesis of Cengiz Y. Belentepe titled "A Statistical View of Universal Portfolios" because I'd been looking for a concrete paper on universal portfolios (UP).


The UP is a portfolio allocation scheme that is supposed to perform as well as the best constantly rebalanced portfolio constructed with the benefit of hindsight. A constantly rebalanced portfolio is one where you choose certain weights for each asset and then rebalance your allocations whenever one rises or falls to match the original weights. With hindsight means you can choose the best portfolio among all fixed-weight portfolios knowing what the future will be like. The UP is supposed to be able to match this because it is a dynamic, rather than static, allocation scheme.

However, it's important to know that the UP is only supposed to perform as well in the sense of a lower bound. If you know computer science then this is like saying that the Big-O efficiency (in terms of profits not time) is the same. However, a constant multiple can be enormously important in investments.

Anyway, the thesis was really interesting (although I imagine that for a thesis committee it wouldn't be obfuscated by enough complicated and excessive but impressive math).


Here are some of the most interesting bits I learned with page numbers so you can look them up too (p. book/pdf).

1. Universal Portfolios are, down to an approximation, equivalent to Markowitz's mean-variance optimization. (p.36/52)

2. A constantly rebalanced portfolio is much more powerful than it appears. For example it can "turn two nags (assets A and B) into a thoroughbred (equity curve)":








In this case we have two assets that just go up and down with -1 correlation and our fixed weights are half of our equity in each asset, rebalanced at the end of every period. He also shows a similar result on real stocks. Essentially the equal-weight constantly rebalanced portfolio is the simplest mean-reversion strategy. (p.22/38)

3. There is a nice quote by Cover summarizing the UP on page 33/49

4. There is an extremely concise and practical explanation of two methods of intelligently estimating a covariance matrix - by exponential smoothing to decrease the influence of points in the past and eigendecomposition to shrink/regularize/decrease overfitting. (p.62/74)

5. The UP can be beaten by the most naive portfolio allocation scheme - uniform allocation. He humorously calls it the capitulation portfolio. Results on real stocks show that this can hold in the real world too. (p.63/75)

6. He uses Cover's own data to show that the UP is not that effective. (p.60/76)

7. And finally lists some good heuristic learning rates to try with an exponentiated gradient (EG) online learner. I won't go into details here (probably later) but EG is a powerful, efficient, theoretically pretty algorithm good for adversarial learning settings. I always like seeing heuristics because most academic papers are missing them but in practice they are so helpful. (p.66/82)

Overall I enjoyed it a lot more than I expected and it was an easy read. Please leave a comment if you know of any good sources on modern UP work.

13 comments:

Anonymous said...

Hi,
this is an interesting paper. Thks for mentioning it.
The first time I heard about UP was when reading the Research section of this site http://www.yats.com/
Regards

Max Dama said...

Thanks

vJD said...

There could be an interesting link between dynamic trading strategies and option payout functions/option pricing:

The famous Black-Scholes Framework is usally derived using a hedging approach where a self-financing portfolio is constructed and the resulting stochastic differential equation is being solved under some conditions.

The self-financing portfolio is basically a dynamic trading strategy where according to the actual price development and the strike of the replicated option parts of the underlying are being bought or sold. For example when you want to replicate a call you would have to buy when the price rises and sell when it develops in the direction of the strike.

Do you think it is possible to build a framework where you could match different dynamic trading strategies with derivatives. For example I would like to find the characteristics of a moving average approach in terms of a derivative, e.g. the P/L-diagram of this would in my opinion look like some kind of collar.

There should also be a corresponding P/L for the Universal Portfolio.

How would you do that?

First of all I guess in this example one could take the price process (e.g. a geometric brownian motion) and e.g. convolute that with a rectangular function to model the moving average. After that one would have to find the resulting distribution density function... but all of that is just speculation... (also quite literally ;-)

Has any one of you some ideas on how to do that - or are there even ready-to-use approaches (on the web, in articles, books, etc...)?

In the end one could use the complete powerful option pricing machinery on any kind of dynamic trading strategies!

Max Dama said...

Very interesting ideas vJD. I don't have time to see what the payoff diagrams might look like now but if you think about it further I'd really like to see what you come up with.

Regards,
Max

Li Bin said...

hi,

Following methods maybe useful, you can search their names for original papers.

1. Online Newton method;
2. Nonparametric kernel-based investment strategy;
3. Nearest neighbor strategy.
4. Anticor.

I like your blog. It is related with my research.

You comments on this topic is true. People in academic mainly focus on theoretically analysis (universality), while few guys focus on the heuristic methods even though heuristic methods generates much higher return than the theoretical bounded methods.

Max Dama said...

Li Bin,

Thanks a lot for the keywords. I need some more time to think about it before I have an opinion, but for others interested in the Anticor allocation algorithm, I thought this paper was a good introduction.

Regards,
Max

fle said...

Hi again,
I digging into the references mentioned at yats.com, I found this interesting site by Gyorfi, with lectures on UP : http://www.szit.bme.hu/~oti/portfolio/

Regards

Max Dama said...

fle,

Thanks for the link. However, I've seen Gyorfi in the literature a few times and tried watching his recorded lectures, for example this one, but he's not a good speaker and he overuses notation without having that many innovative ideas in his papers it seems to me.

Regards,
Max

Li Bin said...

Guys in machine learning try to minimize the regret between their strategies and the Best Constant Rebalanced Portfolios. It is a hot topic in online learning which is a sub field of machine learning. The most recent paper is 'Efficient learning algorithms for changing environments’ on ICML 2009 and 'on stochastic and worst-case models for investing' on NIPS 2009. Another important paper I have read is 'Algorithms for portfolio management based on the Newton Method' on ICML 2006.

Anticor is a heuristic algorithm which utilizes the cross correlation and auto correlation between the time windows. 'Can we learning to beat the best stock' on NIPS 2003 is the original paper.

Gyorfi's algorithms is powerful on experiments. However, personally I feel just so so on the algorithms.

Actually, if you have tried the algorithm with simple ideas such as mean-reverting, you will find that it outperforms all the existing universal or non-universal strategies. I don't know the reason why it happens.

Regards,
Bin

fle said...

Bin,
do you mean that an oscillator like a RSI could be used as a forecaster ?
and that a portfolio component weight could just be the accrude wealth for this particular component ?
because then a simple system would be : the position / importance of a commodity in the portfolio would depend on its RSI (or a combination of RSIs or other indicators) and would be weighted by the past performance of this commodity ?
it sounds so simple...
Regards

Bin said...

fle,

As far as I know, these indicators were used in some automatic trading system in the sense of a single asset, while universal portfolio and the methods I listed is in the sense of portfolio. They generate the buy/sell signals, however, the position is another issue, or money management.

For example, nonparametric kernel-based investment strategy (2006) adopts experts, each expert generates a portfolio according to idea of log-optimal portfolio based on the similar historical windows. Then it combines the portfolio of each expert into a final portfolio according to their historical performances.

Regards,

JJones07 said...

Hi Max,

I've been following your blog off and on for the past few months. You are a seriously smart and driven guy and I hope you make more $$$ than James Simon some day.

I'm going to check out some of these papers, I have the T. Cover book on Information theory which discusses UPs and log-optimal portfolios and their limitations. The bibliography of the book may be worth checking into also.

As far as this post, however, I like how you explain the strat. for converting a nag with perfect antithetical correlation into an equity profile. Can you comment, on why this is different from a pairs mean-reversion trading srat., if any? I haven't really checked recently, but GLD and .VIX have more or less moved in opposite directions since the beginning of last year. Perhaps these two would be a good test of the theory. What do you think?

Max Dama said...

Jones,

It's basically a mean-reversion strategy, yes. That could be a good pair to look at, I'm not sure.

Regards,
Max