Survivorship bias is a simple idea but it's a good time to bring it up after the previous notes on overfitting.
Survivorship bias exaggerates long-only strategies' performance during backtesting, especially those with a long holding period (years).
Most data sets will not include companies that went bankrupt or were delisted because their share prices went too low. Survivorship bias describes how the companies that we see today are those that did the best in the past. The failures have disappeared.
For ex. if you take ten years of data for all stocks currently included in the S&P 500 the performance will be better than the actual ten year return of the S&P 500.
Another ex. is if your strategy only applies to one stock, then the model you build of it based on historical data will not include the possibility of bankruptcy. Bankruptcy from the perspective of a time series is a floor on the semi-random walk. Maybe the model predicts the stock will drop by $1 when it is at $0.25, not understanding that it can't go below $0.00. This is not the usual usage of survivorship bias as I explained above but it can be generalized.
I know some professional system development & backtesting products such as MarketQA use complete data sets to eliminate potential survivorship bias. For a retail trader all you can do is keep it in mind. Fortunately survivorship bias's influence is not too high on most algorithmic systems which are usually high-frequency or at least hold for <1 week. Please share if you have a simple solution to this problem or if you have a quantitative example of this bias from previous work, preferably illustrated by a chart. Unfortunately this note was mostly generalizations and words.
Topic:
Automated Trading,
Data
Subscribe to:
Post Comments (Atom)
12 comments:
Paul,
What kind of strategy are you using?
Sorry, I had to delete you comment for language:
Hey _ how are you doing in the olympiad, Im up 2.5% right now
~Paul L
Regards,
Max
Max,
is better a degree inn Math or computer for make "quant" trading sistems???
Thanks,
Steve
Steve,
Probably computer science. I'm not the most qualified to answer though.
Regards,
Max
Max, hope you are doing well in the contest.
I am deficient when it comes to math/computer skills, so trying to navigate statistical measure to evaluate systems is fascinating but challenging for me. Your blog is fantastic but unfortunately much of the math and terminology is way over my head. One day at a time, right?
http://www.verticalsolutions.com/notes/control_charts_and_forecasting.html
The link above I find very interesting and I was wondering if this is a simple explanation of some of methods you are using to build and evaluate systems?
Regards,
Woodshredder,
I don't use anything like that. It's a good idea though and in the future I would like to try it. The control chart looks like it comes from operations research. You can think of it as being similar to bollinger bands, but when it breaks out that's a sign the system is malfunctioning.
Regards,
Max
a good example of survivor bias is spoken about in Chan's book. In it he describes selecting stocks to go long that are down a significant amount "he predicts a bounce" longer term. I selecting extreme downtrending stocks the "survivors" are by definition those that bounced or didn't go bankrupt. The dotcom era is a great example of this. If you select equities with major declines in up to 2001, then buy, the surivor filter will have a significant data skew.
check out www.portfolio123.com
I like it much better than Zacks backtester which I tried.
I'm getting Clarifi soon and look forward to trying it.
Thanks for the tip sc.
Nick,
I was a bit disappointed with Chan's book. It was very light.
I looked at your profile on LinkedIn, you have an interesting background.
Regards,
Max
With respect to equities trading system design, survivorship bias is a very significant issue. Most stocks underperform the indexes they are members of and a large number of them lose virtually all their value. Some details in this paper:
http://michaelcovel.com/pdfs/
TrendingStocksDriveTheMarket.pdf
Yes Cyrus, most people don't realize just how many tickers are added and removed each day.
Regards,
Max
Post a Comment