立即打开
大数据的预测盲区

大数据的预测盲区

Kurt Wagner 2013-04-28
美国统计学家内特•希尔是个数学天才,长于利用大数据进行预测。去年美国总统大选期间,他非常准确的预测了美国50个州的投票胜负。但他认为,大数据也不是万能的,有些领域的预测成功率就很低,比如地震,比如股市。

    有什么问题是数据和分析不能回答的吗?

    这都存在于一定的范围内。要知道,相对于我们的潜力,我们做得有多好,与某件事物在本质上有多大的可预测性,二者是有区别的。以棒球为例,虽然分析师已经研究棒球很久了,但是即便是最优秀的棒球队,胜率也只有三分之二。就算是最优秀的击球手,也只有40%的机会上垒。所以在某种意义上,它在本质上仍然是不可预测的,但是我们有了比较好的方法来衡量和了解我们所知道和不知道的事情。

    在很多领域,数据分析还没有广泛应用。比如我在我的《信号与声音:为什么很多地震预测失败了,但有些预测说中了》一书中谈到了地震的预报。千百年来人们一直在尝试预报地震,我们了解了一些现象——比如加州的地震要比新泽西州多,但是在某一时刻及时、精确地预报一场地震的能力可以说毫无进展。甚至就连经济也是如此,一旦我们试图做出长期的经济预测,我们大多数都会做得比较差。

    是否有行业已经在关注大数据分析可能带来的影响?

    有时并一不定是非常热门的行业。比如零售企业有大量的每个消费者的交易记录,也有大量的供应链管理方面的数据,所以在制定库存优化战略、定价优化战略以及供应链应急管理战略时都会用到大数据分析。并不是非常抢眼的东西,但这些人有非常好的数据储备,通常是高质量的数据,因此可以做出更好的决策。我相信有些企业已经这样做了,因此它会带来一些前所未有的效率。

    另外还有其它案例,比如你可以看看电视行业是如何让人们花钱的。我认为广告行业定位顾客的手段变得更先进了。讽刺的是,这种效率从某种程度上也给媒体公司带来了坏处。广告业有一句老话:“你只有一半的广告预算花对了,但你不知道是哪一半。”现在人们可能知道那是哪一半了,所以他们只会花这一半。

    人们能否通过数据或者分析法精确预测股市?

    股市是一场竞赛,你在和其他股民进行竞争。所以问题来了:是否股市的某些交易者要比其他人更厉害?我认为答案可能是“是的”。我不是一个纯粹的股民,不过我玩扑克很久了。我认为玩扑克跟炒股在很多方面是相通的,你知道有些人越到长期越得心应手,而且更擅长应对不确定因素等等。不过股市里还有很多不稳定因素和很多运气成分,一个市场周期可以长达几个月或几年。很多不正当的刺激因素可能会影响股市。所以尽管我认为有些很好的交易者在短期甚至五到十年内都可以顺风顺水,但最终很大程度还是由运气决定的,所以很复杂。

    Are there any questions out there that can't be answered using data and analytics?

    So I think it all exists along a spectrum. It's important to know, too, that there's a difference between how good we are relative to our potential and how intrinsically predictable something might be. So for example if you look at baseball where analytics have come an awful long way, well it's still the case that the best baseball teams only win two-thirds of their games. The best hitters only get on base about 40% of the time. So it's still intrinsically unpredictable in a sense, but we have a good way of measuring and knowing what we know and what we don't know.

    But there are a lot of fields where analytics have not come very far. I discuss earthquake forecasting in my book [The Signal and the Noise: Why So Many Predictions Fail -- but Some Don't] for instance, where people have been trying for centuries. We know something -- there are more earthquakes here in California than in New Jersey -- but the ability to anticipate a particular earthquake with any precision at a particular moment in time has not gone very well at all. Even economics -- when we try to do long-term economic forecasting, it has been pretty poor for the most part.

    Are there any industries out there that are overlooking the possible impact of big data analytics?

    It's sometimes industries that aren't very sexy necessarily, so big retail businesses for example have tons of records on every consumer transaction they have [completed]. They have a ton of data on supply chain management, so things like that in terms of optimal inventory strategies or optimal pricing strategies or robust strategies for disruption in the supply chain. Not super-sexy stuff, but those people have really good data sets, often high quality data, and can make better decisions as a result. I'm sure that some are doing it already, and there are some efficiencies there that you weren't seeing before.

    There are also cases like if you look at how people consume television, for example. I think the advertising industry is something that's gotten more sophisticated in terms of targeting customers. The irony is that efficiency has become harmful in some ways for media companies, because what was the old adage? "Half your advertising budget is well spent, but you don't know which half." Now people might know which half, so they're only spending half as much.

    Can people use data or analytics to accurately predict the stock market in any way?

    The problem is the stock market is this whole contest where you're competing against other creators. So the question is: Are there some traders that are better than others? I think the answer is probably, Yes. I'm not a pure markets guy, I played poker long enough which I think has parallel skills to trading in many respects where you know that some people are better over the long-term and better at accounting for uncertainty and so forth. But there's a lot of volatility and a lot of luck where a market cycle can last for months or years. There are a lot of perverse incentives that get in the way. So while I think there are some good traders, in the near term, even over a period of five or 10 years, it will mostly be dictated by luck, so it's tricky.

热读文章
热门视频
扫描二维码下载财富APP