Beware the Big Errors in Big Data - Nassim Taleb
2013 Feb 11
After Stephen Few, it is now Nassim Taleb's turn to offer some remarks on Big Data:
[…] But beyond that, big data means anyone can find fake statistical relationships, since the spurious rises to the surface. This is because in large data sets, large deviations are vastly more attributable to variance (or noise) than to information (or signal). It’s a property of sampling: In real life there is no cherry-picking, but on the researcher’s computer, there is. Large deviations are likely to be bogus. […]
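Taleb's point about cherry-picking can be illustrated with a small simulation. The sketch below is not from his article: the function names, the 50 observations per series, and the fixed seed are all assumptions made for illustration. It draws one series of pure noise as a "target", then many other independent noise series as "candidates", and reports the strongest correlation found. Since every series is random, any correlation it finds is spurious by construction.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def max_spurious_correlation(n_candidates, n_obs=50, seed=0):
    """Largest |r| between a noise target and n_candidates noise series.

    Hypothetical helper: all series are independent Gaussian noise, so the
    true correlation is zero and whatever is returned is pure chance.
    """
    rng = random.Random(seed)
    target = [rng.gauss(0.0, 1.0) for _ in range(n_obs)]
    best = 0.0
    for _ in range(n_candidates):
        candidate = [rng.gauss(0.0, 1.0) for _ in range(n_obs)]
        # Cherry-picking step: keep only the most impressive result.
        best = max(best, abs(pearson(target, candidate)))
    return best


if __name__ == "__main__":
    # More candidate variables means more chances for noise to look like signal.
    for k in (10, 100, 1000):
        print(k, round(max_spurious_correlation(k), 3))
```

With the same seed, the larger runs include the smaller runs' candidates, so the best correlation found can only stay the same or grow as more variables are searched. That is the mechanism in the quote: the more data there is to search, the larger the deviations that noise alone will produce.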