Tech

Will Big Data Give Us a Whole Bunch of Questionable Correlations?

By Mike McBride | February 26, 2020 | March 6, 2020 | Reading Time: 3 minutes

I think, statistically speaking, there’s no way that it won’t.

I’m listening to a recent episode of Seth Godin’s podcast entitled “Sample Size” (http://aca.st/0d21e1).

Go listen to it, really.

What struck me about some of the examples he gave, especially when talking about the football prediction sites, is that as we collect more and more data, we will, by sheer force of numbers, end up correlating things that actually have nothing to do with one another.

To wit, Seth describes starting up 200 websites, with 100 predicting Team A would win and 100 predicting Team B. Then, after Team B wins, you shut down all the sites that predicted Team A, and repeat for weeks on end, until you get to the Super Bowl with one site left that has predicted every game correctly so far. People viewing that site would assume, incorrectly, that the person making these predictions must really know something. What the operator actually knows is that if you run random predictions enough times, over enough websites, one of them will likely end up getting them all right. But that record means nothing for the next game, because it was all just random noise in the data points.
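To make the trick concrete, here is a minimal Python sketch of my own, not anything from the podcast. It assumes a simplified version of the scheme: a hypothetical 10-game season, one site for every possible sequence of picks, and game results decided by coin flip. The function name `perfect_sites` and all the numbers are illustrative.

```python
import itertools
import random

def perfect_sites(num_games, seed=0):
    """Return (total sites, sites with a perfect record) for a coin-flip season."""
    rng = random.Random(seed)
    # One site per possible sequence of picks: 2**num_games sites in total.
    sites = list(itertools.product("AB", repeat=num_games))
    # Game results are coin flips, so no site has any real knowledge.
    results = tuple(rng.choice("AB") for _ in range(num_games))
    # Keep only the sites whose picks matched every single result.
    survivors = [picks for picks in sites if picks == results]
    return len(sites), len(survivors)

total, perfect = perfect_sites(num_games=10)
print(total, perfect)  # 1024 1
```

However the games turn out, exactly one site ends up with a perfect record, and its pick for the next game is still a coin flip.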

Since I usually read stuff about Big Data and AI, this caught my attention immediately, because when we feed enough data into an algorithm and go looking for correlations, it will find tons of them. In fact, we are quickly coming up on a time (if we aren’t already there) when finding correlations is easy. It takes almost no skill. Anyone will be able to do it.
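Here is a rough Python sketch of how easy it is, under assumptions that are entirely mine: one year of weekly figures (52 points), 1,000 unrelated data series, and an arbitrary cutoff of 0.3 for what counts as a "strong" correlation. Every series is pure random noise, so any correlation the code finds is meaningless by construction.

```python
import math
import random

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length lists."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    spread_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    spread_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (spread_x * spread_y)

def count_strong_correlations(num_series=1000, num_points=52,
                              threshold=0.3, seed=0):
    """Count random series that correlate 'strongly' with a random target."""
    rng = random.Random(seed)
    # The series we care about (think weekly sales), here just noise.
    target = [rng.gauss(0, 1) for _ in range(num_points)]
    hits = 0
    for _ in range(num_series):
        # An unrelated series, also just noise.
        other = [rng.gauss(0, 1) for _ in range(num_points)]
        if abs(pearson(target, other)) >= threshold:
            hits += 1
    return hits

print(count_strong_correlations())
```

With these settings you should expect a few dozen "strong" correlations out of the 1,000, even though nothing here is related to anything else. Add more series and the count only goes up.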

The smart people, however, will understand which ones matter, and which ones can be acted upon.

For example, if I own a business and I have enough data, I may discover that over the last year, my online store sold more widgets during the week after a music festival in Terre Haute, Indiana. I might, therefore, be tempted to sponsor a new music festival in Indiana to give me another week of increased sales, right? There’s a correlation!

But is the increase in sales being driven by that music festival? Or is it just a random coincidence that our online sales went up slightly during those same couple of weeks?
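One way to put a number on the "random coincidence" question is a permutation test. The sketch below is only an illustration under my own assumptions: `sales` is a hypothetical list of weekly sales figures, `festival_weeks` holds the positions of the post-festival weeks in that list, and the function name is made up. It asks how often a random set of weeks would show a sales lift at least as big as the one observed.

```python
import random

def permutation_p_value(sales, festival_weeks, num_shuffles=10_000, seed=0):
    """Share of random week selections whose lift matches or beats the real one."""
    rng = random.Random(seed)
    k = len(festival_weeks)

    def lift(weeks):
        # Average sales in the chosen weeks minus average sales in all others.
        chosen = set(weeks)
        inside = [s for i, s in enumerate(sales) if i in chosen]
        outside = [s for i, s in enumerate(sales) if i not in chosen]
        return sum(inside) / len(inside) - sum(outside) / len(outside)

    observed = lift(festival_weeks)
    as_extreme = sum(
        1 for _ in range(num_shuffles)
        if lift(rng.sample(range(len(sales)), k)) >= observed
    )
    return as_extreme / num_shuffles

# Made-up data: a flat year with two big weeks at positions 10 and 30.
sales = [100] * 52
sales[10] = sales[30] = 200
print(permutation_p_value(sales, festival_weeks=[10, 30]))
```

A small value says a lift that size rarely shows up by chance. It does not say the festival caused it, and if you run this check on hundreds of candidate events, some will look convincing by luck alone.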

Now, this one may seem obvious; it probably doesn’t take much to see that it’s unlikely one is causing the other. But there are going to be, literally, hundreds of these kinds of correlations that data and AI bring to light. The people who can tell which ones matter and which ones are irrelevant will win in business. Those who can’t will end up making big mistakes chasing correlations that are not meaningful, just random.

Then, extrapolate that out into the public realm. As AI and big data shape more and more of our public policy decisions, will the people shaping those policies be smart enough to understand the correlations, and which ones matter? Will our decisions about policing, criminal sentencing, economics, even predicting things like natural disasters, climate change, or terrorism risks get skewed by random data points that look like true causation but are really just correlations of no consequence? And will we end up missing true, very real risks as they get lost in the noise of hundreds, maybe thousands, of random correlations?

These are serious questions, and it’s going to take some really smart people to understand what the data spits out and act on it appropriately.

Do we have enough of them?

Tech

Linked – Is AI the new bloatware?
By Mike McBride | September 5, 2024 | Reading Time: 3 minutes

Whether you consider it bloatware or not may depend on your plan to use AI on a mobile device, but one thing is for sure about all hardware and many services that are adding AI features: They’re getting more expensive.

Adding the power to run AI tools locally costs money. If all Pixel phones are going to do all the AI work on photos and all the iPhones are going to process ChatGPT interactions locally, that’s going to require more expensive hardware.

If all Windows PCs will come with Recall, the same thing applies. The chips that can handle these transactions are in high demand and are not cheap.

LitigationSupport

ABA Techshow Day 2 - EDiscovery from the front lines
By Mike McBride | March 14, 2008 | July 19, 2014 | Reading Time: 4 minutes

Browning Marean pointed out that the Qualcomm case is a good place to start discussing the dangers. Mess up discovery and your firm can be sanctioned, you can be sanctioned, etc. There’s real danger in not handling evidence correctly. Judge Facciola: “I re-read the Qualcomm case the other night, closed it and thanked God that…

Uncategorized

Gloomy morning
By Mike McBride | June 6, 2002 | Reading Time: 2 minutes

Man, what a gloomy start to the day we got here in Columbus. Rainy, gray, blah. It’s one of the few days I’m actually glad my office doesn’t have any windows. The password switch didn’t go too badly, the ISP was kind enough to bend the rules a tad and send me a list…

Artificial Intelligence

Worth Reading – Artificial Intelligence Can’t Fix the Work Environment
By Mike McBride | August 5, 2025 | Reading Time: 2 minutes

When you read some of the statistics about email, meetings, interruptions, etc., it’s hard not to see the same glaring red flag that Sharlyn sees. We might suck at communicating.

Microsoft | Tech

Why did they do that?
By Mike McBride | February 22, 2007 | November 8, 2014 | Reading Time: 1 minute

Here’s something I thought about today while talking to someone on the Friends in Tech forums. Their question was about being unable to see the Temporary Internet Files folder even after showing hidden folders. I’ve found that since I installed IE7, that folder and the history folder are “protected operating system file and folders”,…

LawFirms | Links

Linked – How Law Firms Can Guard Critical Information from Cyber Attacks
By Mike McBride | January 2, 2017 | December 29, 2016 | Reading Time: 1 minute

“Cybercriminals have made a living targeting industries which are reliant upon information that needs to be readily available and is strictly confidential. This trend puts law firms at high risk due to the nature of their work, and the recent movement of data to cloud-based case management, record searching, research, and platforms for communication. Legal…

