Not All Machine Learning is Good Machine Learning

ByMike McBride February 8, 2017February 4, 2020 Reading Time: 3 minutes

I know that you’ve seen all the hype around AI and Machine Learning. It’s probably warranted. AI is in the process of making huge changes in how we work and live.

Recently, though, I saw a good reminder of how much that machine learning is still dependent on the proper inputs, and of course, that means someone giving the machine the proper data.

You may have seen a recent Washington Post story about Family Tree Now, a website that seems to crawl various public databases and grab all sorts of information about people, when they were born, where they lived, etc. Yeah, it’s creepy to think about all of that information being collected up for the world to look at, and the Post article focuses on letting us know how to “opt-out” of that site.

Naturally, my wife and I decided to look ourselves up and see what the site knew about us, and clearly it had access to a lot of public records, it had a variety of address records going back years. Creepy? Sure. The site also had a list of possible relatives and associates, and that’s where someone seems to have made some poor choices when it came to inputs.

The first thing she noticed about her information was that the first possible relative listed, was my first wife. As you might imagine, she was not thrilled, or impressed with the AI. Clearly, Family Tree Now missed some public records, like my divorce! For myself, yeah my first wife was listed as a possible relative, as were her parents and siblings. Again, they missed a record, but fair enough. I also noticed a long list of potential associates, people who I had no connection to at all. Upon further inspection, I realized that much of that list seemed to be made up of people who lived at one of my former addresses, well after I had left. I’m not sure who decided that made for a potential associate.

In short, the technology to crawl through public records seems pretty decent, but maybe incomplete. The learning about what makes for a connection seems pretty illogical. But that all goes back to the programmers. The AI, I assume, was programmed to crawl, but someone didn’t include some records that would have made it clear that some family relationships had been annulled. It also used an overly simply logic to match up dates without looking at the end dates of residences. The machine knew that I lived somewhere in 1996-1997, and it knew I had a different address after that, it said so, but it was still looking at people from the same address 10 years later and assuming a connection. That’s a logical fallacy. The machine didn’t do that. 😉

Why is this important? Because whether you’re talking about Big Data analytics for business and marketing, or TAR in the eDiscovery industry, if the inputs and algorithms aren’t correct, you may end up with the wrong results. Don’t just assume the machine knows, make sure it’s measuring what you think it should be.

Follow these topics: LitigationSupport, Tech

Blogging | Photography | Tech

The Scourge of Amateurs
ByMike McBride July 19, 2012 Reading Time: 1 minute

So now it’s Instagram ruining photography, eh? I can remember, like Matthew, when the same complaints were leveled against blogs and twitter. Heck I can remember when the professional photographer world was up in arms about how DSLR technology led to any MWAC (Mom With A Camera) thinking they could make money taking portraits, and…

Like this:
Like Loading…

Read More The Scourge of Amateurs
Personal | Photography

Lunch break from Class
ByMike McBride October 10, 2007July 20, 2014 Reading Time: 2 minutes

Utilizing the Wi-Fi in the training room to catch up on email, and uploading a handful of photos from the last couple of days. Class has been good in terms of learning, but it’s a mental drain. I’m looking forward to Friday and Saturday and being able to go back to just being a tourist….

Like this:
Like Loading…

Read More Lunch break from Class
Weekly Links

What I’m Sharing (weekly) Sept. 27, 2020
ByMike McBride September 27, 2020October 4, 2020 Reading Time: 1 minute

Everyone Agrees – We Need a Comprehensive U.S. Privacy Law

The Best of Relativity Fest 2020: Our Favorite Commentary

The Re:Set Guide to Recognizing and Tackling Work From Home Burnout

Microsoft Teams is getting virtual commutes and Headspace meditation

Staying In Touch

Our guidance on staying in touch with your network

Algorithms control your online life. Here’s how to reduce their influence.

Inbox Zero: Merlin Mann’s Tips for Managing Your Life Online

If “Angels Fear to Tread” into Search Terms, Why Are Lawyers So Confident About Them?

The Cognitive Biases that Make Us All Terrible People

Relativity Fest Day One Report

How to Network Professionally During the Coronavirus Pandemic

Like this:
Like Loading…

Read More What I’m Sharing (weekly) Sept. 27, 2020
Tech

Google Wave Open to Everyone, Does Anyone Care?
ByMike McBride May 19, 2010 Reading Time: 1 minute

So Google opened up Wave to the public today, instead of restricting it to invites only, as if there’s some sort of pent up demand for it or something. Case in point, does anyone out there who tried out Wave not have a pile of invites they couldn’t give away? I know I do, and…

Like this:
Like Loading…

Read More Google Wave Open to Everyone, Does Anyone Care?
Tech

G.ho.st Update
ByMike McBride July 30, 2007February 6, 2015 Reading Time: 2 minutes

You may recall a while back I talked about G.ho.st the online Virtual Machine project. I haven’t gone back to look at it again in some time, but I got an email from them today, and given the new features, I may have to take a look very soon: As part of our continuous efforts…

Like this:
Like Loading…

Read More G.ho.st Update
Links | LitigationSupport

Linked: The Foreign Language of E-Discovery
ByMike McBride May 25, 2020May 24, 2020 Reading Time: 1 minute

If this is you, you really should take their advice, and go learn something about eDiscovery technology. Have you ever been involved in a meet and confer regarding electronically stored information and felt your adversary was speaking a foreign language? Is active machine learning an unfamiliar concept to you? Is BYOD an acronym for who-knows-what?…

Like this:
Like Loading…

Read More Linked: The Foreign Language of E-Discovery

Not All Machine Learning is Good Machine Learning

Like this:

The Scourge of Amateurs

Like this:

Lunch break from Class

Like this:

What I’m Sharing (weekly) Sept. 27, 2020

Like this:

Google Wave Open to Everyone, Does Anyone Care?

Like this:

G.ho.st Update

Like this:

Linked: The Foreign Language of E-Discovery

Like this:

Leave a ReplyCancel reply

Follow Me!

Top Posts

Like this:

Similar Posts

Like this:

Like this:

Like this:

Like this:

Like this:

Like this:

Leave a ReplyCancel reply

Follow Me!

Top Posts