Linked – Using Near-Duplication to Dedupe Document Collections Can be Dangerous

ByMike McBride November 9, 2015November 8, 2015 Reading Time: 2 minutes

“The three major distinctions are:-Per Family (email + attachment) vs. Per Document
Deduplication is performed on the family level, while near-duplication is performed on the document level.
–Textual Analysis vs. File Analysis
Near-duplicate detection uses only the text AND white space to compare documents, but deduplication uses a set of criteria based on the actual metadata of the files.
–Duplicates vs. Similarities
Deduplication removes identical document families, while near-duplicate detection groups documents together by similarity.”

Deduplication is not the same as identifying near-duplicates. On the other hand, there are a lot of reasons to do both, so long as you understand the differences, and the different things you are trying to accomplish with each.

I’m a big fan of using near duplication technologies to cluster together similar content. Our brains simply function better if we can focus on one subject at a time, so document review done in this manner is more efficient, period.

Using Near-Duplication to Dedupe Document Collections Can be Dangerous

Follow these topics: Links, LitigationSupport

LitigationSupport

The Lit Support Conundrum
ByMike McBride February 7, 2011April 4, 2014 Reading Time: 2 minutes

How do you get buy in for technology tools that make legal work more efficient from people who are being “measured” by the number of hours they bill? This is not a silly question for those of us working in this field. I can sit here all day and talk about the benefits of using…

Like this:
Like Loading…

Read More The Lit Support Conundrum
Links | Tech

Linked – Google China Prototype Links Searches to Phone Numbers
ByMike McBride September 16, 2018September 16, 2018 Reading Time: 1 minute

If the details in this story are true, Google is about to be absolutely complicit in violating the human rights of millions of Chinese citizens. Not just linking search history to individual phone numbers, but actively blocking material, replacing factual material with information straight from the government on things like weather and pollution levels. And…

Like this:
Like Loading…

Read More Linked – Google China Prototype Links Searches to Phone Numbers
LitigationSupport | Tech

The great laptop experiment ends
ByMike McBride September 27, 2007July 20, 2014 Reading Time: 1 minute

The experiment that had me trying to see if we could have a couple of pool laptops to go to trial sans MS Office officially came to an end today. Much as we don’t want to, it looks like we’re going to have to order the 5 user open license of 2007 so we can…

Like this:
Like Loading…

Read More The great laptop experiment ends
LitigationSupport

ALSP Redesign
ByMike McBride May 4, 2008July 20, 2014 Reading Time: 2 minutes

I got an email a couple of weeks ago with my login information for the new ALSP website. Supposedly, logging in was going to allow me to access all parts of the site, but I haven’t actually seen any pages that I can’t get to without logging in, so I’m not sure just what is…

Like this:
Like Loading…

Read More ALSP Redesign
LawFirms | LitigationSupport | Tech

Showing Off Tools
ByMike McBride April 19, 2010 Reading Time: 2 minutes

I’ve been doing a bunch of demo’s of Trial Director around the firm lately, and getting some pretty decent response from the folks who see it. The demo’s do prove a couple of things to me though, especially as compared to Summation demo’s I’ve done for these same people. 1. You can’t possibly know how…

Like this:
Like Loading…

Read More Showing Off Tools
Links | Mental Health

Linked: Employer initiatives to increase staff wellbeing found to be ineffective
ByMike McBride August 27, 2021August 26, 2021 Reading Time: 2 minutes

You do see the problem here, right? As an employee, great that there’s a webinar planned on stress management, but if I now have to work an hour later that day in order to attend the webinar, it’s not helping. Lots of HR departments are making tools available, but managers are still expecting the same amount of work, with the same crazy deadlines and expectations, from a likely short-staffed team, so who has time to use them?

So they don’t help. Not because they aren’t helpful, but because you’ve made self-care and wellbeing yet another thing for your employees to do.

Employee burnout does not exist solely because your employees haven’t figured out how to meditate. It’s systemic to our way of doing business. Unless that changes, we’re just rearranging deck chairs.

Like this:
Like Loading…

Read More Linked: Employer initiatives to increase staff wellbeing found to be ineffective

Linked – Using Near-Duplication to Dedupe Document Collections Can be Dangerous

Like this:

The Lit Support Conundrum

Like this:

Linked – Google China Prototype Links Searches to Phone Numbers

Like this:

The great laptop experiment ends

Like this:

ALSP Redesign

Like this:

Showing Off Tools

Like this:

Linked: Employer initiatives to increase staff wellbeing found to be ineffective

Like this:

Leave a ReplyCancel reply

Follow Me!

Top Posts

Like this:

Similar Posts

Like this:

Like this:

Like this:

Like this:

Like this:

Like this:

Leave a ReplyCancel reply

Follow Me!

Top Posts