
HTTrack

Thanks to Andy, and others, for the suggestion of HTTrack. While it didn't download the whole site cleanly, being able to see what it was doing helped me realize what problem I was running into.

It seems that this particular site serves its 404 page in place of any page that doesn't exist, rather than redirecting to a separate 404 URL. For example, if you went to brokenlink.html in the root of the site, the address would stay the same, but the page that loaded would be the contents of the 404. That wasn't where the problem lay, as it's fairly common behavior. (I think the 404 on this site does the same thing.) The problem was that the links and images on the 404 page used relative paths. If you're at the root of the site, that works great. If you're loading, say, badfolder/brokenlink.html and get the error page instead, none of the images load and the links are broken, because they're written relative to the site root, which is no longer where you are.
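To make the relative-path problem concrete, here's a tiny illustration using Python's standard library. The host and the images/logo.gif path are made up, not taken from the actual site; the point is that the same relative href resolves to a different absolute URL depending on which folder the error page happens to be served from.

```python
# Hypothetical example: why the error page's relative links only work from
# the site root. "images/logo.gif" stands in for whatever the 404 page links to.
from urllib.parse import urljoin

# Error page served at the root: the relative image path resolves as intended.
print(urljoin("http://example.com/brokenlink.html", "images/logo.gif"))
# -> http://example.com/images/logo.gif

# Same error page served for a missing page inside a subfolder: the path now
# points somewhere that doesn't exist either.
print(urljoin("http://example.com/badfolder/brokenlink.html", "images/logo.gif"))
# -> http://example.com/badfolder/images/logo.gif
```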

If you're thinking ahead with me here, you've probably already realized what happens when a spider that is grabbing pages and following links hits this. It gets the error page, follows a link to another page that doesn't exist (resolved relative to the current folder), and gets the error page again. Rinse and repeat, to infinity and beyond.

No wonder the downloads just kept getting larger and larger until they crashed out of memory.
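To show why a hop limit is the way out, here's a minimal sketch of a depth-limited crawler. This isn't how HTTrack works internally, and example.com and max_depth=5 are placeholders; it just illustrates the idea the rest of this post circles around: stop following links after a fixed number of hops, no matter how many "new" URLs the error page keeps generating.

```python
# A minimal sketch (not the author's actual setup) of a depth-limited crawl
# with a visited set. The depth cap is what ends a runaway error-page loop
# even when each bogus link resolves to a URL the crawler hasn't seen before.
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse
from urllib.request import urlopen


class LinkExtractor(HTMLParser):
    """Collect href values from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def crawl(start_url, max_depth=5):
    seen = {start_url}
    queue = deque([(start_url, 0)])
    while queue:
        url, depth = queue.popleft()
        if depth >= max_depth:
            continue  # the depth cap stops the spider from chasing the loop forever
        try:
            with urlopen(url) as resp:
                html = resp.read().decode("utf-8", errors="replace")
        except OSError:
            continue  # skip pages that fail to load
        parser = LinkExtractor()
        parser.feed(html)
        for href in parser.links:
            absolute = urljoin(url, href)
            # stay on the same host and skip anything already queued
            if (urlparse(absolute).netloc == urlparse(start_url).netloc
                    and absolute not in seen):
                seen.add(absolute)
                queue.append((absolute, depth + 1))
    return seen


if __name__ == "__main__":
    pages = crawl("http://example.com/", max_depth=5)
    print(f"visited {len(pages)} URLs")
```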

On the plus side, I think that between the various attempts we probably have everything we needed in the first place, so I don't have to try to do this again. On the other hand, I'd like to figure out just how I can tell HTTrack, or any other tool, to stop itself from getting into this loop. Any ideas?
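One hedged sketch of an answer: HTTrack has a mirror-depth option (-rN, long form --depth), which should make it give up after a fixed number of hops instead of chasing the error page forever. The snippet below just wraps that command in Python to keep the examples in one language; example.com, the output folder, and the depth of 6 are stand-ins, not values from this capture, and I haven't tested this against a site with this exact 404 behavior.

```python
# A sketch, not a tested fix: invoking HTTrack with an explicit recursion
# depth so the mirror stops after a fixed number of hops. example.com and
# ./mirror are placeholders, and the right depth for a real site is a guess.
import subprocess

subprocess.run(
    [
        "httrack",
        "http://example.com/",  # placeholder for the site being captured
        "-O", "./mirror",       # output directory for the local copy
        "-r6",                  # cap the mirror depth at 6 links from the start page
    ],
    check=True,  # raise if httrack exits with an error
)
```

The same command can of course be typed straight at the command line; the wrapper adds nothing beyond keeping the example self-contained.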

Technorati Tags: Httrack, websitecapture, offlinebrowsing


One Comment

  1. It's been a long time since I've used it, but I think you can limit it to X hops so it will bomb out after 5 links (for example). Limiting it to something like 20 links may help?
