How to find all links / pages on a website

Jonathan Lyon · Sep 17, 2009 · Viewed 424.9k times

Is it possible to find all the pages and links on ANY given website? I'd like to enter a URL and produce a directory tree of all links from that site.

I've looked at HTTrack, but that downloads the whole site, and I only need the directory tree.

Answer

Hank Gay · Sep 17, 2009

Check out linkchecker. It will crawl the site (while obeying robots.txt) and generate a report. From there, you can script up a solution for creating the directory tree.
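
As a rough idea of what that scripting step could look like, here is a minimal Python sketch. It assumes you have already pulled the crawled URLs out of the report into a plain list; it does not parse linkchecker's report format itself. The sample URLs and the function names `build_tree` and `render_tree` are made up for illustration.

```python
from urllib.parse import urlparse


def build_tree(urls):
    """Fold URL paths into a nested dict: {segment: {child_segment: {...}}}."""
    tree = {}
    for url in urls:
        parts = urlparse(url)
        # The host name is the root of each tree.
        node = tree.setdefault(parts.netloc, {})
        for segment in parts.path.split("/"):
            if segment:  # skip empty pieces from leading/trailing slashes
                node = node.setdefault(segment, {})
    return tree


def render_tree(tree, indent=0):
    """Return the tree as indented lines, with children sorted by name."""
    lines = []
    for name in sorted(tree):
        lines.append("  " * indent + name)
        lines.extend(render_tree(tree[name], indent + 1))
    return lines


if __name__ == "__main__":
    # Hypothetical URLs standing in for whatever the crawl report lists.
    urls = [
        "http://example.com/",
        "http://example.com/about/team.html",
        "http://example.com/about/contact.html",
        "http://example.com/blog/2009/09/post.html",
    ]
    print("\n".join(render_tree(build_tree(urls))))
```

Query strings and fragments are ignored here, so `page.html?id=1` and `page.html?id=2` collapse into one entry. If you need them kept apart, append `parts.query` to the last segment.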