How to Use the Internet Archive to Restore a Web Page

If you need to restore a page (or more) of your site, the first suggestion is to restore a copy from a backup. So, what do you do if you do not have a backup you can restore from? In a previous article, we walked you through using Google's cache to restore a page, but that isn't an option if Google's cache has updated and no longer contains the page you want to restore.

Fortunately, there is another option you can try. The Internet Archive is a non-profit group whose goal is to create an Internet library. Using their "Wayback Machine" you can search their archive for a prior version of your site (and pages) which you can then use for rebuilding your page.

How to Restore your Website with the Internet Archive

Begin by navigating to the Internet Archive: Wayback Machine.
Type in the full URL of the page you want to look for (e.g. yourdomain.com/index.html)
Click the "Take Me Back" button.
On the next page you should see a calendar showing years near the top of the page and the months of that year in the middle of the page. Blue highlights denote days that the site was archived (referred to as a "snapshot"). You can click on a date to open a snapshot of your page from that day.

If you would like to see a list of the pages contained in the archive for a site, add an asterisk after the domain name (e.g. http://yourdomain.com*). You can also filter this list by file extension if you like (.html, .pdf, etc.).

When you open a page in the Wayback machine, you'll notice a header at the top with information and navigation for the Wayback Machine.

To view the page without this code so that you can easily restore your page, add "id_” (without the quotes) between the date and the forward slash before your URL.

Now you can view the source code for the page (in most browsers simply right click and select View Page Source or something similar). Copy the code and paste it into either a text editor where you can save it as an HTML file and view it locally or in a blank test HTML file on the server. Once you are satisfied with the recreated page, rename it as the page you need to replace.

Please note, there is no guarantee that the Internet Archive will have a copy of your site files or that the files will work as you expect them to. This should be an alternative to restoring an actual backup of your file.

Did you find this article helpful?

We value your feedback!

Why was this article not helpful? (Check all that apply)

The article is too difficult or too technical to follow.
There is a step or detail missing from the instructions.
The information is incorrect or out-of-date.
It does not resolve the question/problem I have.

How did you find this article?

Please tell us how we can improve this article:

Email Address

Name

new! - Enter your name and email address above and we will post your feedback in the comments on this page!

Did you find this article helpful?

« Prev

How to Remove Your Domain From Security Block Lists

Steps to Take if Your Website is Hacked

Comments

Post a comment

hartator

n/a Points

2015-08-15 9:56 pm

I've made a tool to generate a backup from the Wayback Machine: https://github.com/hartator/wayback-machine-downloader

nobody

n/a Points

2015-10-17 2:34 pm

thanks hartator...working like a charm

Roger

n/a Points

2018-04-11 4:42 am

Hey Hartator,When using any wayback downloader, will I be able to reupload the file and have full functionality of the site?

arnelc

Staff

17,314 Points

2018-04-11 3:39 pm

The page is typically a cached version of the website. Code that can be captured in the displayed page can be used to help rebuild the site. Anything server-side that is part of the back end of the original site may not necessarily be caught by the service.

Rome

n/a Points

2015-10-05 5:12 am

hartator... if you really made that, it is f'n awesome !!!

Niraj patel

n/a Points

2017-08-14 12:49 pm

nice downloader

Michael

n/a Points

2017-11-02 10:40 pm

I coded a web-based tool that recovers entire website - and removes any reference to archive.org.

You can test it here: https://www.waybackmachinedownloader.com/

Also, this article is a bit outdated, as a "blue circle" isn't the same anymore of what it used to be. You now also have red, yellow and green circles. From our FAQ:

A blue circle means a status code of 2xx, such as 200. This is the normal status code for a regular web page on the Wayback Machine. A blue circle is usually a safe choice.
A green circle signifies a 3xx status code, which means a redirect. Try to avoid the green dots when picking a date to scrape. It's better to get the target URL which the redirect leads to.
Orange means an error with a 4xx status code.
A red dot around the date means a server-side error, which carries a 5xx status code.

rajuakon

n/a Points

2019-10-24 7:51 pm

please help me i can not get data through access into the "https://web.archive.org/". how can I get access?

johnpaulb-whh2

Staff

12,339 Points

2019-10-24 9:30 pm

Access to the Internet Archive is free. The only available archive of your site is on June 29, 2019. If that does not have what you are looking for, you will have to find an alternate backup to restore.

Name:
Email Address:
Phone Number:
Comment:	p

Submit	Please note: Your name and comment will be displayed, but we will not show your email address.

News / Announcements

Help Center Login

Don't want to login using Facebook or Google+?

Email:	support@WebHostingHub.com	Ticket:	Submit a Support Ticket
Call:	757-416-6627	Chat:	Click To Chat Now

Knowledge Base

Community Q&A

Learning Corner

Website Troubleshooting

How to Use the Internet Archive to Restore a Web Page

How to Restore your Website with the Internet Archive

Comments

Post a Comment

News / Announcements

Help Center Login

Related Questions

Help Center Search

Current Customers

Ask the Community

Not a Customer?