What is Archivarix?
Archivarix is a free open-source CMS combined with an online website downloader and a Wayback rebuilder. With our
system you can restore a website from the Wayback Machine (web.archive.org) exactly as it was, or download an
existing website and get it in a zip file. When the scraping process is complete, you get a fully working copy of
the restored or downloaded site with Archivarix CMS, so you can easily modify and operate it.
Restore a website that previously existed and was crawled by the Wayback Machine. We process all restored data to
provide a final, ready-to-upload website with many additional improvements, code fixes, ad removal, images
a live website
Download or convert any existing website to make it secure, optimized and editable through our CMS. You can also
download websites whose domain has expired but whose hosting still works.
Edit and manage
We created Archivarix CMS as a convenient way to edit restored or downloaded websites. A single file to
upload, no installation required.
Recently our system has been updated and now we have two new options:
- You can download Darknet .onion sites. Just enter the .onion website address in the "Domain" field and our system will download it from the Tor network just like a regular website.
- Content extractor. Archivarix can not only download existing sites or restore them from the Web Archive, but can also extract content from them. In the "Advanced options" field, select "Extract structured content". You will then receive a complete archive of the entire site, plus an archive of articles in XML, CSV, WXR and JSON formats. When creating the archive of articles, our parser keeps only meaningful content and excludes duplicate articles, design elements, menus, ads and other unwanted elements.
2019.09.18 New features and improvements:
- Create a website with a www default subdomain.
- Set a referer to bypass cloaking with our Live Website Downloader.
- New mode for returning a 404 code instead of the default 301 for missing URLs.
- Improved external iframes removal.
- Improved loader (index.php).
2019.05.20 A new feature - select the User Agent for downloading live websites. Do you need the version
that is shown only to Googlebot? Now you can have it.
2019.05.09 A new feature - preserving 301/302 redirects for websites restored from Wayback Machine
and downloaded from live originals.
2019.04.26 A new version of our Archivarix Website Downloader. Huge speed improvements and
better crawling with support for modern websites.
Archivarix Affiliate Program is available! Start making money now. Get 15% from your referrals for life.
Creating custom modules? Integrating your existing link exchange system with Archivarix
restores? We've released a new loader (index.php) that has additional important variables for developers who create
their own custom include modules. All new restores come with the updated loader. You can download it manually
to update your existing website. It's 100% compatible with any previous restore made with our system. This update
also brings significant speed improvements and lower memory consumption for sitemap.xml on big websites. Tested on
restores with 3+ million pages.
2019.02.25 You can now Sign In or Register in a single click with Google.
2019.01.21 Improved live website downloader.
2019.01.09 Additional CMS update. Multiline search! Finally!
2019.01.07 CMS update. Default locale detection, external redirects support, Search & Replace
within other text formats, statistics with charts.
2018.12.01 We are switching from the 'Ālep to the Bēt version of Archivarix! Live website download
functionality is now publicly available. All restored and downloaded websites are fully compatible with our
Archivarix CMS. Thank you to all our testers!
An update for our Archivarix CMS: improvements, a more intuitive password setup and additional limits that allow
working with big (50k+ files) restores with little memory. We are working on a way to switch "lean" mode on
without editing code. Stay tuned!
2018.08.03 More support for very old and rare charsets.
2018.05.24 You can now use custom password protection for the CMS by setting the ACMS_LOGIN_PAGE = 'mypassword';
variable. The password must have at least 6 characters. Improved support for encoding on servers that
don't have mbstring.
Our new CMS is officially released! Thank you to all alpha testers for working together on
a script that brings restoring websites to a completely new level. You can edit pages, add new ones, search and
replace matches... and a lot more. The latest version is available on our Archivarix CMS page.
2018.04.02 XML Sitemaps! Just set ARCHIVARIX_SITEMAP_PATH in our index.php loader. If your restore
does not contain the new loader (release 20180403) with sitemap support, you can create a new clone or contact us
and we will reassemble all your restores to the latest version. We are also preparing all new restores to work with
our own CMS that we are working on. All old restores will be automatically converted to a new version before we release
2018.04.01 Improved charset detection for text/html mime-type files.
Loader: improved handling of missing .css and .js files requested with a query string; support for a trailing
slash in all URLs without queries.
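The trailing-slash behaviour can be illustrated with a short sketch. This is not the loader's actual code (the loader is a PHP script); it is a hypothetical Python illustration that assumes stored pages are looked up by path and that a URL without a query should match with or without the trailing slash. The function name lookup_keys is made up for this example.

```python
from urllib.parse import urlsplit


def lookup_keys(url: str) -> list[str]:
    """Return candidate keys to try when looking up a stored page.

    Hypothetical helper: URLs with a query are matched exactly,
    URLs without a query are matched with and without a trailing slash.
    """
    parts = urlsplit(url)
    path = parts.path or "/"
    if parts.query:
        # A query makes the URL specific, so no slash variants are tried.
        return [f"{path}?{parts.query}"]
    stripped = path.rstrip("/") or "/"
    if stripped == "/":
        # The site root has only one form.
        return ["/"]
    return [stripped, stripped + "/"]
```

With this approach, a request for /about and a request for /about/ produce the same pair of candidate keys, so either form finds the same stored page.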
2018.03.24 Improved support for hosting providers with an old PDO_SQLITE version. Error messages in our
index.php loader are more user-friendly. The "Make internal links relative" option is more intelligent now.
2018.03.08 Six new languages in the user interface! Write to us if you see any grammar errors.
2018.02.28 Restored websites will work on a different domain name by default. No need to set
ARCHIVARIX_CUSTOM_DOMAIN. It just works!
2018.02.26 Big important update! Our system can restore websites that were restricted by
robots.txt. We are very proud of this update.
2018.02.22 Fix to support some rare cases where $_SERVER['HTTPS'] is set to 'off' instead of empty
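The case behind this fix is easy to miss: some servers (PHP documents this for ISAPI on IIS) set the HTTPS variable to 'off' for plain HTTP requests, so a simple "is it non-empty" check reports HTTPS incorrectly. The sketch below shows the idea in Python with a plain dictionary standing in for $_SERVER. It is an illustration under that assumption, not the loader's actual code, and the function name is_https is hypothetical.

```python
def is_https(server: dict) -> bool:
    """Report whether a request arrived over HTTPS.

    `server` stands in for PHP's $_SERVER array (an assumption made
    for this sketch). An absent, empty or 'off' value means plain HTTP.
    """
    value = str(server.get("HTTPS", "")).strip().lower()
    return value not in ("", "off")
```

Checking only for a non-empty value would treat 'off' as HTTPS, which is the rare case this changelog entry describes.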
2018.02.20 Fix for encoding detection. The "Optimize HTML-code" feature now works as expected even if
the website had a rare non-UTF-8 charset.
2018.02.12 ¡Hablamos español! And some fixes and improvements for correct restores of websites with
mixed HTTP/HTTPS content.
2018.01.12 An additional CMS mode for WordPress and other systems. A new option,
ARCHIVARIX_CUSTOM_DOMAIN, lets the restored website run on another domain or localhost.
2017.12.20 Fixes for missing subdomains on some recovered websites.
2017.12.01 We have added MIME type statistics to the download page. Now you can see how many JPEGs,
HTML files and other file types you will get in the archive.
2017.11.21 Bug with an infinite redirect loop on some sites is fixed.
2017.11.14 We have made a content downloader based on PHP and SQLite - we have a version for
Apache+PHP, NGINX+PHP and a legacy version with .htaccess only. Recovered sites will work much faster now. Other
features - integration with WordPress, integration with any TDS or other custom scripts and so on - see the full
description in the "Tutorial and prices" section.
2017.10.21 Now the system can find JS trackers and delete them when you select the option "Remove trackers
2017.10.17 Fix for compatibility with ModPagespeed hosting
2017.10.11 New features added: "Remove trackers and analytics" - you will get the recovered site clean
of any ads and banners (we have over 60000 code signatures in our database). "Make internal links relative" option
- the system will convert all links in the downloaded site to relative links.
2017.10.10 We have made tutorial videos in Russian and English. You can see them in "Tutorial and
2017.10.09 Error with CSS styles in some restored sites fixed. Some other minor errors are fixed.
2017.10.03 Big performance improvements. Our system is running faster now.
2017.09.29 We have launched our service. The downloaded archive contains only a non-PHP version of the
website. Our own CMS and integration with other CMSs like WordPress, Drupal and Joomla are planned.