Archivarix.net - Web archive and search engine.

Published: 2020-09-18

 

In the near future, our team plans to launch a unique service that combines the capabilities of the Internet Archive (archive.org) and a search engine.
We plan to index the main pages of all sites that have ever been saved in the web archive. Our website database will also contain an archive of various metrics such as Alexa, Ahrefs, Majestic, keywords, WHOIS domain and other historical data from the entire lifetime of the website. Thus, it will be possible to search for the desired site to restore drops or extract deleted content by a huge number of parameters, such as keywords in text and tags, traffic on a specific date, the presence of historical links, name server, and so on. Using this system, it will be possible to make selections of any complexity, for example, find all domains deleted in 2018 containing the words "webmaster analytics", which then had an Alexa rating of less than 300k and which had more than 30 unique visitors per day for a specific keyword in 2016 year. Our service will have a convenient interface for finding the required data and will contain screenshots of websites with all parameters in the form of graphs and tables.
The next stage in the development of the system will be the indexing of live sites and the expansion of the indexed content base. For search, media files and some internal pages of the site will be available, selected according to an algorithm that takes into account the importance of this page. As a result, the system will use mainly its own database of archived sites and become independent of Archive.org.

 

Archivarix.net

The use of article materials is allowed only if the link to the source is posted: https://archivarix.com/en/blog/archivarix-net/

Archivarix Broken Links Recovery: Free WordPress Plugin for Finding and Fixing Broken Links

Over time, external links in WordPress posts inevitably break, pages get deleted, domains expire, videos become unavailable. Checking hundreds or thousands of links manually is impractical. Archivarix…

1 week ago
How the Internet Archive Decides What to Archive: Priorities, Frequency, and Data Sources

One trillion saved pages. Over 99 petabytes of data. Hundreds of crawls running simultaneously every day. Behind these numbers lies a question that everyone who professionally works with web archives …

2 weeks ago
How to Find and Buy an Expired Domain with a Good History

Buying an expired domain with history is one of the most effective ways to launch a new project with an already existing backlink profile, trust, and even traffic. Instead of promoting a bare domain f…

3 weeks ago
Common Crawl as an Alternative Data Source for Website Restoration

When it comes to restoring websites from archives, almost everyone thinks only of the Wayback Machine. That's understandable: archive.org is well known, it has a convenient interface, a trillion saved…

1 month ago
Archivarix Cache Viewer Extension for Chrome, Edge and Firefox

We've released a browser extension called Archivarix Cache Viewer. It's available for Chrome, Edge and Firefox. The extension is free and contains no ads whatsoever.
The idea is simple: quick access …

1 month ago
AI Content on Restored Websites: How to Detect It and What to Do About It

When you restore a website from the Web Archive, you expect to get original content that was once written by real people. But if the site's archives were made after 2023, there's a real chance of enco…

1 month ago
Web Archive in 2026: What Has Changed and How It Affects Website Restoration

In October 2025, the Wayback Machine reached the milestone of one trillion archived web pages. Over 100,000 terabytes of data. This is a massive achievement for a nonprofit organization that has been …

1 month ago
Archivarix External Images Importer 2.0 - New Plugin Version for WordPress

We are pleased to introduce version 2.0 of our WordPress plugin for importing external images. This is not just an update, the plugin has been completely rewritten from scratch based on modern requir…

1 month ago
Black Friday & Cyber Monday Coupons

Dear friends!
Black Friday and Cyber Monday are the best time to save on future website restores.
If you plan to restore websites, top up your balance in advance, or simply want to get more – now is…

3 months ago
Archivarix is 8 years old!

Dear friends!
Today we celebrate Archivarix's 8th anniversary, and it's the perfect occasion to say a huge thank you!
We are truly grateful that you chose our service for website recovery from web a…

5 months ago