ArchiveBot

Last updated 11 hours ago.

What is ArchiveBot?

About

ArchiveBot is an intelligence gatherer operated by Wikimedia. It's not currently known to be artificially intelligent or AI-related. If you think that's incorrect or can provide more detail about its purpose, please contact us.

Detail

Operator: Wikimedia
Documentation: https://meta.wikimedia.org/wiki/InternetArchiveBot/FAQ_for_sysadmins

Type

Intelligence Gatherer: searches for useful insights

Expected Behavior

The behavior of an intelligence gatherer depends on the goals of its clients. For example, if a client is interested in brand sentiment, the agent would crawl related social media and blog posts more frequently than unrelated websites.

Insights

Activity on Your Website

Half of your website's traffic probably comes from artificial agents, and there are more of them every day. Track their activity with the API or WordPress plugin.


Other Websites

0% of top websites are currently blocking ArchiveBot in some way.

Access Control

Should I Block ArchiveBot?

Probably not, especially if you benefit from an intelligence gathering service yourself. However, you might choose to block it if you're concerned about server resource usage.

Using Robots.txt

User Agent Token    Description
ArchiveBot          Should match instances of ArchiveBot

You can block ArchiveBot or limit its access by setting user agent token rules in your website's robots.txt.

# robots.txt
# This should block ArchiveBot

User-agent: ArchiveBot
Disallow: /
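
Before deploying rules like the ones above, it can help to check how a standards-following parser would interpret them. The following is a minimal sketch using Python's standard-library urllib.robotparser; the helper name is_allowed, the example.com URLs, the /private/ path, and the SomeOtherBot token are all hypothetical and used only for illustration. It shows how a compliant crawler might read the rules, not how ArchiveBot itself behaves.

```python
from urllib.robotparser import RobotFileParser

# The rules from the example above: block ArchiveBot everywhere.
BLOCK_ALL = """\
User-agent: ArchiveBot
Disallow: /
"""

# A hypothetical variant that only limits access to one directory.
LIMIT_ACCESS = """\
User-agent: ArchiveBot
Disallow: /private/
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if the given robots.txt text permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    for name, rules in (("block all", BLOCK_ALL), ("limit access", LIMIT_ACCESS)):
        for agent in ("ArchiveBot", "SomeOtherBot"):
            for url in ("https://example.com/", "https://example.com/private/page"):
                print(name, agent, url, is_allowed(rules, agent, url))
```

Note that robots.txt is advisory: this check only tells you what a compliant parser would conclude from your rules.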

Instead of doing this manually, you can use the API or WordPress plugin to generate a robots.txt that stays up to date with the agent list automatically. The WordPress plugin can also enforce your robots.txt and block agents that try to ignore the rules.
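
To illustrate what server-side enforcement can look like in general, here is a minimal sketch of a WSGI middleware that rejects requests whose User-Agent header contains a blocked token. This is an assumption-laden illustration, not the WordPress plugin's implementation: the names BLOCKED_TOKENS, is_blocked, and block_agents are hypothetical, and the sketch blocks every request carrying the token rather than only those that violate specific robots.txt rules.

```python
from typing import Callable, Iterable

# Hypothetical block list; the token matches the robots.txt example.
BLOCKED_TOKENS = ("ArchiveBot",)


def is_blocked(user_agent: str, tokens: Iterable[str] = BLOCKED_TOKENS) -> bool:
    """Case-insensitive substring match of any blocked token in the header."""
    ua = user_agent.lower()
    return any(token.lower() in ua for token in tokens)


def block_agents(app: Callable) -> Callable:
    """Wrap a WSGI app so that blocked agents receive 403 Forbidden."""

    def middleware(environ, start_response):
        if is_blocked(environ.get("HTTP_USER_AGENT", "")):
            start_response("403 Forbidden", [("Content-Type", "text/plain")])
            return [b"Forbidden\n"]
        # Any other agent is passed through to the wrapped application.
        return app(environ, start_response)

    return middleware
```

A User-Agent header is self-reported and easy to spoof, so a check like this only stops agents that identify themselves honestly.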
