crawler4j

What is crawler4j?

About

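crawler4j is an uncategorized agent. Its user agent string links to the open-source crawler4j web crawler library for Java on GitHub, so what it does on a given website depends on who is running it. If you think this categorization is incorrect or can provide additional detail about its purpose, please contact us. You can see how often crawler4j visits your website by setting up Dark Visitors agent analytics.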

Expected Behavior

Behavior will vary depending on whether this agent is a search engine crawler, data scraper, archiver, one-off fetcher, etc.

Type

Uncategorized
Not currently assigned a type

Detail

Last Updated 9 hours ago

Insights

Top Website Robots.txts

1% of top websites are blocking crawler4j

Country of Origin

United States
crawler4j normally visits from the United States

Global Traffic

The percentage of all internet traffic coming from Uncategorized Agents


Robots.txt

Should I Block crawler4j?

It's difficult to say without a type. Its effect on your website could be either good or bad, depending on what it is being used for.

How Do I Block crawler4j?

⚠️ Manual Robots.txt Edits Are Not Scalable
New agents are created every day, so a hand-maintained robots.txt falls out of date quickly. Consider serving a continuously updating robots.txt that blocks new agents automatically.

You can block crawler4j or limit its access by adding rules for its user agent token to your website's robots.txt. Compliance with robots.txt is voluntary, so set up Dark Visitors agent analytics to check whether it's actually following them.

User Agent String: crawler4j (https://github.com/yasserg/crawler4j/)
# robots.txt
# This should block crawler4j

User-agent: crawler4j
Disallow: /
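
As a rough way to check both that the rule above matches crawler4j and that the crawler is actually following it, the sketch below parses the robots.txt with Python's standard urllib.robotparser and then scans access-log lines for crawler4j requests to disallowed paths. The log lines, the Combined Log Format assumption, and the helper names build_parser and find_violations are illustrative assumptions, not part of any Dark Visitors or crawler4j API.

```python
import re
from urllib.robotparser import RobotFileParser

# The rule from this page.
ROBOTS_TXT = """\
User-agent: crawler4j
Disallow: /
"""

# Hypothetical access-log lines, assumed to be in Combined Log Format.
SAMPLE_LOG = [
    '203.0.113.5 - - [01/Jan/2024:00:00:01 +0000] "GET /robots.txt HTTP/1.1" '
    '200 42 "-" "crawler4j (https://github.com/yasserg/crawler4j/)"',
    '203.0.113.5 - - [01/Jan/2024:00:00:02 +0000] "GET /private/page HTTP/1.1" '
    '200 512 "-" "crawler4j (https://github.com/yasserg/crawler4j/)"',
    '198.51.100.7 - - [01/Jan/2024:00:00:03 +0000] "GET /index.html HTTP/1.1" '
    '200 1024 "-" "Mozilla/5.0"',
]

# Captures the request path and the user agent from one log line.
LOG_PATTERN = re.compile(
    r'"(?:GET|HEAD|POST) (?P<path>\S+) [^"]*" \d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def build_parser(robots_txt):
    """Parse robots.txt text into a RobotFileParser without any network access."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


def find_violations(log_lines, parser, token="crawler4j"):
    """Return paths requested by `token` that robots.txt disallows for it."""
    violations = []
    for line in log_lines:
        match = LOG_PATTERN.search(line)
        if match is None:
            continue  # line is not in the assumed format
        if token.lower() not in match["agent"].lower():
            continue  # request came from a different agent
        path = match["path"]
        if path == "/robots.txt":
            continue  # fetching robots.txt itself is expected
        if not parser.can_fetch(token, path):
            violations.append(path)
    return violations


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    print("crawler4j allowed on /:", parser.can_fetch("crawler4j", "/"))
    print("Disallowed paths requested:", find_violations(SAMPLE_LOG, parser))
```

With the sample lines above, only the request for /private/page is reported. Matching on the user agent string is a heuristic: a client can send any string it likes, so an empty result does not prove that no crawler4j-based crawler visited.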