crawler4j
What is crawler4j?
About
crawler4j is an uncategorized agent. If you think this is incorrect or can provide additional detail about its purpose, please contact us. You can see how often crawler4j visits your website by setting up Dark Visitors agent analytics.
Expected Behavior
Behavior will vary depending on whether this agent is a search engine crawler, data scraper, archiver, one-off fetcher, etc.
Type
Uncategorized
Not currently assigned a type
Detail
Last Updated: 9 hours ago
Insights
Country of Origin
United States
crawler4j normally visits from the United States
Global Traffic
The percentage of all internet traffic coming from Uncategorized Agents
Robots.txt
Should I Block crawler4j?
It's difficult to say without a type. Its purpose could be either good or bad for your website, depending on what it is.
How Do I Block crawler4j?
⚠️ Manual Robots.txt Edits Are Not Scalable
New agents are created every day, so a hand-maintained robots.txt quickly falls out of date. Instead of editing it manually, serve a continuously updating robots.txt that blocks new agents automatically.
You can block crawler4j or limit its access by setting user agent token rules in your website's robots.txt. Set up Dark Visitors agent analytics to check whether it's actually following them.
User Agent String: crawler4j (https://github.com/yasserg/crawler4j/)
# robots.txt
# This should block crawler4j
User-agent: crawler4j
Disallow: /
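
Before deploying the rule above, it can help to confirm that it parses the way you expect. The following is a minimal sketch using Python's standard `urllib.robotparser` module. The helper name `is_allowed`, the `example.com` URL, and the `SomeOtherBot` agent are illustrative assumptions, not part of the original page. It only checks how a standards-following parser reads the rule; it does not show whether crawler4j itself obeys it.

```python
from urllib.robotparser import RobotFileParser

# The robots.txt rule from above, inlined so the sketch is self-contained.
ROBOTS_TXT = """\
# robots.txt
# This should block crawler4j
User-agent: crawler4j
Disallow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if robots_txt permits user_agent to fetch url.

    Hypothetical helper for illustration; it wraps the standard
    RobotFileParser so no network request is made.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Full user agent string as listed for crawler4j.
    crawler_ua = "crawler4j (https://github.com/yasserg/crawler4j/)"
    # example.com is a placeholder; substitute a URL on your own site.
    print(is_allowed(ROBOTS_TXT, crawler_ua, "https://example.com/page"))       # False
    # An unrelated, made-up agent is unaffected by the crawler4j rule.
    print(is_allowed(ROBOTS_TXT, "SomeOtherBot", "https://example.com/page"))   # True
```

Python's parser matches the `User-agent` token as a case-insensitive substring of the visiting agent's name, so the full user agent string is caught by the `crawler4j` token. Other parsers may match differently, which is one more reason to check real traffic rather than rely on the rule alone.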