Scrapy

What is Scrapy?

About

Scrapy is a scraper operated by Zyte. If you think this is incorrect or can provide additional detail about its purpose, please contact us. You can see how often Scrapy visits your website by setting up Dark Visitors Agent Analytics.

Agent Type

Scraper
Downloads web content for possibly malicious purposes

Expected Behavior

Scrapers extract data from websites for various purposes including research, price monitoring, content aggregation, lead generation, and unauthorized copying. Their behavior is highly unpredictable due to diverse use cases and operators. Scrapers are frequently configured to ignore robots.txt rules and may aggressively crawl sites to achieve their collection goals. They can range from respectful tools that identify themselves clearly, to aggressive bots that disguise their identity, overwhelm servers, and extract content without permission.

Detail

Operated By Zyte
Last Updated 2 hours ago

Insights

Top Website Robots.txts

7%
7% of top websites are blocking Scrapy
Learn How →

Country of Origin

Belgium
Scrapy normally visits from Belgium

Global Traffic

The percentage of all internet traffic coming from Scrapers

Top Visited Website Categories

Computers and Electronics
Business and Industrial
Jobs and Education
People and Society
Finance
How Do I Get These Insights for My Website?
Use the WordPress plugin, Node.js package, or API to get started in seconds.

Robots.txt

Should I Block Scrapy?

Probably. Scrapers usually download publicly available internet content, which is freely accessible by default. However, you might want to block them if you don't want your content to be used for unauthorized purposes.

How Do I Block Scrapy?

You can block Scrapy or limit its access by setting user agent token rules in your website's robots.txt. Set up Dark Visitors Agent Analytics to check whether it's actually following them.

How Do I Block All Scrapers?
Serve a continuously updating robots.txt that blocks new scrapers automatically.
User Agent String Scrapy/2.12.0 (+https://scrapy.org)
# In your robots.txt ...

User-agent: Scrapy # https://darkvisitors.com/agents/scrapy
Disallow: /

⚠️ Manual Robots.txt Editing Is Not Scalable

New agents are created every day. We recommend setting up Dark Visitors Automatic Robots.txt if you want to block all agents of this type.