scrapingrobot.com
robots.txt

Robots Exclusion Standard data for scrapingrobot.com

Resource Scan

Scanned	2025-07-02T07:13:31+00:00
URL	https://scrapingrobot.com/robots.txt
Domain IPs	104.21.31.233, 172.67.180.190, 2606:4700:3031::6815:1fe9, 2606:4700:3036::ac43:b4be
Response IP	172.67.180.190
Found	Yes
Hash	e877c041c69e367bf0a5ecc9f54116070df00a685f1972cd2add583d986b1434
SimHash	d9405c18e993

Rule

Path

Disallow

/wp-admin/

Disallow

/wp-includes/

Disallow

/legacy/*

Disallow

/blog/cases/

Disallow

/api-modules/

Back to top