scrapingrobot.com
robots.txt
Robots Exclusion Standard data for scrapingrobot.com
Resource Scan
Scan Details
Site Domain | scrapingrobot.com |
Base Domain | scrapingrobot.com |
Scan Status | Ok |
Last Scan | 2025-07-02T07:13:31+00:00 |
Next Scan | 2025-08-01T07:13:31+00:00 |
Last Scan
Scanned | 2025-07-02T07:13:31+00:00 |
URL | https://scrapingrobot.com/robots.txt |
Domain IPs | 104.21.31.233, 172.67.180.190, 2606:4700:3031::6815:1fe9, 2606:4700:3036::ac43:b4be |
Response IP | 172.67.180.190 |
Found | Yes |
Hash | e877c041c69e367bf0a5ecc9f54116070df00a685f1972cd2add583d986b1434 |
SimHash | d9405c18e993 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Disallow | /legacy/* |
Disallow | /blog/cases/ |
Disallow | /api-modules/ |