progscrape.com
robots.txt

Robots Exclusion Standard data for progscrape.com

Resource Scan

Scan Details

Site Domain progscrape.com
Base Domain progscrape.com
Scan Status Ok
Last Scan2024-09-05T10:22:54+00:00
Next Scan 2024-10-05T10:22:54+00:00

Last Scan

Scanned2024-09-05T10:22:54+00:00
URL https://progscrape.com/robots.txt
Domain IPs 104.21.64.3, 172.67.173.182, 2606:4700:3032::6815:4003, 2606:4700:3033::ac43:adb6
Response IP 172.67.173.182
Found Yes
Hash 4d41296ddc461b812749c42691316268642da2a7982203a0895bf1bad5e7b4f6
SimHash 0c495c64419f

Groups

*

Rule Path
Disallow /s/*
Disallow /feed*

Other Records

Field Value
crawl-delay 600

googlebot

Rule Path
Disallow /s/*
Disallow /feed.json

Comments

  • Sitemap: http://www.progscrape.com/sitemap.xml