Robots Exclusion Standard data for rpmweb.ca

Resource Scan

Scan Details

Site Domain rpmweb.ca
Base Domain rpmweb.ca
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-26T12:39:44+00:00
Next Scan 2024-06-25T12:39:44+00:00

Last Successful Scan

Scanned 2024-04-27T12:37:46+00:00
URL https://rpmweb.ca/robots.txt
Domain IPs 104.21.5.19, 172.67.132.191, 2606:4700:3034::6815:513, 2606:4700:3037::ac43:84bf
Response IP 172.67.132.191
Found Yes
Hash 0b7c481facf50fa29e0540ef5d5ab13ae928f8937357138e8f74efb3e2d058c4
SimHash e1581d562693

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
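
The rules in the `*` group can be checked against individual paths with Python's standard `urllib.robotparser`. The sketch below is only an illustration: the `User-agent`/`Disallow` lines are reconstructed from the table above rather than copied from the original file, and the agent name `ExampleBot` and the helper names are assumptions.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above (assumption: the original
# file used this conventional "Field: value" layout).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /cpresources/",
    "Disallow: /vendor/",
    "Disallow: /.env",
    "Disallow: /cache/",
]


def build_parser(lines):
    """Parse robots.txt lines in memory, without any network request."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="ExampleBot"):
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, "https://rpmweb.ca" + path)


if __name__ == "__main__":
    rules = build_parser(ROBOTS_LINES)
    for path in ("/", "/vendor/autoload.php", "/.env", "/about"):
        print(path, is_allowed(rules, path))
```

Matching here is by path prefix, so a rule such as `/vendor/` covers everything beneath that directory.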

Other Records

Field Value
sitemap https://rpmweb.ca/sitemaps-1-sitemap.xml
sitemap https://crinque.rpmweb.ca/sitemaps-1-sitemap.xml
sitemap https://zone-adrenaline.rpmweb.ca/sitemaps-1-sitemap.xml
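
The sitemap records above span the base domain and two subdomains. As a rough sketch, `RobotFileParser.site_maps()` (available in Python 3.8 and later) can list them and the host of each can be extracted; the `Sitemap:` lines are reconstructed from the table, and the function name `sitemap_hosts` is hypothetical.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table (assumed file layout).
SITEMAP_LINES = [
    "Sitemap: https://rpmweb.ca/sitemaps-1-sitemap.xml",
    "Sitemap: https://crinque.rpmweb.ca/sitemaps-1-sitemap.xml",
    "Sitemap: https://zone-adrenaline.rpmweb.ca/sitemaps-1-sitemap.xml",
]


def sitemap_hosts(lines):
    """Return the host name of each sitemap URL, in file order."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap records were found.
    sitemaps = parser.site_maps() or []
    return [urlsplit(url).hostname for url in sitemaps]


if __name__ == "__main__":
    for host in sitemap_hosts(SITEMAP_LINES):
        print(host)
```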

Comments

  • robots.txt for https://rpmweb.ca/
  • live - don't allow web crawlers to index cpresources/ or vendor/