guideline.com
robots.txt

Robots Exclusion Standard data for guideline.com

Resource Scan

Scan Details

Site Domain guideline.com
Base Domain guideline.com
Scan Status Ok
Last Scan 2025-08-14T14:16:11+00:00
Next Scan 2025-08-28T14:16:11+00:00

Last Scan

Scanned 2025-08-14T14:16:11+00:00
URL https://guideline.com/robots.txt
Redirect https://www.guideline.com/robots.txt
Redirect Domain www.guideline.com
Redirect Base guideline.com
Domain IPs 172.66.40.187, 172.66.43.69, 2606:4700:3108::ac42:28bb, 2606:4700:3108::ac42:2b45
Redirect IPs 172.66.40.187, 172.66.43.69, 2606:4700:3108::ac42:28bb, 2606:4700:3108::ac42:2b45
Response IP 172.66.43.69
Found Yes
Hash 62b16646ef20ae5d4a2741935557a826c00fc88ac1dc85b76e0d7307d8974fd3
SimHash 69408e51e630

Groups

*

Rule Path
Allow /
Disallow /gfa-pipeline
Disallow /pipeline
Disallow /cdn-cgi/
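
The rules in the `*` group can be evaluated with the longest-match precedence described in RFC 9309: the matching rule with the longest path wins, and Allow wins a tie. The sketch below is a minimal illustration under that assumption, not the scanner's own logic. It ignores the `*` and `$` wildcards and percent-encoding, and the function name `is_allowed` and the sample paths in the usage note are hypothetical.

```python
# Rules copied from the "*" group above, as (rule, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/gfa-pipeline"),
    ("disallow", "/pipeline"),
    ("disallow", "/cdn-cgi/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Simplified: plain prefix matching only, no wildcard support.
    """
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

For example, `is_allowed("/pricing")` is true because only `Allow /` matches, while `is_allowed("/pipeline/status")` and `is_allowed("/cdn-cgi/trace")` are false because a longer Disallow path matches. Note that `/cdn-cgi` without the trailing slash stays allowed, since the rule path is `/cdn-cgi/`.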

Other Records

Field Value
sitemap https://www.guideline.com/sitemap.xml

Comments

  • *
  • Host
  • Sitemaps

Warnings

  • `host` is not a known field.
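
A warning like the one above could be produced by comparing each field name in the file against a list of recognized fields. The sketch below assumes a known-field set of `user-agent`, `allow`, `disallow`, and `sitemap`; the scanner's actual list is not shown in this report. The `SAMPLE` text is a made-up input for illustration, not the contents of the scanned file.

```python
# Assumed set of recognized fields; the real scanner's list may differ.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(text):
    """Return unrecognized field names in order of first appearance."""
    found = []
    for raw in text.splitlines():
        # Drop comments and surrounding whitespace.
        line = raw.split("#", 1)[0].strip()
        if not line or ":" not in line:
            continue
        # The field name is everything before the first colon.
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical input, not the actual guideline.com robots.txt.
SAMPLE = """\
User-agent: *
Allow: /
Host: example.com
Sitemap: https://example.com/sitemap.xml
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```

Field names are compared case-insensitively, so `Host` and `host` are treated as the same field.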