headbox.com
robots.txt
Robots Exclusion Standard data for headbox.com
Resource Scan
Scan Details
Site Domain | headbox.com |
Base Domain | headbox.com |
Scan Status | Ok |
Last Scan | 2024-05-18T08:50:03+00:00 |
Next Scan | 2024-06-17T08:50:03+00:00 |
Last Scan
Scanned | 2024-05-18T08:50:03+00:00 |
URL | https://headbox.com/robots.txt |
Redirect | https://www.headbox.com/robots.txt |
Redirect Domain | www.headbox.com |
Redirect Base | headbox.com |
Domain IPs | 13.227.254.101, 13.227.254.116, 13.227.254.18, 13.227.254.59 |
Redirect IPs | 18.154.7.124, 18.154.7.64, 18.154.7.81, 18.154.7.92 |
Response IP | 13.227.254.18 |
Found | Yes |
Hash | e7e07f0a3a051619840b16a71c53b87d5eb25a93b6293debf306b94c9c462c6f |
SimHash | aa003dc50ff3 |
Groups
*
Rule | Path |
---|---|
Disallow | /admin/ |
Disallow | /users/edit/ |
Disallow | /bookings/ |
Disallow | /booked_slots/ |
Disallow | /messages/ |
Disallow | /preferredvenues/ |
Disallow | /*more_from_host$ |
Disallow | /*similar_spaces$ |
Disallow | /search_results |
Disallow | /spaces/*/viewing$ |
Disallow | /spaces/*/bespoke_package$ |
Disallow | /docs/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.headbox.com/sitemap.xml.gz |
sitemap | https://headbox.com/sitemap-ilp.xml.gz |
Comments