headbox.com
robots.txt

Robots Exclusion Standard data for headbox.com

Resource Scan

Scan Details

Site Domain headbox.com
Base Domain headbox.com
Scan Status Ok
Last Scan 2024-05-18T08:50:03+00:00
Next Scan 2024-06-17T08:50:03+00:00

Last Scan

Scanned 2024-05-18T08:50:03+00:00
URL https://headbox.com/robots.txt
Redirect https://www.headbox.com/robots.txt
Redirect Domain www.headbox.com
Redirect Base headbox.com
Domain IPs 13.227.254.101, 13.227.254.116, 13.227.254.18, 13.227.254.59
Redirect IPs 18.154.7.124, 18.154.7.64, 18.154.7.81, 18.154.7.92
Response IP 13.227.254.18
Found Yes
Hash e7e07f0a3a051619840b16a71c53b87d5eb25a93b6293debf306b94c9c462c6f
SimHash aa003dc50ff3

Groups

*

Rule Path
Disallow /admin/
Disallow /users/edit/
Disallow /bookings/
Disallow /booked_slots/
Disallow /messages/
Disallow /preferredvenues/
Disallow /*more_from_host$
Disallow /*similar_spaces$
Disallow /search_results
Disallow /spaces/*/viewing$
Disallow /spaces/*/bespoke_package$
Disallow /docs/
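
Several of the paths above use the `*` wildcard and the `$` end-of-path anchor, which Python's standard `urllib.robotparser` does not interpret. The following is a minimal sketch of how those rules could be evaluated, assuming the common interpretation in which `*` matches any run of characters, `$` anchors the end of the path, and everything else is a literal prefix. The names `DISALLOW`, `rule_to_regex` and `is_disallowed` are illustrative and not part of the scan. Because this group contains only Disallow rules, no Allow/Disallow precedence logic is needed here.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW = [
    "/admin/",
    "/users/edit/",
    "/bookings/",
    "/booked_slots/",
    "/messages/",
    "/preferredvenues/",
    "/*more_from_host$",
    "/*similar_spaces$",
    "/search_results",
    "/spaces/*/viewing$",
    "/spaces/*/bespoke_package$",
    "/docs/",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' means "any sequence of characters" and a trailing
    '$' means "the path must end here"; all other characters are literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces, then join them with '.*' for each '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    if anchored:
        pattern += r"\Z"
    return re.compile(pattern)


RULES = [rule_to_regex(rule) for rule in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rx.match(path) for rx in RULES)
```

Under these assumptions, `is_disallowed("/spaces/123/viewing")` is true while `is_disallowed("/spaces/123/viewing/confirm")` is false, because the `$` anchor requires the path to end at `viewing`. A rule without `$`, such as `/search_results`, behaves as a plain prefix and also covers `/search_results?page=2`.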

Other Records

Field Value
sitemap https://www.headbox.com/sitemap.xml.gz
sitemap https://headbox.com/sitemap-ilp.xml.gz
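
Both sitemap records point at `.xml.gz` files. As a rough sketch, assuming these are gzip-compressed XML documents in the standard sitemaps.org 0.9 namespace (the scan does not confirm their contents), the listed URLs could be extracted from already-downloaded bytes as follows. The function name `extract_locs` is hypothetical, and fetching the files is deliberately left out.

```python
import gzip
import xml.etree.ElementTree as ET

# Namespace used by both <urlset> and <sitemapindex> documents
# in the sitemaps.org 0.9 schema (assumed, not confirmed by the scan).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def extract_locs(gz_bytes: bytes) -> list[str]:
    """Decompress a gzipped sitemap and return the text of every <loc>."""
    root = ET.fromstring(gzip.decompress(gz_bytes))
    return [
        el.text.strip()
        for el in root.iter(f"{SITEMAP_NS}loc")
        if el.text
    ]
```

Collecting every `<loc>` element regardless of its parent lets the same function handle both a plain URL set and a sitemap index, at the cost of not distinguishing between the two.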

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file