thehabitat.com
robots.txt

Robots Exclusion Standard data for thehabitat.com

Resource Scan

Scanned	2024-11-18T04:01:54+00:00
URL	https://thehabitat.com/robots.txt
Domain IPs	104.17.119.107, 104.17.120.107, 2606:4700::6811:776b, 2606:4700::6811:786b
Response IP	104.17.119.107
Found	Yes
Hash	a424257e3c068ca0931f4076bd876150a9b48dce9ef4a1bc14694330092d4a71
SimHash	29199850ed10

Rule	Path
Disallow	/cdn-cgi/*
Disallow	/wp/*
Disallow	/app/plugins/*
Disallow	/wp-json*
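
As a rough illustration of how the Disallow patterns listed above might be evaluated, the following Python sketch treats `*` as "any sequence of characters" and a trailing `$` as an end-of-path anchor, which is the wildcard convention described in RFC 9309. The function names `pattern_to_regex` and `is_disallowed` are hypothetical, and because the scan does not record which user-agent group the rules belong to, the sketch simply assumes they apply to the crawler being checked. It also ignores Allow rules and longest-match precedence, since the scan lists none.

```python
import re

# Disallow patterns copied from the scan above.
DISALLOW = ["/cdn-cgi/*", "/wp/*", "/app/plugins/*", "/wp-json*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the pattern to the end of the path (RFC 9309 convention).
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if the URL path matches any Disallow pattern.

    re.match anchors at the start of the path, so each pattern
    behaves as a prefix match unless it ends in '$'.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, a call such as `is_disallowed("/wp-json/wp/v2/posts")` reports the path as blocked, while `is_disallowed("/wp-content/uploads/a.png")` does not, because `/wp/*` requires the slash immediately after `/wp`.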


Field	Value
sitemap	https://thehabitat.com/sitemap-index.xml
