glean.com
robots.txt

Robots Exclusion Standard data for glean.com

Resource Scan

Scan Details

Site Domain glean.com
Base Domain glean.com
Scan Status Ok
Last Scan2024-11-01T20:09:49+00:00
Next Scan 2024-12-01T20:09:49+00:00

Last Scan

Scanned2024-11-01T20:09:49+00:00
URL https://glean.com/robots.txt
Redirect https://www.glean.com/robots.txt
Redirect Domain www.glean.com
Redirect Base glean.com
Domain IPs 75.2.70.75, 99.83.190.102
Redirect IPs 52.197.0.54, 52.199.221.217, 54.178.223.218
Response IP 54.178.223.218
Found Yes
Hash 87ee15db02972ef59ade091e271d6a27269464e2e03c282163ca5592280bd0aa
SimHash a9094805cd92

Groups

*

Rule Path
Disallow /internal/
Disallow /new-pages/

Other Records

Field Value
sitemap https://www.glean.com/sitemap.xml
sitemap https://www.glean.com/sitemap.xml