workspace.google.com
robots.txt
Robots Exclusion Standard data for workspace.google.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | workspace.google.com |
Base Domain | google.com |
Scan Status | Ok |
Last Scan | 2024-05-30T22:25:28+00:00 |
Next Scan | 2024-06-29T22:25:28+00:00 |
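The two timestamps above are ISO 8601 values in UTC. As a quick check, the gap between them can be computed as sketched below; the variable names are illustrative, and reading the result as a fixed rescan interval is an inference from these two values, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the "Last Scan" and "Next Scan" rows.
last_scan = datetime.fromisoformat("2024-05-30T22:25:28+00:00")
next_scan = datetime.fromisoformat("2024-06-29T22:25:28+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```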
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-30T22:25:28+00:00 |
URL | https://workspace.google.com/robots.txt |
Domain IPs | 2404:6800:4003:c05::64, 2404:6800:4003:c05::65, 2404:6800:4003:c05::66, 2404:6800:4003:c05::8a, 64.233.170.100, 64.233.170.101, 64.233.170.102, 64.233.170.113, 64.233.170.138, 64.233.170.139 |
Response IP | 74.125.24.101 |
Found | Yes |
Hash | f63e91f536670aaf67f375a51e7b41a9bc11abbe0ba6e9885075d6953ceae515 |
SimHash | 41514114c7a9 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | / |
Disallow | /marketplace/ |
Allow | /intl/en_sg/ |
Disallow | /intl/*/ |
Disallow | /intl/*/customers/ |
Disallow | /learning-center/search |
Allow | /intl/en_sg/customers/ |
Disallow | /customers |
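Several of the rules above overlap (for example `/intl/en_sg/` and `/intl/*/`), so the outcome for a given path depends on how a crawler resolves conflicts. The sketch below assumes the longest-match precedence described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The report does not say which precedence the scanner itself applies, and parsers that use first-match ordering would give different answers. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative only.

```python
import re

# Rules copied from the table above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/marketplace/"),
    ("allow", "/intl/en_sg/"),
    ("disallow", "/intl/*/"),
    ("disallow", "/intl/*/customers/"),
    ("disallow", "/learning-center/search"),
    ("allow", "/intl/en_sg/customers/"),
    ("disallow", "/customers"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best = None  # (pattern length, is_allow); the larger tuple wins
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix rules require.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule at all, crawling is allowed by default.
    return True if best is None else best[1]


for path in ["/pricing", "/marketplace/app", "/intl/fr/", "/intl/en_sg/customers/"]:
    print(path, is_allowed(path))
```

Under this assumption, `/intl/en_sg/` and `/intl/en_sg/customers/` stay crawlable because their Allow patterns are longer than the competing wildcard Disallow patterns, while every other `/intl/*/` locale is blocked.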
Other Records
Field | Value |
---|---|
sitemap | https://workspace.google.co.uk/sitemap.xml |