awarehq.com
robots.txt
Robots Exclusion Standard data for awarehq.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | awarehq.com |
Base Domain | awarehq.com |
Scan Status | Ok |
Last Scan | 2024-08-26T00:54:10+00:00 |
Next Scan | 2024-09-25T00:54:10+00:00 |
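The gap between the two timestamps above can be checked with a few lines of Python. This is only a sketch of the arithmetic; the variable names are illustrative, and the report itself does not state how the next scan date is chosen.

```python
from datetime import datetime

# Timestamps copied from the Scan Details rows above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-08-26T00:54:10+00:00")
next_scan = datetime.fromisoformat("2024-09-25T00:54:10+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)
```

For this record the two scans are exactly 30 days apart.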
Last Scan
Field | Value |
---|---|
Scanned | 2024-08-26T00:54:10+00:00 |
URL | https://awarehq.com/robots.txt |
Redirect | https://www.awarehq.com/robots.txt |
Redirect Domain | www.awarehq.com |
Redirect Base | awarehq.com |
Domain IPs | 52.84.229.103, 52.84.229.123, 52.84.229.50, 52.84.229.65 |
Redirect IPs | 199.60.103.2, 199.60.103.254, 2606:2c40::c73c:6702, 2606:2c40::c73c:67fe |
Response IP | 199.60.103.2 |
Found | Yes |
Hash | 4c72f6f536deb4b9efa0dc95c5bf29cd8201f59f1ee72aff33e1cb551f7dab1b |
SimHash | 7845dee8ec92 |
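The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a later fetch of the file could be compared against it roughly as follows. The helper names `content_hash` and `matches_reported` are hypothetical.

```python
import hashlib

# Hash value copied from the table above.
REPORTED_HASH = "4c72f6f536deb4b9efa0dc95c5bf29cd8201f59f1ee72aff33e1cb551f7dab1b"

def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_reported(body: bytes) -> bool:
    """True if the body hashes to the value recorded by the scan."""
    return content_hash(body) == REPORTED_HASH
```

A mismatch would only show that the bytes differ from what was hashed at scan time; it could equally mean the assumption about the algorithm or the hashed input is wrong.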
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /sample-* |
Disallow | /blog/sample-* |
Disallow | /PDF/ |
Disallow | /_hcms/preview/ |
Disallow | /hs/manage-preferences/ |
Disallow | /hs/preferences-center/ |
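To see how these rules apply to a given URL path, the table above can be turned into a small matcher. The sketch below assumes the common wildcard semantics (a rule matches from the start of the path, `*` matches any run of characters, a trailing `$` anchors the end) and is not a full robots.txt parser. The function names `rule_to_regex` and `is_disallowed` are illustrative.

```python
import re

# Disallow rules for the "*" group, copied from the table above.
DISALLOW = [
    "/sample-*",
    "/blog/sample-*",
    "/PDF/",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one robots.txt path rule into a compiled regex."""
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal text and let each "*" match any run of characters.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches the start of the path.

    The group has no Allow rules, so no precedence handling is needed.
    """
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW)

for path in ["/sample-page", "/blog/sample-post", "/PDF/report.pdf", "/blog/"]:
    print(path, is_disallowed(path))
```

Matching is case-sensitive, so `/PDF/` does not cover `/pdf/`. Because rules already match as prefixes, the trailing `*` in `/sample-*` does not change which paths are blocked.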
Other Records
Field | Value |
---|---|
sitemap | https://www.awarehq.com/sitemap.xml |