globalwebindex.com
robots.txt
Robots Exclusion Standard data for globalwebindex.com
Resource Scan
Scan Details
Site Domain | globalwebindex.com |
Base Domain | globalwebindex.com |
Scan Status | Ok |
Last Scan | 2025-07-15T12:04:15+00:00 |
Next Scan | 2025-08-14T12:04:15+00:00 |
Last Scan
Scanned | 2025-07-15T12:04:15+00:00 |
URL | https://globalwebindex.com/robots.txt |
Redirect | https://www.gwi.com/robots.txt |
Redirect Domain | www.gwi.com |
Redirect Base | gwi.com |
Domain IPs | 34.96.99.201 |
Redirect IPs | 199.60.103.2, 199.60.103.254, 2606:2c40::c73c:6702, 2606:2c40::c73c:67fe |
Response IP | 199.60.103.254 |
Found | Yes |
Hash | b46010fabdd33462f29332a840a28a0ff128bcf84e99256a5b0250079afbe672 |
SimHash | 3ae58cf5c5b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /hubfs/304927/Downloads/ |
Disallow | /_hcms/preview/ |
Disallow | /hs/manage-preferences/ |
Disallow | /hs/preferences-center/ |
Disallow | /*?*hs_preview=* |
Disallow | /*?*hsCacheBuster=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.gwi.com/sitemap.xml |