wgi.org
robots.txt
Robots Exclusion Standard data for wgi.org
Resource Scan
Scan Details
Site Domain | wgi.org |
Base Domain | wgi.org |
Scan Status | Ok |
Last Scan | 2024-11-16T14:25:36+00:00 |
Next Scan | 2024-11-23T14:25:36+00:00 |
Last Scan
Scanned | 2024-11-16T14:25:36+00:00 |
URL | https://wgi.org/robots.txt |
Redirect | https://www.wgi.org/robots.txt |
Redirect Domain | www.wgi.org |
Redirect Base | wgi.org |
Domain IPs | 23.185.0.3, 2620:12a:8000::3, 2620:12a:8001::3 |
Redirect IPs | 151.101.131.52, 151.101.195.52, 151.101.3.52, 151.101.67.52 |
Response IP | 199.232.47.52 |
Found | Yes |
Hash | 5b30377fbdfd4708519b6f53c16110b7bc1ef7ac21b14b565e02da6acd6dd9e8 |
SimHash | 982dccc48c82 |
Groups
*
Rule | Path |
---|---|
Disallow | /?s= |
Disallow | /page/*/?s= |
Disallow | /search/ |
Disallow | /events/ |
Disallow | /group_seasons/ |
Disallow | /group_season_show/ |
Disallow | /venue/ |
Disallow | /show/ |
Disallow | /group/ |
Disallow | /orgs/ |
Disallow | /season/ |
Disallow | /event/ |
Other Records
Field | Value |
---|---|
sitemap | https://wgi.org/sitemap_index.xml |
Comments