wgrz.com
robots.txt
Robots Exclusion Standard data for wgrz.com
Resource Scan
Scan Details
Site Domain | wgrz.com |
Base Domain | wgrz.com |
Scan Status | Ok |
Last Scan | 2024-05-31T17:36:59+00:00 |
Next Scan | 2024-06-07T17:36:59+00:00 |
Last Scan
Scanned | 2024-05-31T17:36:59+00:00 |
URL | https://wgrz.com/robots.txt |
Redirect | https://www.wgrz.com/robots.txt |
Redirect Domain | www.wgrz.com |
Redirect Base | wgrz.com |
Domain IPs | 34.213.106.51, 54.68.182.72 |
Redirect IPs | 96.17.180.162, 96.17.180.179 |
Response IP | 23.48.107.67 |
Found | Yes |
Hash | 0d8b77ae7583119799a00dd16256cc129bd8e71d011513e53af5aec7ac7c9cf0 |
SimHash | 683c5d744c93 |
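
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. As a minimal sketch of how such a fingerprint could be used to detect changes between scans, the snippet below hashes a robots.txt body with SHA-256. The report does not state which bytes the scanner hashes (raw response, decoded text, or a normalized form), so this is an assumption and the result may not reproduce the value listed; `content_digest` and `has_changed` are hypothetical helper names.

```python
import hashlib


def content_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes. The report
    does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return content_digest(body) != previous_digest.lower()
```

An exact hash like this changes on any byte difference, which is presumably why the report also lists a SimHash, a similarity-preserving fingerprint; that algorithm is not sketched here.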
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /search/ |
Disallow | /monitor/home |
Disallow | /search |
Disallow | /search?= |
Disallow | /mobile/search/ |
Disallow | /mobile/monitor/home |
Disallow | /mobile/search |
Disallow | /mobile/search?= |
Disallow | /search |
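
To illustrate how a crawler would apply the rules in this group, here is a small sketch that uses Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the table above, so its exact wording and whitespace are assumptions; the `ROBOTS_LINES` list, the `is_allowed` helper, and the sample paths are hypothetical and used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above (assumed layout, not the raw file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /ajax/",
    "Disallow: /search/",
    "Disallow: /monitor/home",
    "Disallow: /search",
    "Disallow: /search?=",
    "Disallow: /mobile/search/",
    "Disallow: /mobile/monitor/home",
    "Disallow: /mobile/search",
    "Disallow: /mobile/search?=",
    "Disallow: /search",
]

# Parse the rules offline instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def is_allowed(path: str, agent: str = "*") -> bool:
    """Return True if `agent` may fetch `path` on www.wgrz.com."""
    return parser.can_fetch(agent, "https://www.wgrz.com" + path)
```

Because Disallow values are matched as path prefixes, `/search` already covers `/search/` and `/search?=`, and `/mobile/search` covers its two variants, so several rows in the table are redundant rather than conflicting.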
Other Records
Field | Value |
---|---|
sitemap | https://www.wgrz.com/sitemap.xml |
sitemap | https://www.wgrz.com/feeds/googlenews |
sitemap | https://www.wgrz.com/feeds/googlevideos |
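
The sitemap records can be read with the same standard-library parser. The sketch below assumes the records appear in the file as ordinary `Sitemap:` lines (the report only shows field and value) and uses `RobotFileParser.site_maps()`, which is available in Python 3.8 and later; `SITEMAP_LINES` and `list_sitemaps` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Assumed raw form of the "Other Records" table above.
SITEMAP_LINES = [
    "Sitemap: https://www.wgrz.com/sitemap.xml",
    "Sitemap: https://www.wgrz.com/feeds/googlenews",
    "Sitemap: https://www.wgrz.com/feeds/googlevideos",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in robots.txt lines, in file order."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap records are present.
    return parser.site_maps() or []
```

Sitemap records are independent of any user-agent group, which matches how the report lists them separately from the rule table.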