wgrztv.com
robots.txt
Robots Exclusion Standard data for wgrztv.com
Resource Scan
Scan Details
Site Domain | wgrztv.com |
Base Domain | wgrztv.com |
Scan Status | Ok |
Last Scan | 2024-11-08T09:02:59+00:00 |
Next Scan | 2024-11-15T09:02:59+00:00 |
Last Scan
Scanned | 2024-11-08T09:02:59+00:00 |
URL | https://wgrztv.com/robots.txt |
Redirect | https://www.wgrz.com/robots.txt |
Redirect Domain | www.wgrz.com |
Redirect Base | wgrz.com |
Domain IPs | 34.213.106.51, 54.68.182.72 |
Redirect IPs | 184.50.85.140, 184.50.85.164, 96.17.180.162, 96.17.180.179, 96.17.180.186, 96.17.180.32 |
Response IP | 184.50.85.140 |
Found | Yes |
Hash | 0d8b77ae7583119799a00dd16256cc129bd8e71d011513e53af5aec7ac7c9cf0 |
SimHash | 683c5d744c93 |
Groups
*
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /search/ |
Disallow | /monitor/home |
Disallow | /search |
Disallow | /search?= |
Disallow | /mobile/search/ |
Disallow | /mobile/monitor/home |
Disallow | /mobile/search |
Disallow | /mobile/search?= |
Disallow | /search |
Other Records
Field | Value |
---|---|
sitemap | https://www.wgrz.com/sitemap.xml |
sitemap | https://www.wgrz.com/feeds/googlenews |
sitemap | https://www.wgrz.com/feeds/googlevideos |