wlbz2.com
robots.txt
Robots Exclusion Standard data for wlbz2.com
Resource Scan
Scan Details
Site Domain | wlbz2.com |
Base Domain | wlbz2.com |
Scan Status | Ok |
Last Scan | 2024-06-23T18:00:15+00:00 |
Next Scan | 2024-06-30T18:00:15+00:00 |
Last Scan
Scanned | 2024-06-23T18:00:15+00:00 |
URL | https://wlbz2.com/robots.txt |
Redirect | https://www.newscentermaine.com/robots.txt |
Redirect Domain | www.newscentermaine.com |
Redirect Base | newscentermaine.com |
Domain IPs | 34.213.106.51, 54.68.182.72 |
Redirect IPs | 184.50.85.161, 184.50.85.170 |
Response IP | 184.50.85.129 |
Found | Yes |
Hash | e4ee877da7ef0a4784c8afdeda8aa9e73a74e58c278cdadaef8255c52e98b8a8 |
SimHash | 783cdc58049b |
Groups
*
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /search/ |
Disallow | /monitor/home |
Disallow | /search |
Disallow | /search?= |
Disallow | /mobile/search/ |
Disallow | /mobile/monitor/home |
Disallow | /mobile/search |
Disallow | /mobile/search?= |
Disallow | /search |
Other Records
Field | Value |
---|---|
sitemap | https://www.newscentermaine.com/sitemap.xml |
sitemap | https://www.newscentermaine.com/feeds/googlenews |
sitemap | https://www.newscentermaine.com/feeds/googlevideos |