newscentermaine.com
robots.txt
Robots Exclusion Standard data for newscentermaine.com
Resource Scan
Scan Details
Site Domain | newscentermaine.com |
Base Domain | newscentermaine.com |
Scan Status | Ok |
Last Scan | 2024-09-25T09:46:24+00:00 |
Next Scan | 2024-10-02T09:46:24+00:00 |
Last Scan
Scanned | 2024-09-25T09:46:24+00:00 |
URL | https://newscentermaine.com/robots.txt |
Redirect | https://www.newscentermaine.com/robots.txt |
Redirect Domain | www.newscentermaine.com |
Redirect Base | newscentermaine.com |
Domain IPs | 34.213.106.51, 54.68.182.72 |
Redirect IPs | 23.59.168.107 |
Response IP | 23.59.168.107 |
Found | Yes |
Hash | e4ee877da7ef0a4784c8afdeda8aa9e73a74e58c278cdadaef8255c52e98b8a8 |
SimHash | 783cdc58049b |
Groups
*
Rule | Path |
---|---|
Disallow | /ajax/ |
Disallow | /search/ |
Disallow | /monitor/home |
Disallow | /search |
Disallow | /search?= |
Disallow | /mobile/search/ |
Disallow | /mobile/monitor/home |
Disallow | /mobile/search |
Disallow | /mobile/search?= |
Disallow | /search |
Other Records
Field | Value |
---|---|
sitemap | https://www.newscentermaine.com/sitemap.xml |
sitemap | https://www.newscentermaine.com/feeds/googlenews |
sitemap | https://www.newscentermaine.com/feeds/googlevideos |