warwickadvertiser.com
robots.txt
Robots Exclusion Standard data for warwickadvertiser.com
Resource Scan
Scan Details
Site Domain | warwickadvertiser.com |
Base Domain | warwickadvertiser.com |
Scan Status | Ok |
Last Scan | 2024-11-15T04:41:19+00:00 |
Next Scan | 2024-11-22T04:41:19+00:00 |
Last Scan
Scanned | 2024-11-15T04:41:19+00:00 |
URL | https://warwickadvertiser.com/robots.txt |
Redirect | https://www.warwickadvertiser.com/robots.txt |
Redirect Domain | www.warwickadvertiser.com |
Redirect Base | warwickadvertiser.com |
Domain IPs | 129.213.199.43, 129.213.77.43 |
Redirect IPs | 129.213.199.43, 129.213.77.43 |
Response IP | 129.213.199.43 |
Found | Yes |
Hash | ad0acd7f6a29a31623c75c9a5896a14aa369bd4f3373287667b4f3a24302cda4 |
SimHash | 9c5b78da8692 |
Groups
*
Rule | Path |
---|---|
Disallow | /news-portlet/metalocator/ |
Disallow | /news-portlet/html/teaser-viewer-portlet/teaser_page.jsp |
Disallow | /news-portlet/html/teaser-viewer-portlet/teaser_filter.jsp |
Disallow | /news-portlet/filterteaser/ |
Disallow | /news-portlet/getfilteropts/ |
Disallow | /tracking-portlet/html/ranking-viewer/ranking_details.jsp |
Disallow | /user-portlet/login-with/ |
Disallow | /user-portlet/edit-user-profile/ |
Disallow | /user-portlet/reset-credentials/ |
Disallow | /user-portlet/confirm-email/ |
Disallow | /user-portlet/refreshuserentitlements/ |
Disallow | /user-portlet/getEntitlements/ |
Disallow | /group/ |
Disallow | /user/ |
Disallow | /web/ |
Disallow | /image/ |
*
Rule | Path |
---|---|
Disallow | /news/police-fire |
Other Records
Field | Value |
---|---|
sitemap | https://www.warwickadvertiser.com/sitemap.xml |
sitemap | https://www.warwickadvertiser.com/sitemapforgoogle.xml |
sitemap | https://www.warwickadvertiser.com/megasitemap.xml |
Comments