warwickadvertiser.com
robots.txt

Robots Exclusion Standard data for warwickadvertiser.com

Resource Scan

Scan Details

Site Domain warwickadvertiser.com
Base Domain warwickadvertiser.com
Scan Status Ok
Last Scan2024-11-15T04:41:19+00:00
Next Scan 2024-11-22T04:41:19+00:00

Last Scan

Scanned2024-11-15T04:41:19+00:00
URL https://warwickadvertiser.com/robots.txt
Redirect https://www.warwickadvertiser.com/robots.txt
Redirect Domain www.warwickadvertiser.com
Redirect Base warwickadvertiser.com
Domain IPs 129.213.199.43, 129.213.77.43
Redirect IPs 129.213.199.43, 129.213.77.43
Response IP 129.213.199.43
Found Yes
Hash ad0acd7f6a29a31623c75c9a5896a14aa369bd4f3373287667b4f3a24302cda4
SimHash 9c5b78da8692

Groups

*

Rule Path
Disallow /news-portlet/metalocator/
Disallow /news-portlet/html/teaser-viewer-portlet/teaser_page.jsp
Disallow /news-portlet/html/teaser-viewer-portlet/teaser_filter.jsp
Disallow /news-portlet/filterteaser/
Disallow /news-portlet/getfilteropts/
Disallow /tracking-portlet/html/ranking-viewer/ranking_details.jsp
Disallow /user-portlet/login-with/
Disallow /user-portlet/edit-user-profile/
Disallow /user-portlet/reset-credentials/
Disallow /user-portlet/confirm-email/
Disallow /user-portlet/refreshuserentitlements/
Disallow /user-portlet/getEntitlements/
Disallow /group/
Disallow /user/
Disallow /web/
Disallow /image/

ia_archiver

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

slurp

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /news/police-fire

Other Records

Field Value
sitemap https://www.warwickadvertiser.com/sitemap.xml
sitemap https://www.warwickadvertiser.com/sitemapforgoogle.xml
sitemap https://www.warwickadvertiser.com/megasitemap.xml

Comments

  • Known harmful agents