reece.co.nz
robots.txt

Robots Exclusion Standard data for reece.co.nz

Resource Scan

Scan Details

Site Domain reece.co.nz
Base Domain reece.co.nz
Scan Status Ok
Last Scan2024-06-17T10:30:13+00:00
Next Scan 2024-07-01T10:30:13+00:00

Last Scan

Scanned2024-06-17T10:30:13+00:00
URL https://reece.co.nz/robots.txt
Redirect https://www.reece.co.nz/robots.txt
Redirect Domain www.reece.co.nz
Redirect Base reece.co.nz
Domain IPs 108.156.133.20, 108.156.133.3, 108.156.133.62, 108.156.133.70
Redirect IPs 13.33.88.112, 13.33.88.114, 13.33.88.61, 13.33.88.93, 2600:9000:223b:4600:8:ac8b:8700:93a1, 2600:9000:223b:7000:8:ac8b:8700:93a1, 2600:9000:223b:8e00:8:ac8b:8700:93a1, 2600:9000:223b:9c00:8:ac8b:8700:93a1, 2600:9000:223b:a600:8:ac8b:8700:93a1, 2600:9000:223b:ba00:8:ac8b:8700:93a1, 2600:9000:223b:ce00:8:ac8b:8700:93a1, 2600:9000:223b:dc00:8:ac8b:8700:93a1
Response IP 13.33.88.114
Found Yes
Hash 8c42eecfb8292f8100ca66272f7984b5340fe0c934011a132c1140bb2fd63272
SimHash a819c410ab71

Groups

*

Rule Path
Allow /
Disallow /admin
Disallow /page-not-found
Disallow /server-error
Disallow /response-24-7

ahrefsbot
almaden
arquivo-web-crawler
aspseek
baiduspider
dumbbot
generic
grub-client
mj12bot
msiecrawler
nexabot
npbot
owr_crawler
psbot
rpt-httpclient
scoutabout
semanticdiscovery
turnitinbot
twiceler
wget
yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.reece.co.nz/sitemap.xml

Warnings

  • 1 invalid line.