midcanterburynz.com
robots.txt
Robots Exclusion Standard data for midcanterburynz.com
Resource Scan
Scan Details
Site Domain | midcanterburynz.com |
Base Domain | midcanterburynz.com |
Scan Status | Ok |
Last Scan | 2025-09-02T09:59:15+00:00 |
Next Scan | 2025-10-02T09:59:15+00:00 |
Last Scan
Scanned | 2025-09-02T09:59:15+00:00 |
URL | https://midcanterburynz.com/robots.txt |
Redirect | https://midcanterbury.co.nz/robots.txt |
Redirect Domain | midcanterbury.co.nz |
Redirect Base | midcanterbury.co.nz |
Domain IPs | 104.26.0.203, 104.26.1.203, 172.67.68.222, 2606:4700:20::681a:1cb, 2606:4700:20::681a:cb, 2606:4700:20::ac43:44de |
Redirect IPs | 104.21.66.230, 172.67.209.26, 2606:4700:3031::6815:42e6, 2606:4700:3034::ac43:d11a |
Response IP | 104.21.66.230 |
Found | Yes |
Hash | 1e6f96ef855041a3a3676e2a04722c820c0a920ef5e8021a2c75d8466d5468b3 |
SimHash | d61ec840d4b1 |
Groups
nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler
Rule | Path |
---|---|
Disallow | / |
gptbot
chatgpt-user
claude-web
anthropic-ai
applebot-extended
bytespider
ccbot
cohere-ai
diffbot
facebookbot
google-extended
imagesiftbot
perplexitybot
omigilibot
omigili
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /manage |
Other Records
Field | Value |
---|---|
sitemap | https://midcanterbury.co.nz/sitemap.xml |
Comments