midcanterburynz.com
robots.txt

Robots Exclusion Standard data for midcanterburynz.com

Resource Scan

Scan Details

Site Domain midcanterburynz.com
Base Domain midcanterburynz.com
Scan Status Ok
Last Scan2025-09-02T09:59:15+00:00
Next Scan 2025-10-02T09:59:15+00:00

Last Scan

Scanned2025-09-02T09:59:15+00:00
URL https://midcanterburynz.com/robots.txt
Redirect https://midcanterbury.co.nz/robots.txt
Redirect Domain midcanterbury.co.nz
Redirect Base midcanterbury.co.nz
Domain IPs 104.26.0.203, 104.26.1.203, 172.67.68.222, 2606:4700:20::681a:1cb, 2606:4700:20::681a:cb, 2606:4700:20::ac43:44de
Redirect IPs 104.21.66.230, 172.67.209.26, 2606:4700:3031::6815:42e6, 2606:4700:3034::ac43:d11a
Response IP 104.21.66.230
Found Yes
Hash 1e6f96ef855041a3a3676e2a04722c820c0a920ef5e8021a2c75d8466d5468b3
SimHash d61ec840d4b1

Groups

nuclei
wikido
riddler
petalbot
zoominfobot
go-http-client
node/simplecrawler
cazoodlebot
dotbot/1.0
gigabot
barkrowler
blexbot
magpie-crawler

Rule Path
Disallow /

gptbot
chatgpt-user
claude-web
anthropic-ai
applebot-extended
bytespider
ccbot
cohere-ai
diffbot
facebookbot
google-extended
imagesiftbot
perplexitybot
omigilibot
omigili

Rule Path
Disallow /

*

Rule Path
Disallow /manage

Other Records

Field Value
sitemap https://midcanterbury.co.nz/sitemap.xml

Comments

  • START nuxt-robots (indexable)
  • Block non helpful bots
  • Block AI Crawlers
  • END nuxt-robots