tourismnewzealand.com
robots.txt

Robots Exclusion Standard data for tourismnewzealand.com

Resource Scan

Scan Details

Site Domain tourismnewzealand.com
Base Domain tourismnewzealand.com
Scan Status Ok
Last Scan2024-09-19T16:27:10+00:00
Next Scan 2024-10-19T16:27:10+00:00

Last Scan

Scanned2024-09-19T16:27:10+00:00
URL https://tourismnewzealand.com/robots.txt
Redirect https://www.tourismnewzealand.com/robots-prod.txt
Redirect Domain www.tourismnewzealand.com
Redirect Base tourismnewzealand.com
Domain IPs 23.32.29.8, 2600:1413:b000:1b::17d7:70d, 2600:1413:b000:1b::17d7:719, 96.17.180.44
Redirect IPs 23.32.29.8, 2600:1413:b000:1b::17d7:70d, 2600:1413:b000:1b::17d7:719, 96.17.180.44
Response IP 23.32.29.97
Found Yes
Hash e86197cfb5524328f8f3f40df85075ea561610e9a1ad5b9468ff2392bb1dadad
SimHash 301e9fe04af0

Groups

*

Rule Path
Disallow /api/
Disallow /farefinder/
Disallow /admin/
Disallow /dev/
Disallow /health/check/
Disallow /Security/
Disallow /CMSSecurity/
Disallow /RemoveOrphanedPagesTask/
Disallow /SiteTreeMaintenanceTask/
Disallow /UserDefinedFormController/

Other Records

Field Value
crawl-delay 5

http://www.almaden.ibm.com/cs/crawler
bordermanager*
webcollage*
java*
grub-client
lwp*
linkwalker
offline explorer
larbin
mj12bot
blexbot
dotbot
yeti
semrushbot

Rule Path
Disallow /

swiftbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 0.25

Comments

  • robots-prod.txt
  • Production Robots File
  • 20210921 0841
  • ----- DEFAULT CRAWLER RULES -----
  • - RESOURCE PATHS SS -
  • - Default crawl delay
  • ----- DISABLED CRAWLERS -----
  • ----- Swiftype specific config

Warnings

  • 3 invalid lines.