newsroom.co.nz
robots.txt

Robots Exclusion Standard data for newsroom.co.nz

Resource Scan

Scan Details

Site Domain newsroom.co.nz
Base Domain newsroom.co.nz
Scan Status Ok
Last Scan2024-11-18T04:43:17+00:00
Next Scan 2024-11-25T04:43:17+00:00

Last Scan

Scanned2024-11-18T04:43:17+00:00
URL https://newsroom.co.nz/robots.txt
Domain IPs 192.0.78.138, 192.0.78.250
Response IP 192.0.78.250
Found Yes
Hash c3a7110ba5b0155baa2bab54982980bf2d10e402f2481a7b2ac9d7271ab4e047
SimHash 1e36eb70a953

Groups

perplexitybot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

googleother

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /