brucethecat.co.nz
robots.txt

Robots Exclusion Standard data for brucethecat.co.nz

Resource Scan

Scan Details

Site Domain brucethecat.co.nz
Base Domain brucethecat.co.nz
Scan Status Ok
Last Scan 2025-10-21T22:56:41+00:00
Next Scan 2025-11-20T22:56:41+00:00
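
The gap between the two timestamps above can be checked with a short sketch. The variable names are illustrative, and the idea that the scanner works on a fixed interval is an assumption drawn only from these two values, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details listing.
last_scan = datetime.fromisoformat("2025-10-21T22:56:41+00:00")
next_scan = datetime.fromisoformat("2025-11-20T22:56:41+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)
```

For these two values the difference is exactly 30 days.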

Last Scan

Scanned 2025-10-21T22:56:41+00:00
URL https://brucethecat.co.nz/robots.txt
Redirect https://www.brucethecat.co.nz/robots.txt
Redirect Domain www.brucethecat.co.nz
Redirect Base brucethecat.co.nz
Domain IPs 104.21.2.233, 172.67.129.203, 2606:4700:3030::ac43:81cb, 2606:4700:3033::6815:2e9
Redirect IPs 104.21.2.233, 172.67.129.203, 2606:4700:3030::ac43:81cb, 2606:4700:3033::6815:2e9
Response IP 104.21.2.233
Found Yes
Hash 677882716f0af8c45da01d6d8a3c871bbc9a9b9e4b1ab7c8c1a9265d899839e2
SimHash 0c6a5b70a675

Groups

*

Rule Path
Disallow /login/
Disallow /card/
Disallow /fotos/
Disallow /temp/
Disallow /search/
Disallow /register/
Disallow /adminpanel/
Disallow /forgot-password/
Disallow /admin-assets/
Disallow /search?*
Disallow /search?search=
Disallow /*.pdf$
Disallow /?
Disallow /*?
Disallow /*?page=
Disallow /cgi-bin*
Allow /

gptbot

Rule Path
Allow /

ccbot

Rule Path
Disallow /
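
To see how the groups above might apply to a given URL path, here is a minimal sketch of rule evaluation. It assumes the matching behaviour described in RFC 9309: the longest matching pattern wins, an Allow wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `GROUPS`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers for illustration, not part of the scan tool, and the user-agent lookup is simplified to an exact, case-insensitive token match.

```python
import re

# Rules transcribed from the Groups listing above, keyed by user-agent token.
GROUPS = {
    "*": [
        ("disallow", "/login/"), ("disallow", "/card/"),
        ("disallow", "/fotos/"), ("disallow", "/temp/"),
        ("disallow", "/search/"), ("disallow", "/register/"),
        ("disallow", "/adminpanel/"), ("disallow", "/forgot-password/"),
        ("disallow", "/admin-assets/"), ("disallow", "/search?*"),
        ("disallow", "/search?search="), ("disallow", "/*.pdf$"),
        ("disallow", "/?"), ("disallow", "/*?"),
        ("disallow", "/*?page="), ("disallow", "/cgi-bin*"),
        ("allow", "/"),
    ],
    "gptbot": [("allow", "/")],
    "ccbot": [("disallow", "/")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces; each '*' becomes '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if `path` may be fetched by `user_agent` (sketch only)."""
    # Simplification: exact token lookup, falling back to the '*' group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); tuple order makes Allow win ties
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


print(is_allowed("Googlebot", "/login/"))
print(is_allowed("GPTBot", "/login/"))
print(is_allowed("CCBot", "/"))
```

Under these assumptions, a crawler that falls into the `*` group is blocked from `/login/`, PDF files, and any URL with a query string, while `gptbot` uses only its own group and is allowed everywhere, and `ccbot` is blocked from the whole site. Real crawlers may interpret wildcards or rule precedence differently.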

Other Records

Field Value
sitemap https://www.brucethecat.co.nz/sitemap.xml