Robots Exclusion Standard data for greetz.nl

Resource Scan

Scan Details

Site Domain greetz.nl
Base Domain greetz.nl
Scan Status Ok
Last Scan 2024-06-01T14:36:28+00:00
Next Scan 2024-06-15T14:36:28+00:00

Last Scan

Scanned 2024-06-01T14:36:28+00:00
URL https://greetz.nl/robots.txt
Redirect https://www.greetz.nl/robots.txt
Redirect Domain www.greetz.nl
Redirect Base greetz.nl
Domain IPs 13.224.163.31, 13.224.163.5, 13.224.163.8, 13.224.163.99
Redirect IPs 104.18.39.147, 172.64.148.109, 2606:4700:4400::6812:2793, 2606:4700:4400::ac40:946d
Response IP 104.18.39.147
Found Yes
Hash a6d14dd3a48aa69bc90beeab41811292451d368c336e87ef441f11cb24394ed5
SimHash 0d5d074077d1
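
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so that is an assumption. Under that assumption, a minimal sketch for checking whether a freshly fetched copy of robots.txt is byte-identical to the scanned one could look like this (the function names are hypothetical, and the scanner may normalise the body before hashing):

```python
import hashlib

# Hash value copied from the scan record above.
RECORDED_HASH = "a6d14dd3a48aa69bc90beeab41811292451d368c336e87ef441f11cb24394ed5"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    Assumes the scanner hashed the unmodified response bytes with SHA-256;
    the report itself does not confirm this.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would only show that the bytes differ (or that the assumption about the algorithm is wrong), not which rules changed.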

Groups

googlebot

Rule Path
Disallow */basket/
Disallow /cdn-cgi/
Disallow */customise/*
Disallow */MyAccount/
Disallow */myaccount/
Disallow */auth/token
Disallow */OrderProcess/Previews/
Disallow */orderprocess/previews/
Disallow */search/*
Disallow */Search/*
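
Most of the googlebot paths above rely on the `*` wildcard. As a rough illustration of how such patterns are commonly interpreted (`*` matches any run of characters, a trailing `$` anchors the end, and matching otherwise starts at the beginning of the path and is case-sensitive), here is a minimal sketch. The helper names are hypothetical and this is not a full robots.txt parser; the group contains only Disallow rules, so Allow precedence is not modelled.

```python
import re
from typing import Iterable

# Disallow paths copied from the googlebot group above.
GOOGLEBOT_DISALLOW = [
    "*/basket/",
    "/cdn-cgi/",
    "*/customise/*",
    "*/MyAccount/",
    "*/myaccount/",
    "*/auth/token",
    "*/OrderProcess/Previews/",
    "*/orderprocess/previews/",
    "*/search/*",
    "*/Search/*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    '*' becomes '.*', a trailing '$' anchors the end of the path,
    and every other character is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: Iterable[str]) -> bool:
    """True if any pattern matches the URL path from its first character."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

With these rules, a hypothetical path such as `/nl/basket/` is disallowed, while `/SEARCH/x` is not, because matching is case-sensitive. That case sensitivity is presumably why the file lists both `*/search/*` and `*/Search/*`.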

*

Rule Path
Disallow */basket/
Disallow /cdn-cgi/
Disallow */customise/*
Disallow */MyAccount/
Disallow */myaccount/
Disallow */auth/token
Disallow */OrderProcess/Previews/
Disallow */orderprocess/previews/
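
The two groups differ only in the search rules. A crawler normally obeys the group whose name matches its own product token and falls back to `*` otherwise. The sketch below illustrates that selection and lists the rules that apply to googlebot alone. It is a simplification (exact, case-insensitive token match; the function names are hypothetical) and not a description of how any particular crawler is implemented.

```python
# Disallow paths copied from the two groups above.
GROUPS = {
    "googlebot": [
        "*/basket/", "/cdn-cgi/", "*/customise/*", "*/MyAccount/",
        "*/myaccount/", "*/auth/token", "*/OrderProcess/Previews/",
        "*/orderprocess/previews/", "*/search/*", "*/Search/*",
    ],
    "*": [
        "*/basket/", "/cdn-cgi/", "*/customise/*", "*/MyAccount/",
        "*/myaccount/", "*/auth/token", "*/OrderProcess/Previews/",
        "*/orderprocess/previews/",
    ],
}


def select_group(product_token: str) -> list:
    """Return the rule list for a crawler's product token.

    Group names are compared case-insensitively; crawlers without a
    dedicated group fall back to the '*' group.
    """
    return GROUPS.get(product_token.lower(), GROUPS["*"])


def googlebot_only() -> list:
    """Rules present in the googlebot group but absent from '*'."""
    return [p for p in GROUPS["googlebot"] if p not in GROUPS["*"]]
```

Here `googlebot_only()` returns the two search patterns, so under this reading the site search pages are closed to googlebot but not to crawlers covered by the `*` group.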

Other Records

Field Value
sitemap https://www.greetz.nl/.well-known/sitemap/grtz-sitemap-gallery-nl.xml
sitemap https://www.greetz.nl/.well-known/sitemap/grtz-sitemap-product-nl.xml