simplyyoubox.be
robots.txt

Robots Exclusion Standard data for simplyyoubox.be

Resource Scan

Scan Details

Site Domain simplyyoubox.be
Base Domain simplyyoubox.be
Scan Status Ok
Last Scan2025-11-26T00:35:31+00:00
Next Scan 2025-12-26T00:35:31+00:00

Last Scan

Scanned2025-11-26T00:35:31+00:00
URL https://simplyyoubox.be/robots.txt
Domain IPs 104.21.17.56, 172.67.222.86, 2606:4700:3033::ac43:de56, 2606:4700:3034::6815:1138
Response IP 104.21.17.56
Found Yes
Hash 6d5404f2a3319c0c83fb14644f001b82e929e127fb722450dc57be1cf7d7e836
SimHash 4910d0d10633

Groups

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

firecrawlagent

Rule Path
Allow /

andibot

Rule Path
Allow /

exabot

Rule Path
Allow /

phindbot

Rule Path
Allow /

youbot

Rule Path
Allow /

*

Rule Path
Disallow /?s=
Disallow /page/*/?s=
Disallow /search/

Other Records

Field Value
sitemap https://simplyyoubox.be/sitemap_index.xml

Warnings

  • `llms` is not a known field.