grosses-los.de
robots.txt

Robots Exclusion Standard data for grosses-los.de

Resource Scan

Scan Details

Site Domain grosses-los.de
Base Domain grosses-los.de
Scan Status Ok
Last Scan 2026-02-15T08:28:25+00:00
Next Scan 2026-02-22T08:28:25+00:00

Last Scan

Scanned 2026-02-15T08:28:25+00:00
URL https://www.grosses-los.de/robots.txt
Domain IPs 104.18.20.91, 104.18.21.91, 2606:4700::6812:145b, 2606:4700::6812:155b
Response IP 104.18.21.91
Found Yes
Hash b690d7b5db9d0e79f8bf39a40d86c3e3f54299678873e3c4a253e11ec6f5e22d
SimHash 5614c9504195

Groups

googlebot

Rule Path
Allow /
Disallow /sc/*todo%3Dcp_*
Disallow */global.pl*ident%3Ddatenschutz*
Disallow */global.pl*ident%3Dagb*
Disallow */global.pl*ident%3Dimpressum*
Disallow */global.pl*ident%3Ddsa*
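
The googlebot rules above combine a blanket Allow with wildcard Disallow patterns. The following is a minimal sketch of how such rules are commonly evaluated, assuming the longest-match precedence described in RFC 9309 (`*` matches any character sequence, a trailing `$` anchors the end, Allow wins ties). The function names and the test paths are hypothetical, and paths are compared literally against the patterns as listed in this report, with no percent-encoding normalization.

```python
import re

# Rules copied from the googlebot group in this report.
GOOGLEBOT_RULES = [
    ("allow", "/"),
    ("disallow", "/sc/*todo%3Dcp_*"),
    ("disallow", "*/global.pl*ident%3Ddatenschutz*"),
    ("disallow", "*/global.pl*ident%3Dagb*"),
    ("disallow", "*/global.pl*ident%3Dimpressum*"),
    ("disallow", "*/global.pl*ident%3Ddsa*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal parts and let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if the most specific (longest) matching rule is an Allow."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


# Hypothetical paths, for illustration only.
print(is_allowed("/lotto", GOOGLEBOT_RULES))
print(is_allowed("/cgi/global.pl?ident%3Dagb", GOOGLEBOT_RULES))
```

Under these assumptions the first path is allowed by `Allow /`, while the second is blocked because the longer `*/global.pl*ident%3Dagb*` pattern takes precedence.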

mediapartners-google

Rule Path
Allow /

cookiebot

Rule Path
Allow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /
Disallow /

anthropic-ai

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

*

Rule Path
Disallow /
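
The groups listed above can be read as a lookup from crawler name to rule set, with `*` as the fallback. The sketch below assumes a simple case-insensitive exact match on the product token; real crawlers may match more loosely. `ROOT_RULE` and `select_group` are hypothetical names, and only each group's rule for `/` is recorded (googlebot's additional Disallow patterns are omitted).

```python
# Root-path rule for each group in this report.
ROOT_RULE = {
    "googlebot": "Allow",
    "mediapartners-google": "Allow",
    "cookiebot": "Allow",
    "gptbot": "Disallow",
    "google-extended": "Disallow",
    "ccbot": "Disallow",
    "anthropic-ai": "Disallow",
    "perplexitybot": "Disallow",
    "*": "Disallow",
}


def select_group(product_token):
    """Pick the group for a crawler, falling back to the '*' group."""
    token = product_token.lower()
    return token if token in ROOT_RULE else "*"


# 'bingbot' has no group of its own here, so it falls back to '*'.
for agent in ("Googlebot", "GPTBot", "bingbot"):
    group = select_group(agent)
    print(agent, "->", group, "->", ROOT_RULE[group], "/")
```

Read this way, the file admits only the three explicitly allowed agents and blocks every other crawler at the root.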

Warnings

  • `user agent` is not a known field (the standard field name is `User-agent`, with a hyphen).
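
A warning like this one could be produced by a check along the following lines. This is a minimal sketch, not the scanner's actual logic: the set of known fields (the RFC 9309 fields plus the widely used `Sitemap`) is an assumption, and the sample input is hypothetical, since the raw robots.txt content is not shown in this report.

```python
# Assumed set of recognized field names, compared case-insensitively.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def unknown_fields(robots_text):
    """Return the field names in robots_text that are not recognized."""
    found = []
    for raw in robots_text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            found.append(field)
    return found


# Hypothetical input: the first line misspells 'User-agent' with a space.
sample = "user agent: *\nDisallow: /\n\nUser-agent: gptbot\nDisallow: /\n"
print(unknown_fields(sample))
```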