alleitaliaanse.nl
robots.txt

Robots Exclusion Standard data for alleitaliaanse.nl

Resource Scan

Scan Details

Site Domain alleitaliaanse.nl
Base Domain alleitaliaanse.nl
Scan Status Ok
Last Scan2024-09-17T13:04:06+00:00
Next Scan 2024-09-24T13:04:06+00:00

Last Scan

Scanned2024-09-17T13:04:06+00:00
URL https://alleitaliaanse.nl/robots.txt
Domain IPs 195.201.17.67
Response IP 195.201.17.67
Found Yes
Hash 459bdc59de16168246e9dfb19caac050eee81097ee620dd031f3e5b5cc506bc3
SimHash 1968d9808992

Groups

*

Rule Path
Disallow /1000$
Disallow /1000/$
Disallow /*/1000$
Disallow /*/1000/$
Disallow /download/

gptbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

piplbot

Rule Path
Disallow /