quelle.de
robots.txt

Robots Exclusion Standard data for quelle.de

Resource Scan

Scan Details

Site Domain quelle.de
Base Domain quelle.de
Scan Status Ok
Last Scan 2024-11-07T16:12:46+00:00
Next Scan 2024-11-14T16:12:46+00:00

Last Scan

Scanned 2024-11-07T16:12:46+00:00
URL https://quelle.de/robots.txt
Redirect https://www.quelle.de:443/robots.txt
Redirect Domain www.quelle.de
Redirect Base quelle.de
Domain IPs 34.149.110.250
Redirect IPs 34.149.110.250
Response IP 34.149.110.250
Found Yes
Hash 99b02f7a893fc11571c79f6c34fa056d79a5211cbcf5643a09ff0b83fb9a8c5a
SimHash 012e5c69835b

Groups

sogou spider

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

shopstyle bot/1.0

Rule Path
Disallow /

python-grcrawler

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

axis/1.4

Rule Path
Disallow /

proximic

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wget

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /merkzettel
Disallow /kasse/$
Disallow /warenkorb/$
Disallow /s$
Disallow /*.htm$
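
The groups above combine blanket blocks for named crawlers with wildcard rules for everyone else. The following is a minimal sketch of how a crawler might evaluate them, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the path. The function names, the substring-based user-agent matching, and the sample paths are illustrative assumptions, not part of the scan data or of any particular parser.

```python
import re

# Agents from the groups above that carry a blanket "Disallow /".
BLOCKED_AGENTS = [
    "sogou spider", "ia_archiver", "yandexbot", "shopstyle bot/1.0",
    "python-grcrawler", "mail.ru", "seokicks-robot", "axis/1.4",
    "proximic", "grapeshot", "blexbot", "wget", "ahrefsbot",
]

# Rules from the "*" group, copied from the scan.
WILDCARD_RULES = ["/merkzettel", "/kasse/$", "/warenkorb/$", "/s$", "/*.htm$"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' requires the path to end there.
    Without '$' the pattern is a plain prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def rules_for(user_agent: str) -> list[str]:
    """Pick the rule set for a user agent.

    Assumption: a case-insensitive substring test stands in for real
    product-token matching, which parsers implement in different ways.
    """
    ua = user_agent.lower()
    if any(token in ua for token in BLOCKED_AGENTS):
        return ["/"]
    return WILDCARD_RULES


def is_disallowed(user_agent: str, path: str) -> bool:
    """Return True if any Disallow rule for this agent matches the path."""
    # re.match anchors at the start of the path, as robots.txt rules do.
    return any(pattern_to_regex(rule).match(path) for rule in rules_for(user_agent))


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise each rule.
    for agent, path in [
        ("AhrefsBot/7.0", "/"),
        ("ExampleBot", "/"),
        ("ExampleBot", "/kasse/"),
        ("ExampleBot", "/kasse/schritt-2"),
        ("ExampleBot", "/produkt/foo.htm"),
        ("ExampleBot", "/produkt/foo.html"),
    ]:
        print(agent, path, is_disallowed(agent, path))
```

Because the scan lists only Disallow rules, the sketch skips the longest-match precedence that would be needed if Allow rules were also present.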