bushcraft-deutschland.de
robots.txt

Robots Exclusion Standard data for bushcraft-deutschland.de

Resource Scan

Scan Details

Site Domain bushcraft-deutschland.de
Base Domain bushcraft-deutschland.de
Scan Status Ok
Last Scan 2024-06-28T23:12:51+00:00
Next Scan 2024-07-05T23:12:51+00:00

Last Scan

Scanned 2024-06-28T23:12:51+00:00
URL https://bushcraft-deutschland.de/robots.txt
Redirect https://www.bushcraft-deutschland.de/robots.txt
Redirect Domain www.bushcraft-deutschland.de
Redirect Base bushcraft-deutschland.de
Domain IPs 104.26.10.141, 104.26.11.141, 172.67.74.22
Redirect IPs 104.26.10.141, 104.26.11.141, 172.67.74.22
Response IP 104.26.10.141
Found Yes
Hash 387b6cbfdb371ff7aa6bc61b9bddfb89708c47340a5de8a3eb22bc17e5fd2513
SimHash 291ed8548511

Groups

boardreader

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

spinn3r

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow (empty path; nothing is disallowed)

*

Rule Path
Disallow /members/
Disallow /online/
Disallow /recent-activiy/
Disallow /find-new/
Disallow /misc/
Disallow /hilfe/
Disallow /search/
Disallow /posts/
Disallow /tags/
Disallow /posts/*/reactions
Disallow /affiliate/
Disallow /account/
Disallow /whats-new/

Other Records

Field Value
sitemap https://www.bushcraft-deutschland.de/sitemap.php
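The groups above can be checked against a standard parser. The following is a minimal sketch using Python's `urllib.robotparser`, assuming a robots.txt reconstructed from a subset of the groups listed in this report (not the site's actual file). The agent name `examplebot` and the path `/threads/` are hypothetical and chosen only for illustration. The wildcard rule `/posts/*/reactions` is left out because `urllib.robotparser` matches path prefixes only.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from a subset of the groups in this report (an assumption,
# not the site's actual robots.txt).
ROBOTS_TXT = """\
User-agent: gptbot
Disallow: /

User-agent: slurp
Crawl-delay: 2

User-agent: mediapartners-google
Disallow:

User-agent: *
Disallow: /members/
Disallow: /search/
Disallow: /account/

Sitemap: https://www.bushcraft-deutschland.de/sitemap.php
"""

BASE = "https://www.bushcraft-deutschland.de"


def build_parser(text):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# gptbot has its own group that blocks everything.
print(parser.can_fetch("gptbot", BASE + "/"))
# slurp has a group with no rules, so the "*" rules do not apply to it.
print(parser.can_fetch("slurp", BASE + "/members/"))
print(parser.crawl_delay("slurp"))
# An empty Disallow means nothing is disallowed for that agent.
print(parser.can_fetch("mediapartners-google", BASE + "/search/"))
# "examplebot" and "/threads/" are hypothetical; this agent falls back to "*".
print(parser.can_fetch("examplebot", BASE + "/members/"))
print(parser.can_fetch("examplebot", BASE + "/threads/"))
print(parser.site_maps())
```

A group that names an agent takes precedence over the `*` group, which is why `slurp` may fetch `/members/` in this sketch even though the default group disallows it.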

Warnings

  • 2 invalid lines.
  • `diallos` is not a known field.
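A warning such as the unknown `diallos` field can be reproduced with a simple line check. The sketch below is one possible approach, not the scanner's actual logic. The `KNOWN_FIELDS` set, the function name `find_unknown_fields`, and the sample input are assumptions made for illustration; the sample is not taken from the site's file.

```python
# Field names treated as valid in this sketch (an assumption, not a full list).
KNOWN_FIELDS = {"user-agent", "disallow", "allow", "crawl-delay", "sitemap"}


def find_unknown_fields(text):
    """Return (line_number, field) pairs for lines with an unrecognized field.

    A line without a colon is reported with field None.
    """
    problems = []
    for number, raw in enumerate(text.splitlines(), start=1):
        # Drop comments and surrounding whitespace before inspecting the line.
        line = raw.split("#", 1)[0].strip()
        if not line:
            continue
        if ":" not in line:
            problems.append((number, None))
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS:
            problems.append((number, field))
    return problems


# Hypothetical input containing a misspelled "Disallow".
SAMPLE = """\
User-agent: *
Diallos: /example/
Disallow: /search/
"""

print(find_unknown_fields(SAMPLE))
```

Crawlers generally ignore lines they cannot parse, so a misspelled field like this usually means the intended rule is not applied at all.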