arche-warder.de
robots.txt

Robots Exclusion Standard data for arche-warder.de

Resource Scan

Scan Details

Site Domain arche-warder.de
Base Domain arche-warder.de
Scan Status Ok
Last Scan2025-05-28T21:10:18+00:00
Next Scan 2025-06-27T21:10:18+00:00

Last Scan

Scanned2025-05-28T21:10:18+00:00
URL https://arche-warder.de/robots.txt
Domain IPs 2001:8d8:100f:f000::299, 217.160.0.78
Response IP 217.160.0.78
Found Yes
Hash 2352284d007b6b08b4b547c1cde88757b9b2ed34e1cf394833339ab0d4a886f9
SimHash 4805cec4e613

Groups

scrapy

Rule Path
Allow /

googlebot-image

Rule Path
Disallow /

mediapartners-google

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/mu-plugins/
Disallow /wp-content/plugins/
Disallow /wp-content/cache/
Disallow /wp-content/themes/
Disallow /wp-content/uploads/highres/

Other Records

Field Value
sitemap https://www.arche-warder.de/sitemap_index.xml

Comments

  • Google Image
  • Google AdSense
  • digg mirror
  • global