haushalt.de
robots.txt

Robots Exclusion Standard data for haushalt.de

Resource Scan

Scan Details

Site Domain haushalt.de
Base Domain haushalt.de
Scan Status Ok
Last Scan2024-10-31T15:52:35+00:00
Next Scan 2024-11-07T15:52:35+00:00

Last Scan

Scanned2024-10-31T15:52:35+00:00
URL https://haushalt.de/robots.txt
Domain IPs 195.201.34.14
Response IP 195.201.34.14
Found Yes
Hash 51ecdb0a71e4fb90e7acb6328d2a24a8ffef2a017baf097e00ced81702caf2fe
SimHash 065edb40e152

Groups

*

Rule Path
Disallow /galerie/
Disallow /login/
Disallow /recent-activity/
Disallow /m/
Disallow /members/
Disallow /wiki/

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.wunderkessel.de/sitemap.php

Comments

  • robots.txt for http://www.wunderkessel.de/
  • Block spider access for some directories