planetpeschel.com
robots.txt

Robots Exclusion Standard data for planetpeschel.com

Resource Scan

Scan Details

Site Domain planetpeschel.com
Base Domain planetpeschel.com
Scan Status Ok
Last Scan2026-02-15T09:08:23+00:00
Next Scan 2026-03-17T09:08:23+00:00

Last Scan

Scanned2026-02-15T09:08:23+00:00
URL https://planetpeschel.com/robots.txt
Domain IPs 104.21.70.118, 172.67.223.98, 2606:4700:3033::6815:4676, 2606:4700:3035::ac43:df62
Response IP 172.67.223.98
Found Yes
Hash b2bd5c9260424940d0d6edc124cb859047873a18b78e2d26fbab1253b9c72fd1
SimHash 6934db40c5f1

Groups

*

Rule Path
Disallow /Auction/
Disallow /HomePage/
Disallow /art/
Disallow /themes/
Disallow /cgi-bin/
Disallow /images/
Disallow /tutorial/
Disallow /testblog.css
Disallow /path.php
Disallow /ie3.css
Disallow /PP5.css

all

Rule Path
Allow /

googlebot-image

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

Comments

  • list folders robots are not allowed to index
  • list specific files robots are not allowed to index
  • From Wordpress forum, other things to disallow