rath-heumar.info
robots.txt

Robots Exclusion Standard data for rath-heumar.info

Resource Scan

Scan Details

Site Domain rath-heumar.info
Base Domain rath-heumar.info
Scan Status Ok
Last Scan2025-03-30T15:14:25+00:00
Next Scan 2025-04-29T15:14:25+00:00

Last Scan

Scanned2025-03-30T15:14:25+00:00
URL https://rath-heumar.info/robots.txt
Redirect https://www.rath-heumar.info/robots.txt
Redirect Domain www.rath-heumar.info
Redirect Base rath-heumar.info
Domain IPs 104.21.10.40, 172.67.131.59
Redirect IPs 104.21.10.40, 172.67.131.59
Response IP 172.67.131.59
Found Yes
Hash cb53479438f1bf82e5bd6c60e36f2c4fde90f7f1c022c6b9e935fb857989a944
SimHash a91818848b93

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /temp/

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.rath-heumar.info/sitemap_index.xml

Comments

  • ===================================
  • Generator: http://pixelfolk.net/tools/robots
  • Erstellt am: 31.01.2017, 18:55
  • Webseite: http://http://www.rath-heumar.info/
  • ===================================
  • ===================================
  • Folgende Seiten sollen nicht indexiert werden:
  • ===================================
  • ===================================
  • Schließe folgende Spider komplett aus:
  • ===================================