ils-ansbach.de
robots.txt

Robots Exclusion Standard data for ils-ansbach.de

Resource Scan

Scan Details

Site Domain ils-ansbach.de
Base Domain ils-ansbach.de
Scan Status Ok
Last Scan2025-06-05T19:09:36+00:00
Next Scan 2025-07-05T19:09:36+00:00

Last Scan

Scanned2025-06-05T19:09:36+00:00
URL https://ils-ansbach.de/robots.txt
Domain IPs 2a01:238:20a:202:1148::, 81.169.145.148
Response IP 81.169.145.148
Found Yes
Hash 3c1fdae4eda4851510d8c3bff9fa095b38e2a9888925f3daac5a7b57c0874a00
SimHash 4d115881e5d1

Groups

googlebot
bingbot
duckduckbot

Rule Path
Allow /.cm4all/mediadb/
Allow /.cm4all/sysdb/
Allow /.cm4all/uproc.php/
Allow /.cm4all/iproc.php/
Disallow /.cm4all/
Disallow *meta_robots-noindex

*

Rule Path
Allow /.cm4all/mediadb/
Allow /.cm4all/sysdb/
Allow /.cm4all/uproc.php/
Disallow /.cm4all/
Disallow *meta_robots-noindex

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap http://512778854.swh.strato-hosting.eu/sitemap.xml

Comments

  • robots.txt for 512778854.swh.strato-hosting.eu