ep-infonet.de
robots.txt

Robots Exclusion Standard data for ep-infonet.de

Resource Scan

Scan Details

Site Domain ep-infonet.de
Base Domain ep-infonet.de
Scan Status Ok
Last Scan2024-06-02T06:09:44+00:00
Next Scan 2024-07-02T06:09:44+00:00

Last Scan

Scanned2024-06-02T06:09:44+00:00
URL https://ep-infonet.de/robots.txt
Redirect https://www.ep-infonet.de:443/robots.txt
Redirect Domain www.ep-infonet.de
Redirect Base ep-infonet.de
Domain IPs 18.197.194.45, 35.156.244.224, 52.57.64.92
Redirect IPs 13.227.254.16, 13.227.254.38, 13.227.254.44, 13.227.254.52, 2600:9000:200a:2a00:12:6454:6300:93a1, 2600:9000:200a:4400:12:6454:6300:93a1, 2600:9000:200a:600:12:6454:6300:93a1, 2600:9000:200a:6400:12:6454:6300:93a1, 2600:9000:200a:9800:12:6454:6300:93a1, 2600:9000:200a:9a00:12:6454:6300:93a1, 2600:9000:200a:e000:12:6454:6300:93a1, 2600:9000:200a:e00:12:6454:6300:93a1
Response IP 13.227.254.16
Found Yes
Hash 39a8de1329c87ae1ec893428fdeb575533f46333a1f0516a500944d2efbf491e
SimHash ec52579dedf4

Groups

*

Rule Path
Disallow /epDE/de/cart
Disallow /epDE/de/checkout
Disallow /epDE/de/my-account

Other Records

Field Value Comment
crawl-delay 10 10 seconds between page requests

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

Other Records

Field Value
sitemap /epDE/de/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot

Warnings

  • `request-rate` is not a known field.
  • `visit-time` is not a known field.