www.ifgg.kit.edu
robots.txt

Robots Exclusion Standard data for www.ifgg.kit.edu

Resource Scan

Scan Details

Site Domain www.ifgg.kit.edu
Base Domain kit.edu
Scan Status Ok
Last Scan 2025-07-24T06:57:51+00:00
Next Scan 2025-08-23T06:57:51+00:00

Last Scan

Scanned 2025-07-24T06:57:51+00:00
URL https://www.ifgg.kit.edu/robots.txt
Domain IPs 141.3.128.253, 2a00:1398:b::8d03:80fd
Response IP 141.3.128.253
Found Yes
Hash f02640686e146f484d08638d758e6f10a7785a48909b15f1e35cd4d4d7f0e438
SimHash 691c1935c150
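
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The scan does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw response body. The function name body_digest is hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Any change to the body, even whitespace, produces a different digest.
    print(body_digest(b"User-agent: *\nDisallow: /karlit\n"))
```

Comparing such a digest between two scans would show whether the file changed at all; the much shorter SimHash is presumably meant for judging how similar two versions are, which an exact digest cannot do.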

Groups

*

Rule Path
Disallow /karlit
Allow /karlit/img
Allow /karlit/style
Disallow /emailform.php

w3c-checklink

Rule Path
Disallow
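
How the two groups above interact can be sketched in a few lines. This is a minimal illustration, assuming longest-match precedence as described in RFC 9309 (the most specific matching path wins, and Allow wins a tie), plain prefix matching without the * and $ wildcards, and exact user-agent names. The GROUPS table and the is_allowed function are hypothetical names, and real crawlers may evaluate the rules differently.

```python
# Rules transcribed from the Groups tables above. An empty Disallow path
# places no restriction on that user agent.
GROUPS = {
    "*": [
        ("disallow", "/karlit"),
        ("allow", "/karlit/img"),
        ("allow", "/karlit/style"),
        ("disallow", "/emailform.php"),
    ],
    "w3c-checklink": [("disallow", "")],
}


def is_allowed(user_agent: str, path: str) -> bool:
    """Evaluate a path with longest-match precedence (plain prefixes only)."""
    # Use the group named for this agent, otherwise fall back to "*".
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best_len = -1
    allowed = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue  # empty paths and non-matching prefixes are ignored
        # A longer prefix wins; on equal length, allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("*", "/karlit/page.php"),
        ("*", "/karlit/img/logo.png"),
        ("*", "/emailform.php"),
        ("w3c-checklink", "/karlit/page.php"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, /karlit is closed to general crawlers except for its img and style subdirectories, while w3c-checklink is not restricted at all, because its only rule has an empty path. Note that a parser using first-match semantics instead of longest-match would read the * group differently, since Disallow /karlit is listed before the two Allow rules.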

Other Records

Field Value
sitemap https://www.ifgg.kit.edu/10.php
sitemap https://www.ifgg.kit.edu/vegetation/forschung/cem-bgd/sitemap.php

Comments

  • http://www.robotstxt.org/wc/exclusion.html#robotstxt
  • http://validator.w3.org/docs/checklink.html#bot