kit.edu
robots.txt

Robots Exclusion Standard data for kit.edu

Resource Scan

Scan Details

Site Domain kit.edu
Base Domain kit.edu
Scan Status Ok
Last Scan2024-11-06T10:34:31+00:00
Next Scan 2024-11-20T10:34:31+00:00

Last Scan

Scanned2024-11-06T10:34:31+00:00
URL https://kit.edu/robots.txt
Redirect https://www.kit.edu/robots.txt
Redirect Domain www.kit.edu
Redirect Base kit.edu
Domain IPs 141.3.128.6, 2a00:1398:b::8d03:8006
Redirect IPs 141.3.128.6, 2a00:1398:b::8d03:8006
Response IP 141.3.128.6
Found Yes
Hash afeed28066641d8842cc4b56c7492d087222093221b11a64620d79291eb76693
SimHash 2c746de1d3d9

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /search.php
Disallow /english/search.php
Disallow /admin/events/
Disallow /emailform.php
Disallow /vcard.php

w3c-checklink

Rule Path
Disallow

Comments

  • http://www.robotstxt.org/wc/exclusion.html#robotstxt
  • http://validator.w3.org/docs/checklink.html#bot