unic.ac.cy
robots.txt

Robots Exclusion Standard data for unic.ac.cy

Resource Scan

Scan Details

Site Domain unic.ac.cy
Base Domain unic.ac.cy
Scan Status Ok
Last Scan2024-10-21T07:54:45+00:00
Next Scan 2024-11-20T07:54:45+00:00

Last Scan

Scanned2024-10-21T07:54:45+00:00
URL https://unic.ac.cy/robots.txt
Redirect https://www.unic.ac.cy/robots.txt
Redirect Domain www.unic.ac.cy
Redirect Base unic.ac.cy
Domain IPs 141.193.213.20, 141.193.213.21
Redirect IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.21
Found Yes
Hash 3ed6d8c40a78f991212721d12effae612ebfa6fc4350622782cc9a1b7cc8f7c0
SimHash 4a70d0a26238

Groups

ahrefsbot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow */trackback
Disallow */comment-
Disallow *?replytocom=
Disallow */feed
Disallow /?s=
Disallow /xmlrpc.php
Disallow /archives/date/
Disallow /archives/tag/
Disallow /archives/author/
Disallow /el/archives/author/
Disallow /author/
Disallow /el/author/
Disallow /page/
Disallow /tag/
Disallow /wp-admin/
Disallow /readme.html
Disallow /refer/

Other Records

Field Value
crawl-delay 10

Warnings

  • `host` is not a known field.