student.dnes24.sk
robots.txt

Robots Exclusion Standard data for student.dnes24.sk

Resource Scan

Scan Details

Site Domain student.dnes24.sk
Base Domain dnes24.sk
Scan Status Ok
Last Scan 2024-10-26T05:18:42+00:00
Next Scan 2024-11-25T05:18:42+00:00

Last Scan

Scanned 2024-10-26T05:18:42+00:00
URL https://student.dnes24.sk/robots.txt
Domain IPs 109.71.69.113
Response IP 109.71.69.113
Found Yes
Hash 91a7238f3db2c6c3089162019ea613294684d73e8d6f26bdfbcd2aded00f1271
SimHash 284e1d4484b7

Groups

*

Rule Path
Disallow /ucet/
Disallow /registracia
Disallow /prihlasenie
Disallow /odhlasenie
Disallow /zabudnute-heslo
Disallow /account/facebook-login
Disallow /web-push-subscribe
Disallow /web-push-unsubscribe
Disallow /vyhladavanie
Disallow /zabava/kvizy/

Other Records

Field Value
crawl-delay 1

facebookexternalhit

Rule Path
Disallow /ucet/
Disallow /registracia
Disallow /prihlasenie
Disallow /odhlasenie
Disallow /zabudnute-heslo
Disallow /account/facebook-login
Disallow /web-push-subscribe
Disallow /web-push-unsubscribe
Disallow /vyhladavanie
Disallow /zabava/kvizy/

Other Records

Field Value
sitemap https://student.dnes24.sk/sitemap.xml
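To illustrate how a crawler might interpret the two groups listed above, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt text is reconstructed from this report, so the line order and the exact placement of the crawl-delay and sitemap records are assumptions, not the raw file. The agent name ExampleBot and the test paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Paths taken from the "Disallow" rows of the report (same list in both groups).
DISALLOWED = [
    "/ucet/", "/registracia", "/prihlasenie", "/odhlasenie",
    "/zabudnute-heslo", "/account/facebook-login",
    "/web-push-subscribe", "/web-push-unsubscribe",
    "/vyhladavanie", "/zabava/kvizy/",
]
SITEMAP = "https://student.dnes24.sk/sitemap.xml"
BASE = "https://student.dnes24.sk"


def build_robots_lines():
    """Rebuild a plausible robots.txt from the report (layout is assumed)."""
    rules = [f"Disallow: {path}" for path in DISALLOWED]
    return (
        ["User-agent: *"] + rules + ["Crawl-delay: 1", ""]
        + ["User-agent: facebookexternalhit"] + rules + [""]
        + [f"Sitemap: {SITEMAP}"]
    )


parser = RobotFileParser()
parser.parse(build_robots_lines())

# "ExampleBot" is a hypothetical agent that falls under the "*" group.
checks = [
    ("ExampleBot", "/ucet/profil"),
    ("ExampleBot", "/zabava/kvizy"),      # no trailing slash, so not matched
    ("ExampleBot", "/zabava/kvizy/1"),
    ("facebookexternalhit/1.1", "/prihlasenie"),
]
for agent, path in checks:
    allowed = parser.can_fetch(agent, BASE + path)
    print(f"{agent:26} {path:18} allowed={allowed}")

print("crawl delay for *:", parser.crawl_delay("ExampleBot"))
print("crawl delay for facebookexternalhit:",
      parser.crawl_delay("facebookexternalhit"))
print("sitemaps:", parser.site_maps())
```

Rules are matched as path prefixes, so /registracia also covers any longer path that starts with the same characters, while /zabava/kvizy/ leaves the slashless /zabava/kvizy fetchable. In this reconstruction the crawl-delay applies only to the * group, because a crawler that matches a named group uses that group's records alone.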

Comments

  • see http://www.robotstxt.org/orig.html for documentation