blogs.ntu.edu.sg
robots.txt

Robots Exclusion Standard data for blogs.ntu.edu.sg

Resource Scan

Scan Details

Site Domain blogs.ntu.edu.sg
Base Domain ntu.edu.sg
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-20T11:59:32+00:00
Next Scan 2024-11-19T11:59:32+00:00

Last Successful Scan

Scanned2024-08-29T11:58:37+00:00
URL https://blogs.ntu.edu.sg/robots.txt
Domain IPs 100.24.182.117, 184.72.224.80, 3.91.109.122, 34.199.202.106, 34.227.238.166, 35.172.73.102
Response IP 184.72.224.80
Found Yes
Hash e4d85440bbc4797aa1b6067706eaea63f438c11e392daa8bc2c53c6d47d43569
SimHash e0c456c180ab

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
crawl-delay 30

yandex

Rule Path
Disallow /

moget

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

baiduspider+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

baiduspider/2.0;+http://www.baidu.com/search/spider.html

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

mozilla/5.0(compatible; baiduspider/2.0; +http://www.baidu.com/search/spider.html)

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider/2.0

Rule Path
Disallow /

sosospider+

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /wp-content/mu-plugins/

Other Records

Field Value
sitemap https://blogs.ntu.edu.sg/wp-sitemap.xml

Warnings

  • 6 invalid lines.