netlibrary.net
robots.txt

Robots Exclusion Standard data for netlibrary.net

Resource Scan

Scan Details

Site Domain netlibrary.net
Base Domain netlibrary.net
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2026-01-09T16:42:34+00:00
Next Scan 2026-02-08T16:42:34+00:00

Last Successful Scan

Scanned2025-12-10T23:38:08+00:00
URL http://netlibrary.net/robots.txt
Domain IPs 72.235.245.98
Response IP 72.235.245.98
Found Yes
Hash 98dfd65f6cbfa3f746f5beda281856d7bfec6e12cd3235a0dc342225082c473f
SimHash 6340cc43e352

Groups

*

Rule Path Comment
Allow /* -
Disallow /view/ -
Disallow /ebooks/ -
Disallow /Articles/ -
Disallow /results.aspx -
Disallow /Get956uFile.aspx -
Disallow /ebooks/Get956uFile.aspx -
Disallow /App_Themes/ -
Disallow /img/ private area
Disallow /images/ private area
Disallow /js/ private area
Disallow /Members/ private area
Disallow /Members.2/ private area
Disallow /Members.3/ private area
Disallow /Members.4/ private area
Disallow /Members.5/ private area
Disallow /Members.6/ private area
Disallow /Members.7/ private area
Disallow /Members.8/ private area
Disallow /Members.9/ private area
Disallow /opac/ private area
Disallow /Report/ private area
Disallow /Services/ -
Disallow /styles/ -
Disallow /view/opac* -
Disallow /XmlDb/ -

Other Records

Field Value
sitemap http://netlibrary.net/sitemap.xml

Comments

  • robots.txt