studylib.net
robots.txt

Robots Exclusion Standard data for studylib.net

Resource Scan

Scan Details

Site Domain studylib.net
Base Domain studylib.net
Scan Status Ok
Last Scan2024-09-20T06:14:38+00:00
Next Scan 2024-09-27T06:14:38+00:00

Last Scan

Scanned2024-09-20T06:14:38+00:00
URL https://studylib.net/robots.txt
Domain IPs 104.21.73.241, 172.67.193.117, 2606:4700:3030::ac43:c175, 2606:4700:3031::6815:49f1
Response IP 104.21.73.241
Found Yes
Hash a1428bde4c15abb0337dd0694fcd2dfea6ab3d731a81b5e060f1b15f37976fcb
SimHash 00049ec09531

Groups

*

Rule Path
Disallow /viewer_next/
Disallow /theme/
Allow /theme/*/static
Disallow /store/
Disallow /upload
Disallow /download/
Disallow /docinfo.xml
Disallow /sendmail.html
Disallow /ask/searchAjax
Disallow /cdn-cgi/
Disallow /search/
Allow /

Other Records

Field Value
sitemap https://studylib.net/sitemap.xml

Warnings

  • `host` is not a known field.