happylibnet.com
robots.txt

Robots Exclusion Standard data for happylibnet.com

Resource Scan

Scan Details

Site Domain happylibnet.com
Base Domain happylibnet.com
Scan Status Ok
Last Scan2025-06-08T17:23:48+00:00
Next Scan 2025-06-15T17:23:48+00:00

Last Scan

Scanned2025-06-08T17:23:48+00:00
URL https://happylibnet.com/robots.txt
Domain IPs 104.21.78.127, 172.67.221.195, 2606:4700:3035::ac43:ddc3, 2606:4700:3037::6815:4e7f
Response IP 104.21.78.127
Found Yes
Hash 5e9cbae057dafcb5738124cb30da3d17fbf4015438cf1f12cd1da40830a4d7bf
SimHash 01089e908530

Groups

*

Rule Path
Disallow /viewer_next/
Disallow /theme/
Allow /theme/*/static
Disallow /store/
Disallow /upload
Disallow /download/
Disallow /docinfo.xml
Disallow /sendmail.html
Disallow /ask/searchAjax
Disallow /cdn-cgi/
Disallow /search/
Disallow /documents/
Allow /

applebot-extended

Rule Path
Disallow /doc/

Other Records

Field Value
sitemap https://happylibnet.com/sitemap.xml

Warnings

  • `host` is not a known field.