gutenberg.us
robots.txt

Robots Exclusion Standard data for gutenberg.us

Resource Scan

Scan Details

Site Domain gutenberg.us
Base Domain gutenberg.us
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2026-01-30T01:32:17+00:00
Next Scan 2026-04-30T01:32:17+00:00

Last Successful Scan

Scanned2025-10-02T04:04:26+00:00
URL http://gutenberg.us/robots.txt
Domain IPs 72.235.245.98
Response IP 72.235.245.98
Found Yes
Hash 14531b519dee655235b7df81de4b3e0661c45cb480ee6d0cc03948a12713edbd
SimHash 6340cd47e712

Groups

*

Rule Path Comment
Allow /* -
Disallow /view/ -
Disallow /ebooks/ -
Disallow /Articles/ -
Disallow /results.aspx -
Disallow /Get956uFile.aspx -
Disallow /ebooks/Get956uFile.aspx -
Disallow /App_Themes/ -
Disallow /img/ private area
Disallow /images/ private area
Disallow /js/ private area
Disallow /Members/ private area
Disallow /Members.2/ private area
Disallow /Members.3/ private area
Disallow /Members.4/ private area
Disallow /Members.5/ private area
Disallow /Members.6/ private area
Disallow /Members.7/ private area
Disallow /Members.8/ private area
Disallow /Members.9/ private area
Disallow /opac/ private area
Disallow /Report/ private area
Disallow /Services/ -
Disallow /styles/ -
Disallow /view/opac* -
Disallow /XmlDb/ -

Other Records

Field Value
sitemap http://gutenberg.us/sitemap.xml

Comments

  • robots.txt