gutenberg.org
robots.txt

Robots Exclusion Standard data for gutenberg.org

Resource Scan

Scan Details

Site Domain gutenberg.org
Base Domain gutenberg.org
Scan Status Ok
Last Scan 2024-09-13T17:58:40+00:00
Next Scan 2024-09-20T17:58:40+00:00

Last Scan

Scanned 2024-09-13T17:58:40+00:00
URL https://gutenberg.org/robots.txt
Domain IPs 152.19.134.47, 2610:28:3090:3000:0:bad:cafe:47
Response IP 152.19.134.47
Found Yes
Hash 1f9a3a7eb83eff1ebbd9b11c01a440ec80e91c484fc9d1a63d8f5d00087f4973
SimHash a15540404593

Groups

*

Rule Path
Disallow /ebooks/search
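
The group above can be checked against concrete URLs. The following is a minimal sketch using Python's standard urllib.robotparser. It assumes the robots.txt consists of only the single rule listed here; the live file may contain more lines. The two test URLs, the ExampleBot user agent, and the is_allowed helper are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the group listed above (assumption: the real
# robots.txt may contain additional groups or rules not shown here).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /ebooks/search",
]


def is_allowed(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if user_agent may fetch url under ROBOTS_LINES."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines, no network access
    return parser.can_fetch(user_agent, url)


# Hypothetical URLs: one under the disallowed prefix, one outside it.
print(is_allowed("https://gutenberg.org/ebooks/search/?query=dickens"))  # False
print(is_allowed("https://gutenberg.org/ebooks/1342"))  # True
```

Because the rule sits in the * group, it applies to every crawler that has no more specific group of its own. Matching is by path prefix, so any URL whose path begins with /ebooks/search is excluded.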