tululu.org
robots.txt

Robots Exclusion Standard data for tululu.org

Resource Scan

Scan Details

Site Domain tululu.org
Base Domain tululu.org
Scan Status Ok
Last Scan2025-04-06T12:56:41+00:00
Next Scan 2025-04-13T12:56:41+00:00

Last Scan

Scanned2025-04-06T12:56:41+00:00
URL https://tululu.org/robots.txt
Domain IPs 104.21.82.5, 172.67.167.88, 2606:4700:3030::ac43:a758, 2606:4700:3034::6815:5205
Response IP 172.67.167.88
Found Yes
Hash e34a1caa775290ab44cb1f3aaa65e886c86b8c49a9400539e25eace1839b2bd8
SimHash 6440f860cab3

Groups

*

Rule Path
Disallow /cgi-bin/

Other Records

Field Value
crawl-delay 2.0

Comments

  • Clean-param: page /read-*/

Warnings

  • `host` is not a known field.