turbli.com
robots.txt

Robots Exclusion Standard data for turbli.com

Resource Scan

Scan Details

Site Domain turbli.com
Base Domain turbli.com
Scan Status Ok
Last Scan 2024-11-16T13:49:01+00:00
Next Scan 2024-11-23T13:49:01+00:00

Last Scan

Scanned 2024-11-16T13:49:01+00:00
URL https://turbli.com/robots.txt
Domain IPs 104.26.4.15, 104.26.5.15, 172.67.75.166, 2606:4700:20::681a:40f, 2606:4700:20::681a:50f, 2606:4700:20::ac43:4ba6
Response IP 104.26.4.15
Found Yes
Hash 5cd52a0ec85a4e78a1585677bb899e8ba8e589dc17f2f7584c150fe84ac5b143
SimHash c824d810cb30

Groups

*

Rule Path
Disallow /databases/
Disallow /00_root/
Disallow /venv/
Disallow /*/*/*/*

Other Records

Field Value
crawl-delay 10

opebot-v

Rule Path
Allow /
Disallow /databases/
Disallow /00_root/
Disallow /venv/

twitterbot

Rule Path
Disallow /
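
Example

The groups listed above can be checked against sample paths with a short Python sketch. It is illustrative only and makes several assumptions not stated in the scan: that `*` in a rule path matches any run of characters, that a trailing `$` anchors the end of the path, and that the longest matching rule wins with Allow winning ties (the convention described in RFC 9309). The helper names `rule_to_regex` and `is_allowed` and the sample paths are hypothetical, and the `crawl-delay 10` record for the `*` group is not modeled.

```python
import re

# Rule groups transcribed from the scan above, as (rule, path) pairs.
GROUPS = {
    "*": [
        ("disallow", "/databases/"),
        ("disallow", "/00_root/"),
        ("disallow", "/venv/"),
        ("disallow", "/*/*/*/*"),
    ],
    "opebot-v": [
        ("allow", "/"),
        ("disallow", "/databases/"),
        ("disallow", "/00_root/"),
        ("disallow", "/venv/"),
    ],
    "twitterbot": [
        ("disallow", "/"),
    ],
}


def rule_to_regex(path):
    """Compile a rule path into a regex (assumed wildcard semantics)."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape literal segments and let each '*' match any run of characters.
    pattern = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_allowed(rules, url_path):
    """Return True if url_path may be fetched under the given rules.

    Assumption: the longest matching rule wins, Allow wins a tie,
    and a path matched by no rule is allowed.
    """
    best = None  # (rule path length, is_allow)
    for kind, path in rules:
        if rule_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise each rule.
    for agent, path in [
        ("*", "/"),
        ("*", "/venv/bin/python"),
        ("*", "/a/b/c/d"),
        ("opebot-v", "/a/b/c/d"),
        ("twitterbot", "/"),
    ]:
        print(agent, path, is_allowed(GROUPS[agent], path))
```

Under these assumptions, the `/*/*/*/*` rule in the `*` group blocks any path containing at least four slashes, while the `opebot-v` group omits that rule and so permits deep paths outside the three named directories. Python's standard `urllib.robotparser` matches rule paths by plain prefix and does not expand `*`, which is why the sketch uses its own matcher.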