Robots Exclusion Standard data for invasiveplantatlas.org

Resource Scan

Scan Details

Site Domain invasiveplantatlas.org
Base Domain invasiveplantatlas.org
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-03-18T12:30:35+00:00
Next Scan 2024-06-16T12:30:35+00:00

Last Successful Scan

Scanned 2022-10-27T03:49:59+00:00
URL https://www.invasiveplantatlas.org/robots.txt
Response IP 18.161.97.64, 18.161.97.62, 18.161.97.26, 18.161.97.55
Found Yes
Hash 972cca6468ff9ee6f11f615e951d18e72afe3b53f1ce05152fbda06ab4efd557
SimHash 6a78daf2a333
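
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm, so the following is only a sketch under that assumption; the function name `sha256_hex` and the constant `REPORTED_HASH` are hypothetical names introduced for illustration. It shows how a freshly fetched copy of the file could be compared against the recorded value.

```python
import hashlib

# Value copied from the "Hash" field of the last successful scan.
REPORTED_HASH = "972cca6468ff9ee6f11f615e951d18e72afe3b53f1ce05152fbda06ab4efd557"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """True if the body hashes to the recorded value.

    Assumption: the scanner hashed the raw bytes with SHA-256. If it
    normalised the text first or used another algorithm, this will not match.
    """
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch would not necessarily mean the file changed; it could equally mean the assumption about the algorithm or the hashed bytes is wrong.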

Groups

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

ahrefsbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

psbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

*

Rule Path
Disallow /error/tattle.cfm

Other Records

Field Value
crawl-delay 2
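
The groups listed above can be checked with Python's standard `urllib.robotparser`. The sketch below rebuilds a robots.txt from the reported groups; the exact text, letter case, and line order of the original file are not shown in the report, so `RECONSTRUCTED` is an assumption, and `somecrawler` is a hypothetical user agent standing in for any crawler not named in the file.

```python
from urllib.robotparser import RobotFileParser

# Bots the report lists with "Disallow /".
BLOCKED = ["ahrefsbot", "yandex", "ccbot", "mj12bot",
           "dotbot", "baiduspider", "psbot", "exabot"]

# Assumed reconstruction of the file from the scan report; the real file's
# formatting and ordering may differ.
RECONSTRUCTED = "\n".join(
    ["User-agent: slurp", "Crawl-delay: 5", ""]
    + [line for bot in BLOCKED
       for line in (f"User-agent: {bot}", "Disallow: /", "")]
    + ["User-agent: *", "Disallow: /error/tattle.cfm", "Crawl-delay: 2", ""]
)


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(RECONSTRUCTED)
BASE = "https://www.invasiveplantatlas.org"

# slurp has no rules, so every path is allowed, with a 5-second crawl delay.
slurp_allowed = parser.can_fetch("slurp", BASE + "/error/tattle.cfm")
slurp_delay = parser.crawl_delay("slurp")

# Named bots with "Disallow: /" are blocked from the whole site.
ahrefs_allowed = parser.can_fetch("ahrefsbot", BASE + "/")

# Any other crawler falls through to the "*" group.
other_home = parser.can_fetch("somecrawler", BASE + "/")
other_tattle = parser.can_fetch("somecrawler", BASE + "/error/tattle.cfm")
other_delay = parser.crawl_delay("somecrawler")
```

Note that `urllib.robotparser` matches user agents by case-insensitive substring, which is one interpretation of the Robots Exclusion Standard; other parsers may resolve groups differently.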