sarahau.com
robots.txt

Robots Exclusion Standard data for sarahau.com

Resource Scan

Scan Details

Site Domain sarahau.com
Base Domain sarahau.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-08-28T12:01:40+00:00
Next Scan 2024-11-26T12:01:40+00:00

Last Successful Scan

Scanned 2024-02-01T09:32:19+00:00
URL http://sarahau.com/robots.txt
Redirect http://ww12.sarahau.com/robots.txt
Redirect Domain ww12.sarahau.com
Redirect Base sarahau.com
Domain IPs 172.234.25.151
Redirect IPs 13.248.148.254, 76.223.26.96
Response IP 13.248.148.254
Found Yes
Hash 81150fed4cd6b900092954012f0a8181687ab60105f4ff82e33b6f19277123f6
SimHash 64a75840449a

Groups

googlebot

Rule Path
Disallow /?*

baiduspider

Rule Path
Disallow /?*

yandexbot

Rule Path
Disallow /?*

ichiro

Rule Path
Disallow /?*

sogou spider

Rule Path
Disallow /?*

sosospider

Rule Path
Disallow /?*

youdaobot

Rule Path
Disallow /?*

yetibot

Rule Path
Disallow /?*

bingbot

Rule Path
Disallow /?*

Other Records

Field Value
crawl-delay 2

yahoo! slurp

Rule Path
Disallow /?*

Other Records

Field Value
crawl-delay 2

rdfbot

Rule Path
Disallow /?*

seznambot

Rule Path
Disallow /?*

ia_archiver

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow
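
The groups above use only two rule shapes: `Disallow /?*` and an empty `Disallow`. The sketch below shows how a crawler might evaluate them, assuming the wildcard semantics of RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and an empty `Disallow` value blocks nothing). The function names `rule_matches` and `is_disallowed` are hypothetical and are not taken from the scanned site or from any particular crawler.

```python
import re


def rule_matches(pattern: str, target: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path plus query.

    Assumed semantics (RFC 9309 style): matching starts at the beginning of
    the target, '*' matches any sequence of characters, and a trailing '$'
    requires the match to end exactly at the end of the target.
    """
    if pattern == "":
        # An empty Disallow value matches nothing, so nothing is blocked.
        return False
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal characters, then join the pieces with '.*' for each '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    return re.match(regex, target) is not None


def is_disallowed(disallow_patterns, target: str) -> bool:
    """Return True if any Disallow pattern in the group matches the target."""
    return any(rule_matches(p, target) for p in disallow_patterns)


# Group as listed for googlebot, bingbot, and most other agents in this report.
query_blocking_group = ["/?*"]
# Group as listed for ia_archiver and mediapartners-google (empty Disallow).
open_group = [""]

print(is_disallowed(query_blocking_group, "/?q=shoes"))    # True
print(is_disallowed(query_blocking_group, "/about?x=1"))   # False
print(is_disallowed(query_blocking_group, "/"))            # False
print(is_disallowed(open_group, "/?q=shoes"))              # False
```

Under these assumptions, `/?*` blocks only URLs whose path is `/` followed directly by a query string. The trailing `*` is redundant, because robots.txt patterns already match by prefix.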

Warnings

  • `request-rate` is not a known field.