4hu19j.com
robots.txt

Robots Exclusion Standard data for 4hu19j.com

Resource Scan

Scan Details

Site Domain 4hu19j.com
Base Domain 4hu19j.com
Scan Status Ok
Last Scan2024-05-21T12:19:59+00:00
Next Scan 2024-06-20T12:19:59+00:00

Last Scan

Scanned2024-05-21T12:19:59+00:00
URL http://4hu19j.com/robots.txt
Domain IPs 23.225.148.166
Response IP 23.225.148.166
Found Yes
Hash 80a0d514bc35c040480f0b80cef7ccaa181d4055cb69df8d05a4a1cefb15ca15
SimHash 2041d713c131

Groups

baiduspider

Rule Path
Allow /

sogou spider

Rule Path
Allow /

googlebot

Rule Path
Disallow /

googlebot-mobile

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

mediapartners-ëñë÷google

Rule Path
Disallow /

adsbot-google

Rule Path
Disallow /

feedfetcher-google

Rule Path
Disallow /

yahoo! slurp

Rule Path
Disallow /

yahoo! slurp china

Rule Path
Disallow /

yahoo!-adcrawler

Rule Path
Disallow /

yahoo slurp

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

scooter

Rule Path
Disallow /

lycos_spider_(t-rex)

Rule Path
Disallow /

fast-webcrawler

Rule Path
Disallow /

slurp

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

tomato bot

Rule Path
Disallow /

*

Rule Path
Disallow /

Warnings

  • 2 invalid lines.