news.sogou.com
robots.txt

Robots Exclusion Standard data for news.sogou.com

Resource Scan

Scan Details

Site Domain news.sogou.com
Base Domain sogou.com
Scan Status Ok
Last Scan2024-10-21T02:05:40+00:00
Next Scan 2024-11-20T02:05:40+00:00

Last Scan

Scanned2024-10-21T02:05:40+00:00
URL https://news.sogou.com/robots.txt
Domain IPs 43.153.236.147, 43.153.249.87
Response IP 43.153.249.87
Found Yes
Hash 795cf691b23b7bcdbad0873483b789d70f89d28259b24fbb05767f5c8e92ae3f
SimHash ba44dd1269ab

Groups

sogou web spider

Rule Path
Disallow /news?

sogou inst spider

Rule Path
Disallow /news?

sogou spider2

Rule Path
Disallow /news?

sogou blog

Rule Path
Disallow /news?

sogou news spider

Rule Path
Disallow /news?

sogou orion spider

Rule Path
Disallow /news?

jikespider

Rule Path
Disallow /news?

sosospider

Rule Path
Disallow /news?

googlebot

Rule Path
Disallow /news?

msnbot

Rule Path
Disallow /news?

baiduspider-image

Rule Path
Disallow /news?

youdaobot

Rule Path
Disallow /news?

baiduspider

Rule Path
Disallow /news?

*

Rule Path
Disallow /

Warnings

  • 2 invalid lines.