sn.angry.im
robots.txt

Robots Exclusion Standard data for sn.angry.im

Resource Scan

Scan Details

Site Domain sn.angry.im
Base Domain angry.im
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer redirected incorrectly.
Last Scan2024-09-12T17:44:49+00:00
Next Scan 2024-10-12T17:44:49+00:00

Last Successful Scan

Scanned2024-08-20T17:43:44+00:00
URL https://sn.angry.im/robots.txt
Domain IPs 104.21.41.194, 172.67.166.212, 2606:4700:3034::6815:29c2, 2606:4700:3034::ac43:a6d4
Response IP 172.67.166.212
Found Yes
Hash 22b1ebbd6a9a4ed2687c439c5362cb45662c4129532bc2837162f82e34df006d
SimHash ba54bb647343

Groups

*

Rule Path
Disallow /media_proxy/
Disallow /interact/

baiduspider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

new-sogou-spider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

outfoxbot

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file

Warnings

  • 2 invalid lines.