sho.com
robots.txt

Robots Exclusion Standard data for sho.com

Resource Scan

Scan Details

Site Domain sho.com
Base Domain sho.com
Scan Status Ok
Last Scan2024-09-01T09:30:40+00:00
Next Scan 2024-10-01T09:30:40+00:00

Last Scan

Scanned2024-09-01T09:30:40+00:00
URL https://sho.com/robots.txt
Redirect https://www.sho.com/robots.txt
Redirect Domain www.sho.com
Redirect Base sho.com
Domain IPs 13.248.160.137, 76.223.34.124
Redirect IPs 108.157.254.105, 108.157.254.118, 108.157.254.129, 108.157.254.46, 2600:9000:2753:2200:1f:a46:1380:93a1, 2600:9000:2753:3e00:1f:a46:1380:93a1, 2600:9000:2753:4600:1f:a46:1380:93a1, 2600:9000:2753:6000:1f:a46:1380:93a1, 2600:9000:2753:7000:1f:a46:1380:93a1, 2600:9000:2753:7600:1f:a46:1380:93a1, 2600:9000:2753:b200:1f:a46:1380:93a1, 2600:9000:2753:cc00:1f:a46:1380:93a1
Response IP 108.157.254.105
Found Yes
Hash e942c1e8d14b96fcd69bf1bb32abf3f0f6b6b690db929cb0dee2c23cf7374c17
SimHash 0d008823ef10

Groups

*

Rule Path
Disallow /tve/
Disallow /realmedia/
Disallow /redzone/
Disallow /scboxing/
Disallow /api/
Disallow /video/affiliate/
Disallow */meta$
Disallow /pr/

nutch

Rule Path
Allow */meta$

adsbot-google, adsbot-google-mobile, mediapartners-google

Rule Path
Allow /video/affiliate/