tun.com
robots.txt

Robots Exclusion Standard data for tun.com

Resource Scan

Scan Details

Site Domain tun.com
Base Domain tun.com
Scan Status Ok
Last Scan2024-09-26T03:44:48+00:00
Next Scan 2024-10-03T03:44:48+00:00

Last Scan

Scanned2024-09-26T03:44:48+00:00
URL https://tun.com/robots.txt
Redirect https://www.tun.com/robots.txt
Redirect Domain www.tun.com
Redirect Base tun.com
Domain IPs 66.178.176.115
Redirect IPs 66.178.176.115
Response IP 66.178.176.115
Found Yes
Hash 431c92cc41f49abe7574c469f02f1727578ce7c836f1c57828db34027b0cc017
SimHash 24049fa34613

Groups

googlebot

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Allow /

slurp

Rule Path
Allow /

msnbot

Rule Path
Allow /

msnbot-media

Rule Path
Allow /

teoma

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

scrubby

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

baiduspider+(+http://www.baidu.com/search/spider.htm)

Rule Path
Disallow /

baiduspider-ads

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

*

Rule Path
Disallow index.php/latest/
Disallow /latest/
Disallow index.php/cms/
Disallow images/admin/
Disallow images/badges/
Disallow images/states/
Disallow /StudentDiscounts/
Disallow /tunBeta/
Disallow /teachers/

Other Records

Field Value
sitemap http://www.tun.com/sitemap.xml

Comments

  • robots.txt created for http://tun.com damians