turi.org
robots.txt

Robots Exclusion Standard data for turi.org

Resource Scan

Scan Details

Site Domain turi.org
Base Domain turi.org
Scan Status Ok
Last Scan2024-06-02T11:44:28+00:00
Next Scan 2024-07-02T11:44:28+00:00

Last Scan

Scanned2024-06-02T11:44:28+00:00
URL https://turi.org/robots.txt
Redirect https://www.turi.org/robots.txt
Redirect Domain www.turi.org
Redirect Base turi.org
Domain IPs 104.21.88.34, 172.67.150.79, 2606:4700:3035::ac43:964f, 2606:4700:3037::6815:5822
Redirect IPs 104.21.88.34, 172.67.150.79, 2606:4700:3035::ac43:964f, 2606:4700:3037::6815:5822
Response IP 172.67.150.79
Found Yes
Hash 22e7555c09af294cd54c6b4ec9331fed8d26357173ecef6ef2781f244fb2f037
SimHash 67791820cab0

Groups

*

Rule Path
Disallow /Users/
Disallow /Users
Disallow /Media/
Disallow /Media
Disallow /Top_Menu/
Disallow /Top_Menu
Disallow /content/view/full/43/
Disallow /content/view/full/43
Disallow /content/view/full/5
Disallow /content/view/full/5/
Disallow /content/search
Disallow /content/advancedsearch

adsbot-google

Rule Path
Disallow /Users/
Disallow /Users
Disallow /Media/
Disallow /Media
Disallow /Top_Menu/
Disallow /Top_Menu
Disallow /content/view/full/43/
Disallow /content/view/full/43
Disallow /content/view/full/5
Disallow /content/view/full/5/
Disallow /content/search
Disallow /content/advancedsearch

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap http://turi.org/sitemap_new.xml