turi.org
robots.txt
Robots Exclusion Standard data for turi.org
Resource Scan
Scan Details
Site Domain | turi.org |
Base Domain | turi.org |
Scan Status | Ok |
Last Scan | 2024-06-02T11:44:28+00:00 |
Next Scan | 2024-07-02T11:44:28+00:00 |
Last Scan
Scanned | 2024-06-02T11:44:28+00:00 |
URL | https://turi.org/robots.txt |
Redirect | https://www.turi.org/robots.txt |
Redirect Domain | www.turi.org |
Redirect Base | turi.org |
Domain IPs | 104.21.88.34, 172.67.150.79, 2606:4700:3035::ac43:964f, 2606:4700:3037::6815:5822 |
Redirect IPs | 104.21.88.34, 172.67.150.79, 2606:4700:3035::ac43:964f, 2606:4700:3037::6815:5822 |
Response IP | 172.67.150.79 |
Found | Yes |
Hash | 22e7555c09af294cd54c6b4ec9331fed8d26357173ecef6ef2781f244fb2f037 |
SimHash | 67791820cab0 |
Groups
*
Rule | Path |
---|---|
Disallow | /Users/ |
Disallow | /Users |
Disallow | /Media/ |
Disallow | /Media |
Disallow | /Top_Menu/ |
Disallow | /Top_Menu |
Disallow | /content/view/full/43/ |
Disallow | /content/view/full/43 |
Disallow | /content/view/full/5 |
Disallow | /content/view/full/5/ |
Disallow | /content/search |
Disallow | /content/advancedsearch |
adsbot-google
Rule | Path |
---|---|
Disallow | /Users/ |
Disallow | /Users |
Disallow | /Media/ |
Disallow | /Media |
Disallow | /Top_Menu/ |
Disallow | /Top_Menu |
Disallow | /content/view/full/43/ |
Disallow | /content/view/full/43 |
Disallow | /content/view/full/5 |
Disallow | /content/view/full/5/ |
Disallow | /content/search |
Disallow | /content/advancedsearch |
Other Records
Field | Value |
---|---|
sitemap | http://turi.org/sitemap_new.xml |