turnitin.com
robots.txt
Robots Exclusion Standard data for turnitin.com
Resource Scan
Scan Details
Site Domain | turnitin.com |
Base Domain | turnitin.com |
Scan Status | Ok |
Last Scan | 2024-11-13T23:27:18+00:00 |
Next Scan | 2024-11-27T23:27:18+00:00 |
Last Scan
Scanned | 2024-11-13T23:27:18+00:00 |
URL | https://turnitin.com/robots.txt |
Redirect | https://www.turnitin.com/robots.txt |
Redirect Domain | www.turnitin.com |
Redirect Base | turnitin.com |
Domain IPs | 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133 |
Redirect IPs | 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133 |
Response IP | 199.232.46.133 |
Found | Yes |
Hash | 7431b2d6e72d20f29db88053a4e4768139f73f7007bb8300c70d720c0ac5bbea |
SimHash | cd04641d9042 |
Groups
*
Rule | Path |
---|---|
Disallow | /newreport |
Disallow | /newuser |
Disallow | /password |
Disallow | /viewEmerald.asp |
Disallow | /viewProquest.asp |
Disallow | /viewInternet.asp |
Disallow | /viewInternetArchive1.asp |
Disallow | /grademark3 |
Disallow | /s_home.asp |
Disallow | /t_home.asp |
Disallow | /t_inbox.asp |
Disallow | /paperInfo.asp |
Disallow | /s_class_portfolio.asp |
Disallow | /t_class_add_confirm.asp |
Disallow | /paperPermission.asp |
Other Records
Field | Value |
---|---|
sitemap | https://www.turnitin.com/sitemap-index.xml |