turnitin.com
robots.txt

Robots Exclusion Standard data for turnitin.com

Resource Scan

Scan Details

Site Domain turnitin.com
Base Domain turnitin.com
Scan Status Ok
Last Scan2024-11-13T23:27:18+00:00
Next Scan 2024-11-27T23:27:18+00:00

Last Scan

Scanned2024-11-13T23:27:18+00:00
URL https://turnitin.com/robots.txt
Redirect https://www.turnitin.com/robots.txt
Redirect Domain www.turnitin.com
Redirect Base turnitin.com
Domain IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Redirect IPs 151.101.130.133, 151.101.194.133, 151.101.2.133, 151.101.66.133
Response IP 199.232.46.133
Found Yes
Hash 7431b2d6e72d20f29db88053a4e4768139f73f7007bb8300c70d720c0ac5bbea
SimHash cd04641d9042

Groups

*

Rule Path
Disallow /newreport
Disallow /newuser
Disallow /password
Disallow /viewEmerald.asp
Disallow /viewProquest.asp
Disallow /viewInternet.asp
Disallow /viewInternetArchive1.asp
Disallow /grademark3
Disallow /s_home.asp
Disallow /t_home.asp
Disallow /t_inbox.asp
Disallow /paperInfo.asp
Disallow /s_class_portfolio.asp
Disallow /t_class_add_confirm.asp
Disallow /paperPermission.asp

Other Records

Field Value
sitemap https://www.turnitin.com/sitemap-index.xml