applyindex.org
robots.txt
Robots Exclusion Standard data for applyindex.org
Resource Scan
Scan Details
Site Domain | applyindex.org |
Base Domain | applyindex.org |
Scan Status | Ok |
Last Scan | 2025-07-16T17:20:55+00:00 |
Next Scan | 2025-08-15T17:20:55+00:00 |
Last Scan
Scanned | 2025-07-16T17:20:55+00:00 |
URL | https://applyindex.org/robots.txt |
Redirect | https://applyindex.com/robots.txt |
Redirect Domain | applyindex.com |
Redirect Base | applyindex.com |
Domain IPs | 94.72.110.233 |
Redirect IPs | 104.21.29.119, 172.67.148.249, 2606:4700:3033::ac43:94f9, 2606:4700:3036::6815:1d77 |
Response IP | 104.21.29.119 |
Found | Yes |
Hash | 412cf8e82f72f5d2e8e40a85e50abb15b147a69d4b8301d6bc48ff35a250eb28 |
SimHash | 9815e4c24b33 |
Groups
*
Rule | Path |
---|---|
Disallow | /dashboard/ |
Disallow | /research-supervisors/ |
Disallow | /supervisor/ |
Disallow | /education/ |
Disallow | /service/ |
Disallow | /languages/ |
Disallow | /location/ |
Disallow | /skill/ |
Disallow | /university/page/ |
Disallow | /postdoc-position/page/ |
Disallow | /false/page/ |
Disallow | /position-duration/ |
Disallow | /position-research-area-field-of-study/ |