student.com
robots.txt
Robots Exclusion Standard data for student.com
Resource Scan
Scan Details
Site Domain | student.com |
Base Domain | student.com |
Scan Status | Ok |
Last Scan | 2024-06-05T09:36:54+00:00 |
Next Scan | 2024-06-19T09:36:54+00:00 |
Last Scan
Scanned | 2024-06-05T09:36:54+00:00 |
URL | https://student.com/robots.txt |
Redirect | https://www.student.com/robots.txt |
Redirect Domain | www.student.com |
Redirect Base | student.com |
Domain IPs | 13.251.222.126, 18.140.79.229 |
Redirect IPs | 151.101.130.49, 151.101.194.49, 151.101.2.49, 151.101.66.49 |
Response IP | 199.232.46.49 |
Found | Yes |
Hash | 57bed4d9a39400c02f11e282a33c7c69ed0232e8d0417ec6899f94b09969dd75 |
SimHash | 753151040e50 |
Groups
*
Rule | Path |
---|---|
Allow | /css |
Allow | /css/*?* |
Allow | /js |
Allow | /js/*?* |
Allow | /api/search/universities |
Allow | /api/universities |
Allow | /wp-content/* |
Allow | /wp-includes/* |
Allow | *?utm* |
Disallow | /set/ |
Disallow | /reviews |
Disallow | /*/enquiry/* |
Disallow | /enquiry/* |
Disallow | /*/compare/* |
Disallow | /compare/* |
Disallow | /*/my-account/* |
Disallow | /my-account/* |
Disallow | /wechat-qr-code |
Disallow | /articles/search/* |
Disallow | */null |
Disallow | *?* |
Other Records
Field | Value |
---|---|
sitemap | https://www.student.com/sitemap.xml |