Robots Exclusion Standard data for student.be
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | student.be |
Base Domain | student.be |
Scan Status | Ok |
Last Scan | 2024-09-23T21:29:59+00:00 |
Next Scan | 2024-10-23T21:29:59+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-23T21:29:59+00:00 |
URL | https://student.be/robots.txt |
Domain IPs | 52.212.52.84, 54.247.69.169, 63.32.161.232 |
Response IP | 54.247.69.169 |
Found | Yes |
Hash | 70d25240509a0165e876aec801d87f92e6b92d092801eb135dc3fe8842237a5a |
SimHash | a150cb54bd73 |
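The Hash value is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The sketch below assumes (the report does not say so) that it is the SHA-256 of the raw response body, and shows how a locally saved copy of the file could be compared against it. The function names `body_sha256` and `matches_recorded_hash` are hypothetical helpers, not part of any scanner API.

```python
import hashlib

# Hash value copied from the scan record above.
# Assumption: it is a SHA-256 digest of the raw robots.txt bytes.
RECORDED_HASH = "70d25240509a0165e876aec801d87f92e6b92d092801eb135dc3fe8842237a5a"


def body_sha256(body: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of a response body."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a body's digest with a recorded hex digest, ignoring case."""
    return body_sha256(body) == recorded.strip().lower()
```

A mismatch would not necessarily mean the file changed: the scanner may hash a normalized form of the body, or use a different algorithm altogether.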
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | */users* |
Disallow | */messages* |
Disallow | */admin$ |
Disallow | */admin/* |
Disallow | /*? |
Disallow | /ads.txt$ |
Allow | /*.css$ |
Allow | /*.js$ |
Disallow | */api* |
Disallow | */index.cfm$ |
Disallow | */.well-known/assetlinks.json$ |
Disallow | */utilisateurs.json$ |
Disallow | */internships.json* |
Disallow | */studentenjobs.json* |
Disallow | */jobs-etudiants.json* |
Disallow | */first-jobs.json* |
Disallow | */eerste-jobs.json* |
Disallow | */login* |
Disallow | *internships.html* |
Disallow | *kot_a_louer.html* |
Disallow | *stages.html* |
Disallow | */employer/new-ad* |
Disallow | */job_etudiants.html* |
Disallow | */job-stage-memoire.html* |
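To see how these rules combine, the following is a minimal sketch of a matcher that follows the usual Robots Exclusion Protocol conventions: `*` matches any run of characters, a trailing `$` anchors the end of the URL path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative. Several patterns here begin with `*` rather than `/`, and real crawlers may treat those differently from this sketch.

```python
import re

# Rules copied from the table above, in file order.
RULES = [
    ("disallow", "*/users*"), ("disallow", "*/messages*"),
    ("disallow", "*/admin$"), ("disallow", "*/admin/*"),
    ("disallow", "/*?"), ("disallow", "/ads.txt$"),
    ("allow", "/*.css$"), ("allow", "/*.js$"),
    ("disallow", "*/api*"), ("disallow", "*/index.cfm$"),
    ("disallow", "*/.well-known/assetlinks.json$"),
    ("disallow", "*/utilisateurs.json$"),
    ("disallow", "*/internships.json*"),
    ("disallow", "*/studentenjobs.json*"),
    ("disallow", "*/jobs-etudiants.json*"),
    ("disallow", "*/first-jobs.json*"),
    ("disallow", "*/eerste-jobs.json*"),
    ("disallow", "*/login*"),
    ("disallow", "*internships.html*"),
    ("disallow", "*kot_a_louer.html*"),
    ("disallow", "*stages.html*"),
    ("disallow", "*/employer/new-ad*"),
    ("disallow", "*/job_etudiants.html*"),
    ("disallow", "*/job-stage-memoire.html*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the strongest match so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as prefix matching requires.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/assets/app.css")` is true, while `is_allowed("/assets/app.css?v=2")` is false: the `$` in `/*.css$` no longer matches once a query string is appended, so only `/*?` applies.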
Other Records
Field | Value |
---|---|
sitemap | https://www.student.be/sitemap.xml.gz |
Comments