student.be
robots.txt
Robots Exclusion Standard data for student.be
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | student.be |
Base Domain | student.be |
Scan Status | Ok |
Last Scan | 2024-05-26T21:28:24+00:00 |
Next Scan | 2024-06-25T21:28:24+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-26T21:28:24+00:00 |
URL | https://student.be/robots.txt |
Domain IPs | 52.212.52.84, 54.247.69.169, 63.32.161.232 |
Response IP | 52.212.52.84 |
Found | Yes |
Hash | a8721b985607ac910e492368c0967c11b19a5a5401cc892495ecf3a0c8a96442 |
SimHash | b151c14715f0 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | */users* |
Disallow | */messages* |
Disallow | */admin$ |
Disallow | */admin/* |
Disallow | /*? |
Disallow | /ads.txt$ |
Allow | /*.css$ |
Allow | /*.js$ |
Disallow | */api* |
Disallow | */index.cfm$ |
Disallow | */.well-known/assetlinks.json$ |
Disallow | */utilisateurs.json$ |
Disallow | */internships.json$ |
Disallow | */studentenjobs.json$ |
Disallow | */jobs-etudiants.json$ |
Disallow | */first-jobs.json$ |
Disallow | */eerste-jobs.json$ |
Disallow | */login* |
Disallow | *internships.html$ |
Disallow | *kot_a_louer.html$ |
Disallow | *stages.html$ |
Disallow | */employer/new-ad* |
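
The rules above rely on two pattern features: `*` matches any run of characters, and a trailing `$` anchors the pattern to the end of the URL. The following is a minimal sketch of how a crawler might evaluate them, not the scanner's or any search engine's actual implementation. It assumes the longest-match precedence described in RFC 9309, assumes that a tie between an Allow and a Disallow rule goes to Allow, and assumes that a pattern starting with `*` is treated as a plain wildcard. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical. Python's standard `urllib.robotparser` is not used here because it does not interpret `*` and `$` inside paths.

```python
import re

# Rules copied from the "*" group in the table above, in the same order.
RULES = [
    ("disallow", "*/users*"),
    ("disallow", "*/messages*"),
    ("disallow", "*/admin$"),
    ("disallow", "*/admin/*"),
    ("disallow", "/*?"),
    ("disallow", "/ads.txt$"),
    ("allow", "/*.css$"),
    ("allow", "/*.js$"),
    ("disallow", "*/api*"),
    ("disallow", "*/index.cfm$"),
    ("disallow", "*/.well-known/assetlinks.json$"),
    ("disallow", "*/utilisateurs.json$"),
    ("disallow", "*/internships.json$"),
    ("disallow", "*/studentenjobs.json$"),
    ("disallow", "*/jobs-etudiants.json$"),
    ("disallow", "*/first-jobs.json$"),
    ("disallow", "*/eerste-jobs.json$"),
    ("disallow", "*/login*"),
    ("disallow", "*internships.html$"),
    ("disallow", "*kot_a_louer.html$"),
    ("disallow", "*stages.html$"),
    ("disallow", "*/employer/new-ad*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the end of the URL;
    everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (URL path plus optional query) may be crawled.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, rule is an Allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ["/", "/en/users/123", "/static/app.js", "/jobs?page=2"]:
        print(sample, is_allowed(sample))
```

One consequence worth noting under these assumptions: `/*?` blocks any URL containing a query string, and because `/*.js$` is end-anchored, a script URL such as `/static/app.js?v=3` matches only the Disallow rule and is blocked, while `/static/app.js` is allowed.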
Other Records
Field | Value |
---|---|
sitemap | https://www.student.be/sitemap.xml.gz |