student.be
robots.txt

Robots Exclusion Standard data for student.be

Resource Scan

Scan Details

Site Domain student.be
Base Domain student.be
Scan Status Ok
Last Scan2024-05-26T21:28:24+00:00
Next Scan 2024-06-25T21:28:24+00:00

Last Scan

Scanned2024-05-26T21:28:24+00:00
URL https://student.be/robots.txt
Domain IPs 52.212.52.84, 54.247.69.169, 63.32.161.232
Response IP 52.212.52.84
Found Yes
Hash a8721b985607ac910e492368c0967c11b19a5a5401cc892495ecf3a0c8a96442
SimHash b151c14715f0

Groups

*

Rule Path
Disallow */users*
Disallow */messages*
Disallow */admin$
Disallow */admin/*
Disallow /*?
Disallow /ads.txt$
Allow /*.css$
Allow /*.js$
Disallow */api*
Disallow */index.cfm$
Disallow */.well-known/assetlinks.json$
Disallow */utilisateurs.json$
Disallow */internships.json$
Disallow */studentenjobs.json$
Disallow */jobs-etudiants.json$
Disallow */first-jobs.json$
Disallow */eerste-jobs.json$
Disallow */login*
Disallow *internships.html$
Disallow *kot_a_louer.html$
Disallow *stages.html$
Disallow */employer/new-ad*

Other Records

Field Value
sitemap https://www.student.be/sitemap.xml.gz

Comments

  • Rows added 2023-08-02
  • Rows added 2023-08-03
  • Rows added 2023-08-07
  • Rows added 2023-08-11
  • Sitemap