student.be
robots.txt

Robots Exclusion Standard data for student.be

Resource Scan

Scan Details

Site Domain student.be
Base Domain student.be
Scan Status Ok
Last Scan 2024-09-23T21:29:59+00:00
Next Scan 2024-10-23T21:29:59+00:00

Last Scan

Scanned 2024-09-23T21:29:59+00:00
URL https://student.be/robots.txt
Domain IPs 52.212.52.84, 54.247.69.169, 63.32.161.232
Response IP 54.247.69.169
Found Yes
Hash 70d25240509a0165e876aec801d87f92e6b92d092801eb135dc3fe8842237a5a
SimHash a150cb54bd73

Groups

*

Rule Path
Disallow */users*
Disallow */messages*
Disallow */admin$
Disallow */admin/*
Disallow /*?
Disallow /ads.txt$
Allow /*.css$
Allow /*.js$
Disallow */api*
Disallow */index.cfm$
Disallow */.well-known/assetlinks.json$
Disallow */utilisateurs.json$
Disallow */internships.json*
Disallow */studentenjobs.json*
Disallow */jobs-etudiants.json*
Disallow */first-jobs.json*
Disallow */eerste-jobs.json*
Disallow */login*
Disallow *internships.html*
Disallow *kot_a_louer.html*
Disallow *stages.html*
Disallow */employer/new-ad*
Disallow */job_etudiants.html*
Disallow */job-stage-memoire.html*
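
The rules above rely on two wildcard characters: `*` (any run of characters) and a trailing `$` (end of the URL). As a rough illustration of how a crawler might evaluate them, the sketch below translates a few of the listed patterns into regular expressions and applies a longest-match rule where Allow wins ties. The helper names (`pattern_to_regex`, `is_allowed`), the choice of rules copied into `RULES`, and the tie-breaking policy are assumptions made for illustration; real crawlers may differ, particularly in how they treat patterns that begin with `*` instead of `/`.

```python
import re

# A subset of the rules from the table above, as (directive, pattern) pairs.
RULES = [
    ("Disallow", "*/users*"),
    ("Disallow", "*/admin$"),
    ("Disallow", "*/admin/*"),
    ("Disallow", "/*?"),
    ("Disallow", "/ads.txt$"),
    ("Allow", "/*.css$"),
    ("Allow", "/*.js$"),
    ("Disallow", "*/login*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the
    pattern to the end of the URL. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumed policy: the longest matching pattern wins, and Allow wins
    when an Allow and a Disallow pattern have equal length. A path that
    matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for directive, pattern in rules:
        # re.match anchors at the start, mirroring prefix matching.
        if pattern_to_regex(pattern).match(path):
            allow = directive.lower() == "allow"
            if len(pattern) > best_len or (
                len(pattern) == best_len and allow and not best_allow
            ):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for path in ["/", "/admin", "/administrator", "/static/app.css",
                 "/static/app.css?v=2", "/en/users/42"]:
        print(path, is_allowed(path))
```

Under these assumptions, `/admin` is blocked while `/administrator` is not, because `*/admin$` is anchored to the end of the URL and `*/admin/*` requires a following slash. A stylesheet such as `/static/app.css` is allowed, but the same URL with a query string is blocked by `/*?`, since the `$` in `/*.css$` no longer matches.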

Other Records

Field Value
sitemap https://www.student.be/sitemap.xml.gz

Comments

  • Rows added 2023-08-02
  • Rows added 2023-08-03
  • Rows added 2023-08-07
  • Rows added 2023-08-11
  • Rows added 2024-06-20
  • Sitemap