www.fon.hum.uva.nl
robots.txt

Robots Exclusion Standard data for www.fon.hum.uva.nl

Resource Scan

Scan Details

Site Domain www.fon.hum.uva.nl
Base Domain uva.nl
Scan Status Ok
Last Scan 2025-07-01T16:27:27+00:00
Next Scan 2025-07-31T16:27:27+00:00

Last Scan

Scanned 2025-07-01T16:27:27+00:00
URL https://www.fon.hum.uva.nl/robots.txt
Domain IPs 136.144.176.44
Response IP 136.144.176.44
Found Yes
Hash 3860870bd2c539274ec08ceef18e06bfd920c2d656766c6b945512b89d6e607a
SimHash 866d98668ab6
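
The Hash value is 64 hexadecimal digits long, which is the length of a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response bytes, a saved copy of the file could be compared against the reported value roughly as follows. The function names `sha256_hex` and `matches_reported_hash` are hypothetical and not part of the scan service.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "3860870bd2c539274ec08ceef18e06bfd920c2d656766c6b945512b89d6e607a"


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of data as lowercase hex."""
    return hashlib.sha256(data).hexdigest()


def matches_reported_hash(data: bytes, reported: str = REPORTED_HASH) -> bool:
    """Check a saved robots.txt body against the reported hash.

    Assumption: the report hashes the raw response bytes with SHA-256.
    """
    return sha256_hex(data) == reported.lower()
```

A mismatch does not necessarily mean the file changed: if the scanner normalizes line endings or encoding before hashing, the digests would differ even for identical rules.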

Groups

*

Rule      Path                                                Comment
Disallow  /IFAcorpus/SLspeech/                                This is an 20 GB store of analysis data
Disallow  /IFAcorpus/SLcorpus/Labels/                         Raw data
Disallow  /IFAcorpus/SLcorpus/DatabaseFiles/                  Raw data
Disallow  /IFAcorpus/SLcorpus/home/                           Raw data
Disallow  /corpus/Browse.html                                 A different link to /IFAcorpus/SLcorpus/
Disallow  /corpus/DBstatistics                                Database Queries
Disallow  /corpus/sentences                                   Database Queries
Disallow  /corpus/Fragment                                    Database Queries
Disallow  /corpus/GetFragment                                 Database Queries
Disallow  /corpus/getrecord                                   Database Queries
Disallow  /IFA-SpokenLanguageCorpora/IFADVcorpus/Intros/      This a store of video files
Disallow  /IFA-SpokenLanguageCorpora/IFADVcorpus/Experiment/  This a store of private experiments
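
The rules in this group are path prefixes that apply to every user agent. A minimal sketch of how a crawler might evaluate them with Python's standard `urllib.robotparser` is shown below. The rule lines are reconstructed from the table, so the exact layout of the real file is an assumption, and the agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

BASE = "https://www.fon.hum.uva.nl"

# Reconstructed from the table above (comments omitted); the real file's
# exact formatting is an assumption.
DISALLOWED_PATHS = [
    "/IFAcorpus/SLspeech/",
    "/IFAcorpus/SLcorpus/Labels/",
    "/IFAcorpus/SLcorpus/DatabaseFiles/",
    "/IFAcorpus/SLcorpus/home/",
    "/corpus/Browse.html",
    "/corpus/DBstatistics",
    "/corpus/sentences",
    "/corpus/Fragment",
    "/corpus/GetFragment",
    "/corpus/getrecord",
    "/IFA-SpokenLanguageCorpora/IFADVcorpus/Intros/",
    "/IFA-SpokenLanguageCorpora/IFADVcorpus/Experiment/",
]


def build_parser(paths):
    """Build a parser for a single 'User-agent: *' group."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    parser = RobotFileParser()
    parser.parse(lines)  # parse in memory, no network access
    return parser


parser = build_parser(DISALLOWED_PATHS)


def allowed(path, agent="ExampleBot"):
    """Return True if the hypothetical agent may fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


# Example checks against the reconstructed rules.
print(allowed("/IFAcorpus/SLspeech/sample.wav"))  # under a disallowed prefix
print(allowed("/IFAcorpus/"))                     # parent directory, no rule
print(allowed("/corpus/Browse.html"))             # exact disallowed path
```

Because matching is by prefix, `/corpus/Fragment` also covers `/corpus/Fragment?id=1`, while the parent `/IFAcorpus/` directory itself stays fetchable.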

Comments

  • robots.txt for http://www.fon.hum.uva.nl/ and http://145.18.230.100/
  • Disallow: /IFA-SpokenLanguageCorpora/IFADVcorpus/Speech/ # This a store of sound files
  • Disallow: /IFA-SpokenLanguageCorpora/IFADVcorpus/Compressed/ # This a store of video files
  • Disallow: /IFA-SpokenLanguageCorpora/IFADVcorpus/Cropped/ # This a store of video files
  • Disallow: /IFA-SpokenLanguageCorpora/IFADVcorpus/DialogCorpus/ # This a store of video files
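
The Disallow lines listed under Comments appear to be commented out in the file, so they do not act as rules. The sketch below illustrates the difference using Python's standard `urllib.robotparser`. It assumes the commented lines begin with `#` in the original file, which the report implies but does not show, and the agent name `ExampleBot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

BASE = "https://www.fon.hum.uva.nl"
PREFIX = "/IFA-SpokenLanguageCorpora/IFADVcorpus"

# Assumption: the Speech rule is commented out with a leading '#',
# while the Intros rule is active and carries a trailing comment.
lines = [
    "User-agent: *",
    f"# Disallow: {PREFIX}/Speech/ # This a store of sound files",
    f"Disallow: {PREFIX}/Intros/ # This a store of video files",
]

parser = RobotFileParser()
parser.parse(lines)  # everything after '#' on a line is ignored

# The commented-out rule has no effect; the active rule blocks the path.
speech_allowed = parser.can_fetch("ExampleBot", f"{BASE}{PREFIX}/Speech/a.wav")
intros_allowed = parser.can_fetch("ExampleBot", f"{BASE}{PREFIX}/Intros/a.mp4")

print(speech_allowed, intros_allowed)
```

Under this reading, the Speech, Compressed, Cropped and DialogCorpus directories are not excluded by the current file, even though they are mentioned in it.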