sciencemediacentre.org
robots.txt

Robots Exclusion Standard data for sciencemediacentre.org

Resource Scan

Scan Details

Site Domain sciencemediacentre.org
Base Domain sciencemediacentre.org
Scan Status Ok
Last Scan2024-05-23T04:53:48+00:00
Next Scan 2024-06-22T04:53:48+00:00

Last Scan

Scanned2024-05-23T04:53:48+00:00
URL https://sciencemediacentre.org/robots.txt
Domain IPs 2a00:1098:86:9a::1, 93.93.131.151
Response IP 93.93.131.151
Found Yes
Hash 27bd4a9de0f970fbe1b19cd7627b1b0e3278775fd099b66d1c6cd1a6eb3f3a30
SimHash 9b9e620a6233

Groups

*

Rule Path
Disallow /robots.txt
Disallow /roombook.htm
Disallow /images/
Disallow /downloads/
Disallow /style.css
Disallow /.htaccess
Disallow /404_filenotfound.htm
Disallow /403_noaccess.htm
Disallow /_web_archive/

acke.dc.luth.se

Rule Path
Disallow /

dallas.mt.cs.cmu.edu

Rule Path
Disallow /

darkwing.cadvision.com

Rule Path
Disallow /

waldec.com

Rule Path
Disallow /

www2000.ogsm.vanderbilt.edu

Rule Path
Disallow /

unet.ca

Rule Path
Disallow /

murph.cais.net

Rule Path
Disallow /

spyder3.microsys.com

Rule Path
Disallow /

www.freeloader.com.

Rule Path
Disallow /

Warnings

  • 6 invalid lines.