tropmedres.ac
robots.txt

Robots Exclusion Standard data for tropmedres.ac

Resource Scan

Scan Details

Site Domain tropmedres.ac
Base Domain tropmedres.ac
Scan Status Ok
Last Scan2026-01-21T17:06:40+00:00
Next Scan 2026-02-20T17:06:40+00:00

Last Scan

Scanned2026-01-21T17:06:40+00:00
URL https://tropmedres.ac/robots.txt
Redirect https://www.tropmedres.ac/robots.txt
Redirect Domain www.tropmedres.ac
Redirect Base tropmedres.ac
Domain IPs 52.56.123.235
Redirect IPs 52.56.123.235
Response IP 52.56.123.235
Found Yes
Hash 169d0bd0808f93058cb4c382a8d1d7ea96e4a0ff39c6cd1a6bc1970dba6efdcc
SimHash ac510b554d65

Groups

*

Rule Path
Disallow /images
Disallow /*/Plone
Disallow */%40%40modal

googlebot

Rule Path
Disallow /*sendto_form$
Disallow /*folder_factories$

Other Records

Field Value
sitemap https://www.tropmedres.ac/sitemap.xml.gz

Comments

  • Define access-restrictions for robots/spiders
  • http://www.robotstxt.org/wc/norobots.html
  • By default we allow robots to access all areas of our site
  • already accessible to anonymous users
  • Add Googlebot-specific syntax extension to exclude forms
  • that are repeated for each piece of content in the site
  • the wildcard is only supported by Googlebot
  • http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling