apirnet.ilo.org
robots.txt

Robots Exclusion Standard data for apirnet.ilo.org

Resource Scan

Scan Details

Site Domain apirnet.ilo.org
Base Domain ilo.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-04-04T14:10:18+00:00
Next Scan 2025-07-03T14:10:18+00:00

Last Successful Scan

Scanned 2022-05-20T23:27:41+00:00
URL https://apirnet.ilo.org/robots.txt
Response IP 202.80.252.152
Found Yes
Hash 9f939caf5a06482c2ed47b3c60886b4e9a37635ffe1c6616c41d4deed8d63606
SimHash ac718b554d65

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow /*sendto_form$
Disallow /*folder_factories$
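
To illustrate how the two groups above behave, here is a minimal sketch that rebuilds a robots.txt from the listed rules and checks the "*" group with Python's standard urllib.robotparser. The file text is a hypothetical reconstruction (the original layout is not shown in this report), and "ExampleBot" and the tested URL path are made-up names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction from the groups listed in this report;
# the original file's exact layout and comment placement are not shown.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow:

User-agent: googlebot
Disallow: /*sendto_form$
Disallow: /*folder_factories$
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED_ROBOTS_TXT.splitlines())

# "ExampleBot" is a made-up crawler name, so it falls under the "*" group.
# An empty Disallow value means nothing is excluded for that group.
generic_allowed = parser.can_fetch(
    "ExampleBot", "https://apirnet.ilo.org/any/page"
)
print(generic_allowed)
```

The check is limited to the "*" group on purpose: the googlebot rules rely on wildcard syntax, which a plain prefix-matching parser is not guaranteed to interpret the way Googlebot does.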

Comments

  • Define access-restrictions for robots/spiders
  • http://www.robotstxt.org/wc/norobots.html
  • By default we allow robots to access all areas of our site
  • already accessible to anonymous users
  • Add Googlebot-specific syntax extension to exclude forms
  • that are repeated for each piece of content in the site
  • the wildcard is only supported by Googlebot
  • http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling
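
The Googlebot-specific patterns mentioned in these comments can be sketched as a small matcher. This is an assumption-laden illustration, not Googlebot's actual implementation: it treats "*" as "any run of characters" and a trailing "$" as "end of the URL path", which is how Google documents the extension. The function names and the sample paths are hypothetical.

```python
import re

def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a Googlebot-style path pattern into a compiled regex.

    Assumed semantics: "*" matches any sequence of characters and a
    trailing "$" anchors the pattern at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))

def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any pattern matches the path from its start."""
    return any(pattern_to_regex(p).match(path) for p in patterns)

# The two googlebot rules listed in this report.
GOOGLEBOT_DISALLOW = ["/*sendto_form$", "/*folder_factories$"]

# Hypothetical paths, chosen only to exercise the rules.
print(is_disallowed("/news/item-1/sendto_form", GOOGLEBOT_DISALLOW))
print(is_disallowed("/news/item-1", GOOGLEBOT_DISALLOW))
```

Under these assumptions, a path ending in sendto_form or folder_factories is excluded for Googlebot, while the content page itself stays crawlable, which matches the stated intent of excluding forms that are repeated for each piece of content.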