/.well-known/

Log In Sign Up

apirnet.ilo.org
robots.txt

Robots Exclusion Standard data for apirnet.ilo.org

Archived Snapshots

Resource Scan

Scan Details

Site Domain	apirnet.ilo.org
Base Domain	ilo.org
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-04-04T14:10:18+00:00
Next Scan	2025-07-03T14:10:18+00:00

Last Successful Scan

Scanned	2022-05-20T23:27:41+00:00
URL	https://apirnet.ilo.org/robots.txt
Response IP	202.80.252.152
Found	Yes
Hash	9f939caf5a06482c2ed47b3c60886b4e9a37635ffe1c6616c41d4deed8d63606
SimHash	ac718b554d65

Groups

*

Rule

Path

Disallow

googlebot

Rule

Path

Disallow

/*sendto_form$

Disallow

/*folder_factories$

Back to top

Comments

Define access-restrictions for robots/spiders
http://www.robotstxt.org/wc/norobots.html
By default we allow robots to access all areas of our site
already accessible to anonymous users
Add Googlebot-specific syntax extension to exclude forms
that are repeated for each piece of content in the site
the wildcard is only supported by Googlebot
http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling

Back to top