iiw.kuleuven.be
robots.txt

Robots Exclusion Standard data for iiw.kuleuven.be

Archived Snapshots

Resource Scan

Scan Details

Site Domain	iiw.kuleuven.be
Base Domain	kuleuven.be
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Couldn't connect to server.
Last Scan	2025-09-04T04:06:57+00:00
Next Scan	2025-12-03T04:06:57+00:00

Last Successful Scan

Scanned	2025-01-15T03:57:22+00:00
URL	https://iiw.kuleuven.be/robots.txt
Domain IPs	134.58.64.62, 2a02:2c40:0:80::80:62
Response IP	134.58.64.62
Found	Yes
Hash	5557b5544a479c667084ea94e54526e06ae6d0251ed70d6f061644881a83bb2d
SimHash	ad51ab554d65

Groups

*

Rule	Path
Disallow

Rule

Path

Disallow

googlebot

Rule	Path
Disallow	/*?
Disallow	/*atct_album_view$
Disallow	/*folder_factories$
Disallow	/*folder_summary_view$
Disallow	/*login_form$
Disallow	/*mail_password_form$
Disallow	/%40%40search
Disallow	/*search_rss$
Disallow	/*sendto_form$
Disallow	/*summary_view$
Disallow	/*thumbnail_view$
Disallow	/*view$

Rule

Path

Disallow

/*?

Disallow

/*atct_album_view$

Disallow

/*folder_factories$

Disallow

/*folder_summary_view$

Disallow

/*login_form$

Disallow

/*mail_password_form$

Disallow

/%40%40search

Disallow

/*search_rss$

Disallow

/*sendto_form$

Disallow

/*summary_view$

Disallow

/*thumbnail_view$

Disallow

/*view$

Back to top

Other Records

Field	Value
sitemap	https://iiw.kuleuven.be/sitemap.xml.gz

Field

Value

sitemap

https://iiw.kuleuven.be/sitemap.xml.gz

Back to top

Comments

Define access-restrictions for robots/spiders
http://www.robotstxt.org/wc/norobots.html
By default we allow robots to access all areas of our site
already accessible to anonymous users
Add Googlebot-specific syntax extension to exclude forms
that are repeated for each piece of content in the site
the wildcard is only supported by Googlebot
http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling

Back to top

iiw.kuleuven.berobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

googlebot

Other Records

Comments

iiw.kuleuven.be
robots.txt