ugent.be
robots.txt

Robots Exclusion Standard data for ugent.be

Resource Scan

Scan Details

Site Domain	ugent.be
Base Domain	ugent.be
Scan Status	Ok
Last Scan	2024-10-24T13:04:38+00:00
Next Scan	2024-11-23T13:04:38+00:00
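
The two timestamps above are exactly 30 days apart. A minimal check of that arithmetic follows; that the scanner uses a fixed 30-day rescan interval is an assumption inferred from this one record, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-10-24T13:04:38+00:00")
next_scan = datetime.fromisoformat("2024-11-23T13:04:38+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan
print(interval.days)  # 30
```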

Last Scan

Scanned	2024-10-24T13:04:38+00:00
URL	https://ugent.be/robots.txt
Redirect	https://www.ugent.be/robots.txt
Redirect Domain	www.ugent.be
Redirect Base	ugent.be
Domain IPs	157.193.43.50
Redirect IPs	157.193.43.50
Response IP	157.193.43.50
Found	Yes
Hash	dac6a7fa5ccd9fb3bc612249af375bf7783533b900ca9441cce02e3a15f3accd
SimHash	ad71ab554d61

Groups

*

Rule	Path
Disallow	/intranet
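
As a rough illustration of how a crawler other than Googlebot might apply this group, the sketch below feeds the rule to Python's standard urllib.robotparser. The two robots.txt lines are reconstructed from the table above rather than copied from the live file, and "ExampleBot" and the test URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group above (not the verbatim file).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /intranet",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# "ExampleBot" is a made-up user agent that falls back to the "*" group.
blocked = parser.can_fetch("ExampleBot", "https://www.ugent.be/intranet/page")
allowed = parser.can_fetch("ExampleBot", "https://www.ugent.be/en")
print(blocked, allowed)  # False True
```

Disallow rules are prefix matches, so under this rule any path that merely starts with /intranet is excluded as well.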

googlebot

Rule	Path
Disallow	/*?
Disallow	/*atct_album_view$
Disallow	/*folder_factories$
Disallow	/*folder_summary_view$
Disallow	/*login_form$
Disallow	/*mail_password_form$
Disallow	/%40%40search
Disallow	/*search_rss$
Disallow	/*sendto_form$
Disallow	/*summary_view$
Disallow	/*thumbnail_view$
Disallow	/*view$
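
The googlebot rules rely on two pattern extensions: "*" matches any run of characters and a trailing "$" anchors the pattern to the end of the path. The sketch below is one simplified way to evaluate them. The helper names pattern_to_regex and is_disallowed are hypothetical, the sample paths are invented, and percent-encoding normalization and Allow precedence are deliberately not modeled.

```python
import re

# Disallow patterns copied from the googlebot group above.
GOOGLEBOT_DISALLOW = [
    "/*?", "/*atct_album_view$", "/*folder_factories$",
    "/*folder_summary_view$", "/*login_form$", "/*mail_password_form$",
    "/%40%40search", "/*search_rss$", "/*sendto_form$",
    "/*summary_view$", "/*thumbnail_view$", "/*view$",
]

def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one wildcard pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape the literal pieces and join them with ".*" in place of each "*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (with query string)."""
    return any(pattern_to_regex(p).match(path) for p in GOOGLEBOT_DISALLOW)

print(is_disallowed("/en/news?page=2"))  # True, matched by /*?
print(is_disallowed("/en/login_form"))   # True, matched by /*login_form$
print(is_disallowed("/en/research"))     # False
print(is_disallowed("/en/overview"))     # True, matched by /*view$
```

Under this reading, /*view$ already covers the other patterns ending in view$, and it also excludes any path that simply ends in "view". Python's urllib.robotparser does plain prefix matching and does not interpret these wildcards, which is why a separate matcher is sketched here.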

Other Records

Field	Value
sitemap	https://www.ugent.be/sitemap.xml.gz
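
A sitemap record like this one can be read with the standard library as well. The sketch below assumes Python 3.8 or later (where RobotFileParser.site_maps() is available) and uses a line reconstructed from the table above rather than the verbatim file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table (not the verbatim file).
ROBOTS_LINES = [
    "Sitemap: https://www.ugent.be/sitemap.xml.gz",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns the listed sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
print(sitemaps)  # ['https://www.ugent.be/sitemap.xml.gz']
```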

Comments

Define access-restrictions for robots/spiders
http://www.robotstxt.org/wc/norobots.html
By default we allow robots to access all areas of our site
already accessible to anonymous users
Add Googlebot-specific syntax extension to exclude forms
that are repeated for each piece of content in the site
the wildcard is only supported by Googlebot
http://www.google.com/support/webmasters/bin/answer.py?answer=40367&ctx=sibling
