cris.maastrichtuniversity.nl
robots.txt

Robots Exclusion Standard data for cris.maastrichtuniversity.nl

Archived Snapshots

Resource Scan

Scan Details

Site Domain	cris.maastrichtuniversity.nl
Base Domain	maastrichtuniversity.nl
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-03-25T17:15:17+00:00
Next Scan	2025-05-24T17:15:17+00:00

Last Successful Scan

Scanned	2025-01-02T16:54:31+00:00
URL	https://cris.maastrichtuniversity.nl/robots.txt
Domain IPs	104.18.39.240, 172.64.148.16
Response IP	104.18.39.240
Found	Yes
Hash	b6dc6af86d68d71644ed05386e4652e0a513da5df89e88066695f70329a76ecd
SimHash	ed3c1870e133

Groups

*

Rule	Path
Disallow	/?format=rss
Disallow	/?export=xls

Rule

Path

Disallow

/*?*format=rss

Disallow

/*?*export=xls

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

5

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://cris.maastrichtuniversity.nl/sitemap.xml

Field

Value

sitemap

https://cris.maastrichtuniversity.nl/sitemap.xml

Back to top

cris.maastrichtuniversity.nlrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

gptbot

chatgpt-user

google-extended

Other Records

cris.maastrichtuniversity.nl
robots.txt