pure.qub.ac.uk
robots.txt

Robots Exclusion Standard data for pure.qub.ac.uk

Archived Snapshots

Resource Scan

Scan Details

Site Domain	pure.qub.ac.uk
Base Domain	qub.ac.uk
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2025-03-25T13:52:09+00:00
Next Scan	2025-05-24T13:52:09+00:00

Last Successful Scan

Scanned	2025-01-02T13:40:49+00:00
URL	https://pure.qub.ac.uk/robots.txt
Domain IPs	104.18.39.240, 172.64.148.16
Response IP	172.64.148.16
Found	Yes
Hash	544839b2d1176cd013882d7613d24e9cbcd032337aade7fe1e47ef105a1f3a32
SimHash	e73d5870a133

Groups

*

Rule	Path
Disallow	/?format=rss
Disallow	/?export=xls

Rule

Path

Disallow

/*?*format=rss

Disallow

/*?*export=xls

Other Records

Field	Value
crawl-delay	5

Field

Value

crawl-delay

5

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

chatgpt-user

Rule	Path
Disallow	/

Rule

Path

Disallow

/

google-extended

Rule	Path
Disallow	/

Rule

Path

Disallow

/

Back to top

Other Records

Field	Value
sitemap	https://pure.qub.ac.uk/sitemap.xml

Field

Value

sitemap

https://pure.qub.ac.uk/sitemap.xml

Back to top

pure.qub.ac.ukrobots.txt

Resource Scan

Scan Details

Last Successful Scan

Groups

*

Other Records

gptbot

chatgpt-user

google-extended

Other Records

pure.qub.ac.uk
robots.txt