regent.ac.za
robots.txt

Robots Exclusion Standard data for regent.ac.za

Archived Snapshots

Resource Scan

Scan Details

Site Domain	regent.ac.za
Base Domain	regent.ac.za
Scan Status	Ok
Last Scan	2024-10-02T19:12:56+00:00
Next Scan	2024-11-01T19:12:56+00:00

Last Scan

Scanned	2024-10-02T19:12:56+00:00
URL	https://regent.ac.za/robots.txt
Domain IPs	141.193.213.20, 141.193.213.21
Response IP	141.193.213.20
Found	Yes
Hash	6319c4aab8ea8418c59fc3ed6955c6c925ae0fa8ab409971e7886ddc17678d43
SimHash	bb245c42643b

Groups

*

Rule	Path
Allow	/wp-admin/admin-ajax.php
Allow	/wp-content/uploads/
Disallow	/wp-content/plugins/
Disallow	/wp-admin/
Disallow	/readme.html
Disallow	/wp-includes/
Disallow	/author/
Disallow	/feed/
Disallow	/tag/
Disallow	/demo-pages/
Disallow	/?s=
Disallow	/?s%2F
Disallow	/search/
Disallow	/wp-content/cache/

Rule

Path

Allow

/wp-admin/admin-ajax.php

Allow

/wp-content/uploads/

Disallow

/wp-content/plugins/

Disallow

/wp-admin/

Disallow

/readme.html

Disallow

/wp-includes/

Disallow

/author/

Disallow

/feed/

Disallow

/tag/

Disallow

/demo-pages/

Disallow

/?s=

Disallow

/?s%2F

Disallow

/search/

Disallow

/wp-content/cache/

Back to top

Other Records

Field	Value
sitemap	https://regent.ac.za/sitemap_index.xml

Field

Value

sitemap

https://regent.ac.za/sitemap_index.xml

Back to top

Comments

This robots.txt file controls the crawling of URLs under https://regent.ac.za/

Back to top

Warnings

1 invalid line.

Back to top

regent.ac.zarobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

Warnings

regent.ac.za
robots.txt