regent.ac.za
robots.txt

Robots Exclusion Standard data for regent.ac.za

Resource Scan

Scan Details

Site Domain regent.ac.za
Base Domain regent.ac.za
Scan Status Ok
Last Scan 2024-10-02T19:12:56+00:00
Next Scan 2024-11-01T19:12:56+00:00

Last Scan

Scanned 2024-10-02T19:12:56+00:00
URL https://regent.ac.za/robots.txt
Domain IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.20
Found Yes
Hash 6319c4aab8ea8418c59fc3ed6955c6c925ae0fa8ab409971e7886ddc17678d43
SimHash bb245c42643b

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Allow /wp-content/uploads/
Disallow /wp-content/plugins/
Disallow /wp-admin/
Disallow /readme.html
Disallow /wp-includes/
Disallow /author/
Disallow /feed/
Disallow /tag/
Disallow /demo-pages/
Disallow /?s=
Disallow /?s%2F
Disallow /search/
Disallow /wp-content/cache/

Other Records

Field Value
sitemap https://regent.ac.za/sitemap_index.xml
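
As an illustration of how the rules above apply to individual URLs, the following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT text is a reconstruction from the table in this report, not the file as served (the original also contains one invalid line that is not shown here), and the names build_parser and is_allowed are hypothetical helpers introduced for this example. It also assumes the table lists the rules in the same order as the file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Groups" and "Other Records" tables above.
# Assumption: this is NOT the verbatim file; the invalid line is unknown.
ROBOTS_TXT = """\
User-agent: *
Allow: /wp-admin/admin-ajax.php
Allow: /wp-content/uploads/
Disallow: /wp-content/plugins/
Disallow: /wp-admin/
Disallow: /readme.html
Disallow: /wp-includes/
Disallow: /author/
Disallow: /feed/
Disallow: /tag/
Disallow: /demo-pages/
Disallow: /?s=
Disallow: /?s%2F
Disallow: /search/
Disallow: /wp-content/cache/

Sitemap: https://regent.ac.za/sitemap_index.xml
"""

BASE_URL = "https://regent.ac.za"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


PARSER = build_parser(ROBOTS_TXT)


def is_allowed(path: str, user_agent: str = "*") -> bool:
    """Return True if user_agent may fetch BASE_URL + path under these rules."""
    return PARSER.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    for path in (
        "/wp-admin/admin-ajax.php",
        "/wp-admin/options.php",
        "/wp-content/uploads/2024/logo.png",
        "/wp-content/plugins/example/script.js",
        "/about/",
    ):
        print(path, "->", "allowed" if is_allowed(path) else "disallowed")
    # site_maps() is available in Python 3.8 and later.
    print("sitemaps:", PARSER.site_maps())
```

Note that urllib.robotparser applies the first matching rule in file order, whereas RFC 9309 specifies that the longest matching path wins. For this rule set both approaches agree on /wp-admin/admin-ajax.php, because the Allow line is both listed first and longer than Disallow /wp-admin/.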

Comments

  • This robots.txt file controls the crawling of URLs under https://regent.ac.za/

Warnings

  • 1 invalid line.