qut.edu.au
robots.txt

Robots Exclusion Standard data for qut.edu.au

Resource Scan

Scan Details

Site Domain qut.edu.au
Base Domain qut.edu.au
Scan Status Ok
Last Scan2024-11-10T01:49:12+00:00
Next Scan 2024-11-24T01:49:12+00:00

Last Scan

Scanned2024-11-10T01:49:12+00:00
URL https://qut.edu.au/robots.txt
Redirect https://www.qut.edu.au/robots.txt
Redirect Domain www.qut.edu.au
Redirect Base qut.edu.au
Domain IPs 131.181.196.203
Redirect IPs 43.245.41.37
Response IP 43.245.41.37
Found Yes
Hash 1faaa0c853471312f18e8e4453f988bd3cad9dbc362a32838b9595e25f00b401
SimHash f98afb32cf52

Groups

*

Rule Path
Disallow /bluebox/
Disallow /staff/
Disallow /student/
Disallow /corpsite/
Disallow /orientation
Disallow /study/structures
Disallow /study/structures/*
Disallow /admin*
Disallow /study/unit/outline
Disallow /study/unit/outline/*
Disallow /_designs/nested-content-that-needs-a-url
Disallow /live-streaming
Disallow /live-streaming/*
Allow /live-streaming/graduation
Disallow /live-streaming/graduation*

qvnutch

Rule Path
Allow /staff/web-manual
Allow /staff/web-manual/
Disallow /bluebox/
Disallow /staff/
Disallow /student/
Disallow /corpsite/
Disallow /orientation
Disallow /_designs

sogou web spider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.qut.edu.au/sitemaps/index.xml
sitemap https://www.qut.edu.au/sitemaps/index.xml
sitemap https://www.qut.edu.au/sitemaps/index.xml