masseypress.ac.nz
robots.txt

Robots Exclusion Standard data for masseypress.ac.nz

Archived Snapshots

Resource Scan

Scan Details

Site Domain	masseypress.ac.nz
Base Domain	masseypress.ac.nz
Scan Status	Ok
Last Scan	2024-09-24T14:28:00+00:00
Next Scan	2024-10-24T14:28:00+00:00

Last Scan

Scanned	2024-09-24T14:28:00+00:00
URL	https://masseypress.ac.nz/robots.txt
Redirect	https://www.masseypress.ac.nz/robots.txt
Redirect Domain	www.masseypress.ac.nz
Redirect Base	masseypress.ac.nz
Domain IPs	54.206.41.86
Redirect IPs	54.206.41.86
Response IP	54.206.41.86
Found	Yes
Hash	a106ccbd067928c7cba7f265a7aadd8fe8334f266908ca04f4707a5a182f4b51
SimHash	005c9842ec01

Groups

adbeat_bot
ahrefsbot
aitcsrobot
alexibot
blexbot
cliqzbot
dotbot
exabot
expo9
huaweisymantecspider
influencebot
ltx71 - (http://ltx71.com/)
maxpointcrawler
mj12bot
offline explorer
rogerbot
semrushbot
semrushbot-sa
sitesnagger
surveybot
teleportpro
webcopier
webreaper
webstripper
webzip
xaldon_webspider
xenuâs
xenuâs link sleuth 1.1c

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/App_Plugins/
Disallow	/App_Code/
Disallow	/App_Data/
Disallow	/bin/
Disallow	/config/
Disallow	/umbraco/
Disallow	/Views/
Disallow	/uSync/

Rule

Path

Disallow

/App_Plugins/

Disallow

/App_Code/

Disallow

/App_Data/

Disallow

/bin/

Disallow

/config/

Disallow

/umbraco/

Disallow

/Views/

Disallow

/uSync/

Back to top

Other Records

Field	Value
sitemap	https://masseypress.ac.nz/sitemap/

Field

Value

sitemap

https://masseypress.ac.nz/sitemap/

Back to top

Comments

Exclude some crawlers

Back to top

masseypress.ac.nzrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

masseypress.ac.nz
robots.txt