newnorthacademy.com
robots.txt

Robots Exclusion Standard data for newnorthacademy.com

Resource Scan

Scan Details

Site Domain newnorthacademy.com
Base Domain newnorthacademy.com
Scan Status Ok
Last Scan2025-10-23T07:24:14+00:00
Next Scan 2025-11-06T07:24:14+00:00

Last Scan

Scanned2025-10-23T07:24:14+00:00
URL https://newnorthacademy.com/robots.txt
Redirect https://www.newnorthacademy.com/robots.txt
Redirect Domain www.newnorthacademy.com
Redirect Base newnorthacademy.com
Domain IPs 109.228.48.205
Redirect IPs 109.228.48.205
Response IP 109.228.48.205
Found Yes
Hash 9ab35f65826590041958ac84e312fa03f2aff47d6e4fea5efc28338ee02ff939
SimHash 79155c13cf91

Groups

*

Rule Path
Disallow /admin/
Disallow /bin/
Disallow /Connections/
Allow /i/
Disallow /inc/
Disallow /docs/
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.xls$
Disallow /*.docx$
Allow /inc/gallery/
Allow /i/photos/Gallery/

Other Records

Field Value
sitemap https://newnorthacademy.greenhousecms.co.uk/sitemap.xml