thrivehomes.org.uk
robots.txt
Robots Exclusion Standard data for thrivehomes.org.uk
Resource Scan
Scan Details
Site Domain | thrivehomes.org.uk |
Base Domain | thrivehomes.org.uk |
Scan Status | Ok |
Last Scan | 2024-09-05T08:29:09+00:00 |
Next Scan | 2024-10-05T08:29:09+00:00 |
Last Scan
Scanned | 2024-09-05T08:29:09+00:00 |
URL | https://thrivehomes.org.uk/robots.txt |
Redirect | http://www.thrivehomes.org.uk/robots.txt |
Redirect Domain | www.thrivehomes.org.uk |
Redirect Base | thrivehomes.org.uk |
Domain IPs | 51.104.52.77 |
Redirect IPs | 51.104.52.77 |
Response IP | 51.104.52.77 |
Found | Yes |
Hash | 6d0a87ac53eac874e8d77732622d269305f07bbab19b9f6e1e1ca160d469d28c |
SimHash | 730ed826cf14 |
Groups
*
Rule | Path |
---|---|
Disallow | /aspnet_client/ |
Disallow | /bin/ |
Disallow | /config/ |
Disallow | /data/ |
Disallow | /install/ |
Disallow | /masterpages/ |
Disallow | /python/ |
Disallow | /umbraco/ |
Disallow | /umbraco_client/ |
Disallow | /usercontrols/ |
Disallow | /xslt/ |
Disallow | /app_plugins/ |
Disallow | /usync/ |
Other Records
Field | Value |
---|---|
sitemap | http://www.thrivehomes.org.uk/sitemap.xml |