johnnewman.co.uk
robots.txt
Robots Exclusion Standard data for johnnewman.co.uk
Resource Scan
Scan Details
| Site Domain | johnnewman.co.uk |
| Base Domain | johnnewman.co.uk |
| Scan Status | Ok |
| Last Scan | 2025-11-23T21:54:36+00:00 |
| Next Scan | 2025-12-07T21:54:36+00:00 |
Last Scan
| Scanned | 2025-11-23T21:54:36+00:00 |
| URL | http://johnnewman.co.uk/robots.txt |
| Redirect | https://johnnewmanofficial.com/robots.txt |
| Redirect Domain | johnnewmanofficial.com |
| Redirect Base | johnnewmanofficial.com |
| Domain IPs | 162.255.119.136 |
| Redirect IPs | 162.0.217.140 |
| Response IP | 162.0.217.140 |
| Found | Yes |
| Hash | 9d63da9e7ecfdcf8b72acd20c57d73fbe0ba271ea20b341e62666c89720f87f9 |
| SimHash | 82cfcb7aadb3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /wp-admin/ |
| Disallow | /wp-json/ |
| Allow | /wp-admin/admin-ajax.php |
| Disallow | |
| Disallow | /?s=* |
| Disallow | /search/* |
| Disallow | /cdn-cgi/bm/cv/ |
| Disallow | /cdn-cgi/challenge-platform/ |
| Disallow | /cdn-cgi/l/email-protection/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://johnnewmanofficial.com/sitemap-index.xml |
Comments