narberth-and-whitland-today.co.uk
robots.txt

Robots Exclusion Standard data for narberth-and-whitland-today.co.uk

Resource Scan

Scan Details

Site Domain narberth-and-whitland-today.co.uk
Base Domain narberth-and-whitland-today.co.uk
Scan Status Ok
Last Scan2024-09-20T04:13:08+00:00
Next Scan 2024-09-27T04:13:08+00:00

Last Scan

Scanned2024-09-20T04:13:08+00:00
URL https://narberth-and-whitland-today.co.uk/robots.txt
Redirect https://www.narberth-and-whitland-today.co.uk/robots.txt
Redirect Domain www.narberth-and-whitland-today.co.uk
Redirect Base narberth-and-whitland-today.co.uk
Domain IPs 104.21.32.91, 172.67.185.80, 2606:4700:3035::6815:205b, 2606:4700:3037::ac43:b950
Redirect IPs 104.18.35.129, 172.64.152.127, 2606:4700:4400::6812:2381, 2606:4700:4400::ac40:987f
Response IP 172.64.152.127
Found Yes
Hash 11a42e78d85631e4939b43e6042f7a4247d68044a6ca0bbcd555fcdb87382955
SimHash a8085e22c402

Groups

*

Rule Path
Disallow /api/
Disallow /internal-api/
Disallow /*ILC-refresh

nutch

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.narberth-and-whitland-today.co.uk/sitemaps/googlenews
sitemap https://www.narberth-and-whitland-today.co.uk/sitemaps/sitemap.xml