theextract.co.uk
robots.txt

Robots Exclusion Standard data for theextract.co.uk

Resource Scan

Scan Details

Site Domain theextract.co.uk
Base Domain theextract.co.uk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2025-12-01T22:48:45+00:00
Next Scan 2026-03-01T22:48:45+00:00
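
The gap between the Last Scan and Next Scan timestamps above can be checked with a few lines of Python. This is only an arithmetic sketch over the two values shown. The report does not state a rescan policy, so the variable names and any reading of the result as a fixed schedule are assumptions.

```python
from datetime import datetime

# Timestamps copied from the Scan Details record above.
last_scan = datetime.fromisoformat("2025-12-01T22:48:45+00:00")
next_scan = datetime.fromisoformat("2026-03-01T22:48:45+00:00")

# Difference between the two timezone-aware timestamps.
interval = next_scan - last_scan
print(interval.days)  # → 90
```

For this record the two timestamps are exactly 90 days apart.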

Last Successful Scan

Scanned 2025-04-13T22:32:58+00:00
URL https://theextract.co.uk/robots.txt
Redirect https://www.theextract.co.uk/robots.txt
Redirect Domain www.theextract.co.uk
Redirect Base theextract.co.uk
Domain IPs 104.21.2.63, 172.67.128.214, 2606:4700:3032::6815:23f, 2606:4700:3036::ac43:80d6
Redirect IPs 104.21.2.63, 172.67.128.214, 2606:4700:3032::6815:23f, 2606:4700:3036::ac43:80d6
Response IP 172.67.128.214
Found Yes
Hash ee57e9f6aa2dd2a544a3cb729232339b3c5ea2dc037d9a6937efe3d02540b523
SimHash d8055d44ab12

Groups

*

Rule Path
Disallow /wp-admin/*
Disallow /wp-login.php
Disallow /wp-json/

Other Records

Field Value
sitemap https://www.theextract.co.uk/sitemap_index.xml
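
To illustrate what the `*` group above means for a crawler, here is a minimal sketch of path matching against the three Disallow rules. It assumes the wildcard semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end of the path). The helper names `rule_to_regex` and `is_disallowed` are hypothetical and not part of any library, and the sketch ignores Allow rules and longest-match precedence because the scanned file lists only Disallow rules.

```python
import re

# Rules transcribed from the "Groups" table above (user-agent *).
DISALLOW_RULES = ["/wp-admin/*", "/wp-login.php", "/wp-json/"]

def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))

def is_disallowed(path: str, rules=DISALLOW_RULES) -> bool:
    """Return True if any Disallow rule matches the URL path as a prefix pattern."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

for path in ["/wp-admin/options.php", "/wp-login.php", "/wp-json/wp/v2/posts", "/"]:
    print(path, is_disallowed(path))
```

A hand-written matcher is used here instead of Python's `urllib.robotparser` because that module compares rule paths as plain prefixes and does not expand `*`, so it would not treat `/wp-admin/*` as a wildcard rule.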