detectingshropshire.com
robots.txt

Robots Exclusion Standard data for detectingshropshire.com

Resource Scan

Scan Details

Site Domain detectingshropshire.com
Base Domain detectingshropshire.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-07-01T02:20:20+00:00
Next Scan 2025-09-29T02:20:20+00:00

Last Successful Scan

Scanned2024-02-13T23:11:08+00:00
URL https://detectingshropshire.com/robots.txt
Domain IPs 104.21.52.96, 172.67.197.217, 2606:4700:3030::ac43:c5d9, 2606:4700:3035::6815:3460
Response IP 104.21.52.96
Found Yes
Hash 2f418ccee56d527010742bbd8ba6705fe6e6c9013aa63c8e0af12b6b1093b271
SimHash 1830dd52ab32

Groups

*

Rule Path
Allow /wp-admin/admin-ajax.php
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /readme.html
Disallow /license.txt
Disallow /author
Disallow /archive
Disallow *?attachment_id=
Disallow /wp-json/
Disallow /?rest_route=
Disallow /login/
Disallow /my-account/

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

xenu

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://detectingshropshire.com/sitemap.xml

Comments

  • Block archive.org bots