ukhsa.blog.gov.uk
robots.txt

Robots Exclusion Standard data for ukhsa.blog.gov.uk

Resource Scan

Scan Details

Site Domain ukhsa.blog.gov.uk
Base Domain blog.gov.uk
Scan Status Ok
Last Scan 2025-03-03T09:47:02+00:00
Next Scan 2025-04-02T09:47:02+00:00

Last Scan

Scanned 2025-03-03T09:47:02+00:00
URL https://ukhsa.blog.gov.uk/robots.txt
Domain IPs 13.226.2.102, 13.226.2.106, 13.226.2.122, 13.226.2.78, 2600:9000:21f8:1400:16:702d:4080:93a1, 2600:9000:21f8:3400:16:702d:4080:93a1, 2600:9000:21f8:3600:16:702d:4080:93a1, 2600:9000:21f8:9200:16:702d:4080:93a1, 2600:9000:21f8:9800:16:702d:4080:93a1, 2600:9000:21f8:9a00:16:702d:4080:93a1, 2600:9000:21f8:a800:16:702d:4080:93a1, 2600:9000:21f8:ec00:16:702d:4080:93a1
Response IP 13.226.2.122
Found Yes
Hash 371f81cd959eb68b3dbc11d281538fb674b804d1196b5f9a2c67518f5d154bc9
SimHash f920fa425db2

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow */xmlrpc.php
Disallow */wp-*.php
Disallow */trackback/
Disallow *?wptheme=
Disallow *?comments=
Disallow *?replytocom
Disallow */comment-page-
Disallow *?s=

twitterbot

Rule Path
Disallow
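
To illustrate how the two groups above might be evaluated, here is a minimal Python sketch. It assumes the matching conventions described in RFC 9309: `*` matches any run of characters, the longest matching pattern wins, Allow wins a tie, and an empty Disallow path blocks nothing. The names `STAR_GROUP`, `TWITTERBOT_GROUP`, `pattern_to_regex` and `is_allowed` are hypothetical, and the test paths are invented examples, not URLs taken from the site. How any particular crawler or this scanner evaluates the rules is not stated in the report.

```python
import re

# Rules transcribed from the "*" group listed above.
STAR_GROUP = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "*/xmlrpc.php"),
    ("disallow", "*/wp-*.php"),
    ("disallow", "*/trackback/"),
    ("disallow", "*?wptheme="),
    ("disallow", "*?comments="),
    ("disallow", "*?replytocom"),
    ("disallow", "*/comment-page-"),
    ("disallow", "*?s="),
]

# The twitterbot group has a single Disallow with an empty path.
TWITTERBOT_GROUP = [("disallow", "")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` (including any query string) may be fetched.

    Assumed policy: longest matching pattern wins, Allow wins a tie,
    and a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    # Hypothetical paths chosen only to exercise the rules.
    for path in ["/wp-admin/options.php", "/wp-admin/admin-ajax.php",
                 "/?s=flu", "/some-post/comment-page-2/", "/some-post/"]:
        print(path, is_allowed(STAR_GROUP, path))
    print("/wp-admin/ for twitterbot:", is_allowed(TWITTERBOT_GROUP, "/wp-admin/"))
```

Under these assumptions, the Allow rule for `/wp-admin/admin-ajax.php` overrides the shorter `/wp-admin/` Disallow, and the twitterbot group permits every path because its only rule is empty. The matcher is hand-written because the rules rely on `*` wildcards, which a plain prefix comparison would not honour.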

Other Records

Field Value
sitemap https://ukhsa.blog.gov.uk/sitemap.xml

Comments

  • XML Sitemap & Google News version 5.4.9 - https://status301.net/wordpress-plugins/xml-sitemap-feed/

Warnings

  • 1 invalid line.