huffpost.co.uk
robots.txt

Robots Exclusion Standard data for huffpost.co.uk

Resource Scan

Scan Details

Site Domain huffpost.co.uk
Base Domain huffpost.co.uk
Scan Status Ok
Last Scan2024-11-02T19:26:46+00:00
Next Scan 2024-11-09T19:26:46+00:00

Last Scan

Scanned2024-11-02T19:26:46+00:00
URL https://huffpost.co.uk/robots.txt
Redirect https://www.huffingtonpost.co.uk/robots.txt
Redirect Domain www.huffingtonpost.co.uk
Redirect Base huffingtonpost.co.uk
Domain IPs 13.33.88.120, 13.33.88.47, 13.33.88.92, 13.33.88.94
Redirect IPs 151.101.130.114, 151.101.194.114, 151.101.2.114, 151.101.66.114
Response IP 199.232.46.114
Found Yes
Hash c72a0dbfa261db01cec953e66ff30f401f81ea41d7c7737bd54575fb0687a433
SimHash 45101855c562

Groups

*

Rule Path
Disallow /_uac/adpage.html
Disallow /_huff_uac/adpage.html
Disallow /postid
Disallow /preview

Other Records

Field Value
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-google-video.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-google-news.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-index-daily-archives-v1.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-index-monthly-archives.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sections.xml

Comments

  • Cambria robots
  • sitemaps