huffingtonpost.co.uk
robots.txt

Robots Exclusion Standard data for huffingtonpost.co.uk

Resource Scan

Scan Details

Site Domain huffingtonpost.co.uk
Base Domain huffingtonpost.co.uk
Scan Status Ok
Last Scan2024-04-25T14:28:04+00:00
Next Scan 2024-05-02T14:28:04+00:00

Last Scan

Scanned2024-04-25T14:28:04+00:00
URL https://huffingtonpost.co.uk/robots.txt
Redirect https://www.huffingtonpost.co.uk/robots.txt
Redirect Domain www.huffingtonpost.co.uk
Redirect Base huffingtonpost.co.uk
Domain IPs 108.156.133.120, 108.156.133.51, 108.156.133.68, 108.156.133.94
Redirect IPs 151.101.130.114, 151.101.194.114, 151.101.2.114, 151.101.66.114
Response IP 199.232.46.114
Found Yes
Hash c72a0dbfa261db01cec953e66ff30f401f81ea41d7c7737bd54575fb0687a433
SimHash 45101855c562

Groups

*

Rule Path
Disallow /_uac/adpage.html
Disallow /_huff_uac/adpage.html
Disallow /postid
Disallow /preview

Other Records

Field Value
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-google-video.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-google-news.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-index-daily-archives-v1.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sitemap-index-monthly-archives.xml
sitemap https://www.huffingtonpost.co.uk/sitemaps/sections.xml

Comments

  • Cambria robots
  • sitemaps