news.cancerresearchuk.org
robots.txt

Robots Exclusion Standard data for news.cancerresearchuk.org

Resource Scan

Scan Details

Site Domain news.cancerresearchuk.org
Base Domain cancerresearchuk.org
Scan Status Ok
Last Scan2024-11-03T14:07:59+00:00
Next Scan 2024-12-03T14:07:59+00:00

Last Scan

Scanned2024-11-03T14:07:59+00:00
URL https://news.cancerresearchuk.org/robots.txt
Domain IPs 141.193.213.20, 141.193.213.21
Response IP 141.193.213.21
Found Yes
Hash a143e49fe64eb492d849a701fa49bcc3bbab545ff48e4906f5235a627cee72d5
SimHash eb08ef123294

Groups

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://news.cancerresearchuk.org/sitemap.xml
sitemap https://news.cancerresearchuk.org/site.xml
sitemap https://news.cancerresearchuk.org/post.xml
sitemap https://news.cancerresearchuk.org/post_immersive-stories.xml
sitemap https://news.cancerresearchuk.org/post_web-story.xml
sitemap https://news.cancerresearchuk.org/post_google_news.xml
sitemap https://news.cancerresearchuk.org/page.xml
sitemap https://news.cancerresearchuk.org/taxonomy_category.xml
sitemap https://news.cancerresearchuk.org/taxonomy_series.xml
sitemap https://news.cancerresearchuk.org/author.xml