Robots Exclusion Standard data for cwp.nhs.uk

Resource Scan

Scan Details

Site Domain cwp.nhs.uk
Base Domain cwp.nhs.uk
Scan Status Ok
Last Scan 2024-10-30T09:59:46+00:00
Next Scan 2024-11-29T09:59:46+00:00
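
The two timestamps above are exactly 30 days apart, which suggests (but does not confirm) a 30-day rescan interval. A minimal check of that arithmetic, with the variable names chosen here for illustration only:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details above.
last_scan = datetime.fromisoformat("2024-10-30T09:59:46+00:00")
next_scan = datetime.fromisoformat("2024-11-29T09:59:46+00:00")

# The interval is inferred from these two values, not stated by the scanner.
interval = next_scan - last_scan
print(interval == timedelta(days=30))
```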

Last Scan

Scanned 2024-10-30T09:59:46+00:00
URL https://cwp.nhs.uk/robots.txt
Redirect https://www.cwp.nhs.uk/robots.txt
Redirect Domain www.cwp.nhs.uk
Redirect Base cwp.nhs.uk
Domain IPs 185.220.63.33
Redirect IPs 185.181.197.207, 2a02:21a8:0:3::cee:3744
Response IP 185.181.197.207
Found Yes
Hash b26a219fbc2286d06c97c75dde01da407e64fa0ae120abd59804d55b3222f649
SimHash 036bdfe2cf18

Groups

*

Rule Path
Disallow /ccm/*
Disallow /application/files
Disallow /application/files/*
Disallow /download_file
Disallow /download_file/*
Disallow /concrete
Disallow /demo
Disallow /demo/*
Allow /application/files/thumbnails
Allow /application/files/thumbnails/*
Allow /application/files/*.png
Allow /application/files/*.jpg

Other Records

Field Value
crawl-delay 10
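
The Allow rules in the `*` group overlap with the broader Disallow rules, so the outcome for a given path depends on how a crawler resolves the conflict. The sketch below assumes the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical, the example paths are invented, and individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group in this scan: (directive, path pattern).
RULES = [
    ("disallow", "/ccm/*"),
    ("disallow", "/application/files"),
    ("disallow", "/application/files/*"),
    ("disallow", "/download_file"),
    ("disallow", "/download_file/*"),
    ("disallow", "/concrete"),
    ("disallow", "/demo"),
    ("disallow", "/demo/*"),
    ("allow", "/application/files/thumbnails"),
    ("allow", "/application/files/thumbnails/*"),
    ("allow", "/application/files/*.png"),
    ("allow", "/application/files/*.jpg"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the end.
    Without "$", the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best = None  # (pattern length, is_allow); tuples compare length first
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path matched by no rule is allowed by default.
    return True if best is None else best[1]

# Hypothetical paths, chosen only to exercise the rules.
for path in ["/application/files/1234/report.pdf",
             "/application/files/1234/logo.png",
             "/application/files/thumbnails/small/a.gif",
             "/about-us"]:
    print(path, is_allowed(path))
```

Under these assumptions, files below /application/files are blocked unless they sit under thumbnails or end in .png or .jpg, which is consistent with the "Allow access to thumbnails" comment in the file.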

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

web downloader

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

offline explorer pro

Rule Path
Disallow /

httrack website copier

Rule Path
Disallow /

offline commander

Rule Path
Disallow /

leech

Rule Path
Disallow /

websnake

Rule Path
Disallow /

blackwidow

Rule Path
Disallow /

http weazel

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

linkedinbot

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

safednsbot (https://www.safedns.com/searchbot)

Rule Path
Disallow /

exabot

Rule Path
Disallow /

pingdom bot

Rule Path
Disallow /

adidxbot

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

applewebkit

Rule Path
Disallow /

adsbot-google

Rule Path
Disallow /

googlebot-news

Rule Path
Disallow /

baidu spider 2.0

Rule Path
Disallow /

semrush crawler 2.0

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /
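
Each named group above applies only to a crawler that identifies itself with that name; any other crawler falls back to the `*` group. A minimal sketch of that selection step, assuming a case-insensitive exact match on the crawler's product token. The function name `select_group` is hypothetical, and only a subset of the scanned groups is included.

```python
# A subset of the group names listed in this scan; "*" is the catch-all group.
GROUPS = {"*", "teleport", "teleportpro", "ahrefsbot", "mj12bot",
          "semrushbot", "petalbot"}

def select_group(product_token, groups=GROUPS):
    """Pick the group a crawler would obey, given its product token."""
    token = product_token.lower()  # matching is case-insensitive
    return token if token in groups else "*"

print(select_group("AhrefsBot"))  # named group, which carries "Disallow /"
print(select_group("Googlebot"))  # no group of its own in this subset
```

A crawler that lands in a named group is governed by that group's single `Disallow /` rule, while one that falls back to `*` gets the path rules and the crawl-delay record listed first.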

Comments

  • Allow access to thumbnails
  • This list is compiled by Techie Zone part of Qlogix Network.