pcesupport.co.uk
robots.txt

Robots Exclusion Standard data for pcesupport.co.uk

Resource Scan

Scan Details

Site Domain pcesupport.co.uk
Base Domain pcesupport.co.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-02-15T21:15:56+00:00
Next Scan 2025-05-16T21:15:56+00:00

Last Successful Scan

Scanned2024-03-30T21:14:09+00:00
URL https://pcesupport.co.uk/robots.txt
Domain IPs 104.21.35.107, 172.67.217.188, 2606:4700:3031::6815:236b, 2606:4700:3031::ac43:d9bc
Response IP 172.67.217.188
Found Yes
Hash a918ccde2557e8bef1b0c7f1083f3411fbf87d4433d03789bbcabc9d71f55c70
SimHash 2c54f7025491

Groups

mediapartners-google

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/

googlebot-image

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

googlebot

Rule Path
Allow /

msnbot

Rule Path
Allow /

slurp

Rule Path
Allow /

teoma

Rule Path
Allow /

twiceler

Rule Path
Allow /

gigabot

Rule Path
Allow /

scrubby

Rule Path
Allow /

robozilla

Rule Path
Allow /

nutch

Rule Path
Allow /

ia_archiver

Rule Path
Allow /

baiduspider

Rule Path
Allow /

naverbot

Rule Path
Allow /

yeti

Rule Path
Allow /

yahoo-mmcrawler

Rule Path
Allow /

psbot

Rule Path
Allow /

asterias

Rule Path
Allow /

yahoo-blogs/v3.9

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.pcehelp.co.uk/sitemap.xml

Comments

  • disallow all files in these directories