dpgplc.co.uk
robots.txt

Robots Exclusion Standard data for dpgplc.co.uk

Resource Scan

Scan Details

Site Domain dpgplc.co.uk
Base Domain dpgplc.co.uk
Scan Status Ok
Last Scan2025-06-25T18:41:23+00:00
Next Scan 2025-07-25T18:41:23+00:00

Last Scan

Scanned2025-06-25T18:41:23+00:00
URL https://dpgplc.co.uk/robots.txt
Redirect https://dpglearn.co.uk/robots.txt
Redirect Domain dpglearn.co.uk
Redirect Base dpglearn.co.uk
Domain IPs 20.90.134.17
Redirect IPs 104.26.4.55, 104.26.5.55, 172.67.75.147, 2606:4700:20::681a:437, 2606:4700:20::681a:537, 2606:4700:20::ac43:4b93
Response IP 172.67.75.147
Found Yes
Hash 68332da81c6420d9fe3638a9a0f9c36e159e3eeafaf12d7104fa30b10f9a0811
SimHash f10a68619200

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /
Disallow /aspnet_client/
Disallow /bin/
Disallow /config/
Disallow /data/
Disallow /install/
Disallow /macroScripts/
Disallow /masterpages/
Disallow /umbraco/
Disallow /backoffice/
Disallow /umbraco_client/
Disallow /usercontrols/
Disallow /xslt/
Disallow /contactform/
Disallow /Search/
Disallow /404/
Disallow /checkout/
Disallow /orderconfirmation/
Disallow /site-general-settings/
Disallow /landing-pages/*
Disallow /lp/*
Disallow /thank-you/
Disallow /thank-you-competition/
Disallow /thank-you-enquiry/

Other Records

Field Value
sitemap https://dpglearn.co.uk/site-map/