cgon.co.uk
robots.txt

Robots Exclusion Standard data for cgon.co.uk

Resource Scan

Scan Details

Site Domain cgon.co.uk
Base Domain cgon.co.uk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-09-24T19:43:37+00:00
Next Scan 2025-12-23T19:43:37+00:00

Last Successful Scan

Scanned 2023-11-12T10:38:05+00:00
URL https://cgon.co.uk/robots.txt
Redirect https://www.cgon.co.uk/robots.txt
Redirect Domain www.cgon.co.uk
Redirect Base cgon.co.uk
Domain IPs 104.21.33.99, 172.67.161.170, 2606:4700:3031::6815:2163, 2606:4700:3033::ac43:a1aa
Redirect IPs 104.21.33.99, 172.67.161.170, 2606:4700:3031::6815:2163, 2606:4700:3033::ac43:a1aa
Response IP 104.21.33.99
Found Yes
Hash 33de5ffd56d15a0e29bfc35d6a2c09515127c40b2925b03fa70f3457edba3979
SimHash 6b468d50dedb

Groups

*

Rule Path
Disallow /go/*
Disallow /*?s=*
Disallow /views/loadmore.php*
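
The three Disallow paths in the * group use * as a wildcard. Under RFC 9309, * stands for any run of characters and a trailing $ anchors the end of the path; everything else is matched as a literal prefix. The following is a minimal Python sketch of that matching, not the scanner's own implementation. The helper names pattern_to_regex and is_disallowed are hypothetical, the sample paths are invented for illustration, and Allow rules and longest-match precedence are left out because this group contains only Disallow rules.

```python
import re

# Disallow patterns copied from the "*" group in this report.
DISALLOW = ["/go/*", "/*?s=*", "/views/loadmore.php*"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end.
    All other characters are matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches the path (with query string)."""
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


# Hypothetical sample paths, invented for illustration only.
for sample in ["/go/offer", "/blog/?s=shoes", "/views/loadmore.php?page=2", "/about"]:
    print(sample, is_disallowed(sample))
```

Note that /go/* and /views/loadmore.php* behave the same as /go/ and /views/loadmore.php would, since rules are prefix matches and a trailing * adds nothing.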

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
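
A crawler that matches a named group follows only that group, so bingbot is not bound by the Disallow rules of the * group; the only record that applies to it is the crawl delay. Crawl-delay is a non-standard extension that is not part of RFC 9309, and crawlers differ in whether they honour it. As a rough check, Python's standard urllib.robotparser can read both values. The fragment below is a hypothetical reconstruction of two groups from this report, since the original file text is not shown here, and the example URLs are invented.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of two groups from this report;
# the original robots.txt text is not reproduced in the scan data.
FRAGMENT = """\
User-agent: bingbot
Crawl-delay: 5

User-agent: yandex
Disallow: /
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# Delay (in seconds) requested of bingbot, or None if no delay is set.
bing_delay = parser.crawl_delay("bingbot")

# bingbot has no Allow/Disallow rules, so every path is permitted.
bing_allowed = parser.can_fetch("bingbot", "https://www.cgon.co.uk/go/example")

# yandex is blocked from the whole site by "Disallow: /".
yandex_allowed = parser.can_fetch("yandex", "https://www.cgon.co.uk/")

print(bing_delay, bing_allowed, yandex_allowed)
```

urllib.robotparser does not expand * wildcards inside paths, which is why the * group is left out of this fragment.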

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

domaincrawler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

httrack

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

brandverity

Rule Path
Disallow /

spiderling

Rule Path
Disallow /

buck

Rule Path
Disallow /

tigerbot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

ltx71 - (http://ltx71.com/)

Rule Path
Disallow /

linkfluence

Rule Path
Disallow /

bot@linkfluence.net

Rule Path
Disallow /

evc-batch

Rule Path
Disallow /

zgrab

Rule Path
Disallow /

adscanner

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cgon.co.uk/sitemap.xml