cgccomics.uk
robots.txt

Robots Exclusion Standard data for cgccomics.uk

Resource Scan

Scan Details

Site Domain cgccomics.uk
Base Domain cgccomics.uk
Scan Status Ok
Last Scan2025-12-16T21:21:10+00:00
Next Scan 2025-12-23T21:21:10+00:00

Last Scan

Scanned2025-12-16T21:21:10+00:00
URL https://cgccomics.uk/robots.txt
Redirect https://www.cgccomics.uk/robots.txt
Redirect Domain www.cgccomics.uk
Redirect Base cgccomics.uk
Domain IPs 104.21.32.124, 172.67.151.246, 2606:4700:3033::ac43:97f6, 2606:4700:3035::6815:207c
Redirect IPs 104.21.32.124, 172.67.151.246, 2606:4700:3033::ac43:97f6, 2606:4700:3035::6815:207c
Response IP 172.67.151.246
Found Yes
Hash dcacbc9207b2c1f79ccacbd4b644be29db5a9d2ec16d3d9ed2c33ac7c1d221d4
SimHash 9b23732e8a93

Groups

b2w/0.1
crawl
custo
discovery
emailcollector
emailsiphon
emailwolf
exabot
extractorpro
funwebproducts
htdig/3.1.5
larbin
npbot
teleport
titan
turnitinbot
webcopier
websauger
webstripper
webzip

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cgccomics.com/sitemap.xml