cgccomics.com
robots.txt

Robots Exclusion Standard data for cgccomics.com

Resource Scan

Scan Details

Site Domain cgccomics.com
Base Domain cgccomics.com
Scan Status Ok
Last Scan2024-11-09T15:32:28+00:00
Next Scan 2024-11-16T15:32:28+00:00

Last Scan

Scanned2024-11-09T15:32:28+00:00
URL https://cgccomics.com/robots.txt
Redirect https://www.cgccomics.com/robots.txt
Redirect Domain www.cgccomics.com
Redirect Base cgccomics.com
Domain IPs 104.26.6.42, 104.26.7.42, 172.67.68.48, 2606:4700:20::681a:62a, 2606:4700:20::681a:72a, 2606:4700:20::ac43:4430
Redirect IPs 104.26.6.42, 104.26.7.42, 172.67.68.48, 2606:4700:20::681a:62a, 2606:4700:20::681a:72a, 2606:4700:20::ac43:4430
Response IP 172.67.68.48
Found Yes
Hash dcacbc9207b2c1f79ccacbd4b644be29db5a9d2ec16d3d9ed2c33ac7c1d221d4
SimHash 9b23732e8a93

Groups

b2w/0.1
crawl
custo
discovery
emailcollector
emailsiphon
emailwolf
exabot
extractorpro
funwebproducts
htdig/3.1.5
larbin
npbot
teleport
titan
turnitinbot
webcopier
websauger
webstripper
webzip

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cgccomics.com/sitemap.xml