gcdtech.com
robots.txt

Robots Exclusion Standard data for gcdtech.com

Resource Scan

Scan Details

Site Domain gcdtech.com
Base Domain gcdtech.com
Scan Status Ok
Last Scan 2025-10-25T13:22:58+00:00
Next Scan 2025-11-24T13:22:58+00:00

Last Scan

Scanned 2025-10-25T13:22:58+00:00
URL https://gcdtech.com/robots.txt
Domain IPs 141.193.213.10, 141.193.213.11
Response IP 141.193.213.11
Found Yes
Hash af0425054f5e15a4cf51974147c1d93ed514d68978d732de4d0b0da868add3ea
SimHash badeca00948b

Groups

baiduspider

Rule Path
Disallow /

wesee

Rule Path
Disallow /

proximic

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandex bot

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou pic spider

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

oozbot

Rule Path
Disallow /

tagoobot

Rule Path
Disallow /

catchbot

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

saladspoon

Rule Path
Disallow /

temaseek.com

Rule Path
Disallow /

ahrefbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

trendkite-akashic-crawler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://gcdtech.com/sitemap_index.xml

Comments

  • Sitemap Index
  • Unwanted Crawlers

Warnings

  • 2 invalid lines.
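
The groups and sitemap record listed above can be checked programmatically. The following is a minimal sketch, not the site's actual file: it rebuilds an approximate robots.txt from the user-agent tokens and the sitemap URL shown in this report, then queries it with Python's standard urllib.robotparser. The helper names (build_robots_lines, make_parser) are hypothetical, the tokens are taken as the scanner displays them (lowercased), and the 2 invalid lines mentioned under Warnings are not reproduced because the report does not show them. site_maps() needs Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# User-agent tokens exactly as listed in the Groups section of the report.
# Assumption: the real file may use different casing or ordering.
BLOCKED_AGENTS = [
    "baiduspider", "wesee", "proximic", "mj12bot", "yandex", "yandex bot",
    "gurujibot", "sogou web spider", "sogou pic spider", "sogou", "sosospider",
    "baiduspider", "oozbot", "tagoobot", "catchbot", "jyxobot", "naverbot",
    "saladspoon", "temaseek.com", "ahrefbot", "istellabot", "dotbot",
    "blexbot", "exabot", "trendkite-akashic-crawler",
]
SITEMAP_URL = "https://gcdtech.com/sitemap_index.xml"


def build_robots_lines(agents, sitemap_url):
    """Rebuild approximate robots.txt lines: one 'Disallow: /' group per agent."""
    lines = []
    for agent in agents:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append(f"Sitemap: {sitemap_url}")
    return lines


def make_parser():
    """Return a RobotFileParser loaded with the reconstructed rules."""
    parser = RobotFileParser()
    parser.parse(build_robots_lines(BLOCKED_AGENTS, SITEMAP_URL))
    return parser


if __name__ == "__main__":
    parser = make_parser()
    # A listed crawler and one that is not listed in any group.
    for agent in ("MJ12bot", "Googlebot"):
        print(agent, parser.can_fetch(agent, "https://gcdtech.com/"))
    print(parser.site_maps())
```

Because the report shows no wildcard (*) group, any crawler not named in a group falls through to the parser's default, which is to allow the fetch. To test against the live file instead of the reconstruction, RobotFileParser.set_url() followed by read() can be pointed at the URL given under Last Scan.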