journal.cjgh.org
robots.txt

Robots Exclusion Standard data for journal.cjgh.org

Resource Scan

Scan Details

Site Domain journal.cjgh.org
Base Domain cjgh.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2026-01-12T19:44:32+00:00
Next Scan 2026-03-13T19:44:32+00:00

Last Successful Scan

Scanned2025-10-22T16:16:26+00:00
URL https://journal.cjgh.org/robots.txt
Redirect https://cjgh.org/robots.txt
Redirect Domain cjgh.org
Redirect Base cjgh.org
Domain IPs 2001:4b98:e01::38, 217.70.184.56
Redirect IPs 34.147.4.31
Response IP 34.147.4.31
Found Yes
Hash b19b5d3e7340e8abfe53534bec81c5d325e8551d6bdbc1e72a544f19416b410c
SimHash 481dca40e5d3

Groups

googlebot

Rule Path
Disallow /print/*
Allow /

bingbot

Rule Path
Disallow /print/*
Allow /

duckduckbot

Rule Path
Disallow /print/*
Allow /

applebot

Rule Path
Disallow /print/*
Allow /

*

Rule Path
Disallow /

Other Records

Field Value
sitemap undefined/sitemap.xml

Comments

  • Googlebot
  • Bingbot
  • DuckDuckBot
  • Applebot
  • All other bots