en.tgju.org
robots.txt

Robots Exclusion Standard data for en.tgju.org

Resource Scan

Scan Details

Site Domain en.tgju.org
Base Domain tgju.org
Scan Status Ok
Last Scan2025-12-06T05:56:26+00:00
Next Scan 2026-01-05T05:56:26+00:00

Last Scan

Scanned2025-12-06T05:56:26+00:00
URL https://en.tgju.org/robots.txt
Redirect https://www.tgju.org/robots.txt
Redirect Domain www.tgju.org
Redirect Base tgju.org
Domain IPs 104.26.14.85, 104.26.15.85, 172.67.73.163, 2606:4700:20::681a:e55, 2606:4700:20::681a:f55, 2606:4700:20::ac43:49a3
Redirect IPs 104.26.14.85, 104.26.15.85, 172.67.73.163, 2606:4700:20::681a:e55, 2606:4700:20::681a:f55, 2606:4700:20::ac43:49a3
Response IP 104.26.14.85
Found Yes
Hash c6e1ef6e4e56dcd7e169cdcff135c14dd0b30b0e2282200af4e876e81c86b1af
SimHash 08358158c0c3

Groups

*
googlebot

Rule Path
Allow /*.js$
Allow /*.css$

*

Rule Path
Disallow
Disallow /cgi-bin/
Disallow /cdn-cgi/

*

Rule Path
Disallow /events/

*

Rule Path
Disallow /channel/

*

Rule Path
Disallow /shop/

*

Rule Path
Disallow /product/

*

Rule Path
Disallow /market-info/

*

Rule Path
Disallow /market-shop/

*

Rule Path
Disallow /en/

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.tgju.org/sitemapindex.xml
sitemap https://www.tgju.org/sitemap.xml
sitemap https://www.tgju.org/sitemap2.xml
sitemap https://www.tgju.org/sitemap3.xml
sitemap https://www.tgju.org/sitemap4.xml
sitemap https://www.tgju.org/sitemap5.xml
sitemap https://www.tgju.org/sitemap6.xml

Comments

  • Allow all files ending with these extensions