ojc.coj.go.th
robots.txt

Robots Exclusion Standard data for ojc.coj.go.th

Resource Scan

Scan Details

Site Domain ojc.coj.go.th
Base Domain coj.go.th
Scan Status Ok
Last Scan 2025-05-02T23:53:55+00:00
Next Scan 2025-06-01T23:53:55+00:00

Last Scan

Scanned 2025-05-02T23:53:55+00:00
URL https://ojc.coj.go.th/robots.txt
Response IP 202.139.195.38
Found Yes
Hash cb75578869af0526382b8dd0a3743baaada661bc445b894ae5af6b54d85a2d74
SimHash 2f157b1442fb

Groups

*

Rule Path
Disallow /admin/
Disallow /*.xls$
Disallow /plugins/
Disallow /th/search/
Disallow /*.pdf$
Disallow /dist/
Disallow /th/cms/
Disallow /cms/thnewsinfo.php
Disallow /th//cms/thnewsinfo.php
Disallow /th/webboard/
Disallow /webboard/
Disallow /system/
Disallow /home/
Disallow /eform/
Disallow /ads.txt
Disallow /cgi-bin/
Disallow /scripts/
Disallow /tmp/
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /wp-login.php
Disallow %5E.*%5C/wp-includes%5C/wlwmanifest.xml
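
Several paths in the * group use the wildcard forms `*` (any run of characters) and `$` (end of path), for example /*.xls$ and /*.pdf$. The sketch below shows one way such patterns could be evaluated against a request path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the `DISALLOW` list is only a sample of the rules above, and the group contains no Allow lines, so no precedence handling is attempted.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' pins the match to the
    end of the path. Everything else is matched literally, as a prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))

# A few Disallow paths copied from the '*' group (sample only).
DISALLOW = ["/admin/", "/*.xls$", "/*.pdf$", "/th/search/", "/cron.php"]

def is_disallowed(path: str) -> bool:
    """Return True if any sample Disallow pattern matches from the path start."""
    return any(rule_to_regex(p).match(path) for p in DISALLOW)
```

Under these assumptions, a path such as /files/report.pdf would be blocked by /*.pdf$, while /files/report.pdf?page=2 would not, because the `$` anchor requires the path to end in .pdf.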

baiduspider

Rule Path
Disallow /

msnbot-media

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexbot/3.0

Rule Path
Disallow /

yak/1.0

Rule Path
Disallow /

python-requests/2.27.1

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

dataforseobot/1.0

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

bingbot/2.0

Rule Path
Disallow /*.*

gptbot

Rule Path
Disallow /
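
The listing above contains two groups for bingbot/2.0, one with Disallow / and one with Disallow /*.*. RFC 9309 says that when several groups match the same user agent, their rules are combined into one group. The sketch below illustrates that merge on a subset of the scanned groups. The names `GROUPS`, `merge_groups`, and `rules_for` are hypothetical, and matching the full user-agent line case-insensitively (version suffix included) is a simplifying assumption; real crawlers differ in how they treat suffixes such as /2.0.

```python
from collections import defaultdict

# (user-agent line, Disallow paths) in file order; a subset of the scan above.
GROUPS = [
    ("*", ["/admin/", "/*.pdf$"]),
    ("bingbot/2.0", ["/"]),
    ("bingbot", ["/"]),
    ("bingbot/2.0", ["/*.*"]),
    ("gptbot", ["/"]),
]

def merge_groups(groups):
    """Combine rules of groups that share a user-agent line (case-insensitive)."""
    merged = defaultdict(list)
    for agent, paths in groups:
        key = agent.lower()
        for path in paths:
            if path not in merged[key]:  # keep file order, drop duplicates
                merged[key].append(path)
    return dict(merged)

def rules_for(agent, groups):
    """Return the merged rules for an agent, falling back to the '*' group."""
    merged = merge_groups(groups)
    return merged.get(agent.lower(), merged.get("*", []))
```

With this sample data, `rules_for("bingbot/2.0", GROUPS)` yields both / and /*.*, and an agent with no group of its own falls back to the * rules.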

Comments

  • Files
  • RH, 06.30.21: these are files likely bad bots are requesting
  • User-agent: Googlebot-Video
  • Disallow: /