jjal.kr
robots.txt

Robots Exclusion Standard data for jjal.kr

Resource Scan

Scan Details

Site Domain jjal.kr
Base Domain jjal.kr
Scan Status Ok
Last Scan2025-11-16T15:33:39+00:00
Next Scan 2025-11-23T15:33:39+00:00

Last Scan

Scanned2025-11-16T15:33:39+00:00
URL http://jjal.kr/robots.txt
Domain IPs 211.233.50.245
Response IP 211.233.50.245
Found Yes
Hash 02195b7a3d3647a7d33e98547ff5934fb040f8a68de640de5f2214e73c923552
SimHash 0514cba05771

Groups

baiduspider

Rule Path
Disallow *

r6_commentreader

Rule Path
Disallow *

sistrix crawler

Rule Path
Disallow *

exabot

Rule Path
Disallow *

flipboardproxy

Rule Path
Disallow *

q_blogbot

Rule Path
Disallow *

qt_blogbot

Rule Path
Disallow *

yandexbot

Rule Path
Disallow *

nkbot

Rule Path
Disallow *

mediapartners-google

Rule Path
Allow /
Disallow /wp/wp-contents/upload/archives/

googlebot

Rule Path
Allow /
Disallow /wp/wp-contents/upload/archives/

googlebot-mobile

Rule Path
Allow /
Disallow /wp/wp-contents/upload/archives/

Comments

  • for Google GDN (Not Crawl)
  • for Google bot
  • for Google mobile bot

Warnings

  • 1 invalid line.