intra.sdedu.co.kr
robots.txt

Robots Exclusion Standard data for intra.sdedu.co.kr

Resource Scan

Scan Details

Site Domain intra.sdedu.co.kr
Base Domain sdedu.co.kr
Scan Status Ok
Last Scan 2025-05-02T18:01:21+00:00
Next Scan 2025-06-01T18:01:21+00:00

Last Scan

Scanned 2025-05-02T18:01:21+00:00
URL https://intra.sdedu.co.kr/robots.txt
Domain IPs 132.226.17.35
Response IP 132.226.17.35
Found Yes
Hash b0ac113d368b57ba8daa889f284f1ec8bf403b2f548fe8ae94ef1712bf8aa8a8
SimHash 49356a86df24
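
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so treat that as an assumption. A minimal sketch of how such a digest could be recomputed for comparison; the function name `body_digest` is hypothetical:

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scan's Hash field is SHA-256 over the raw response
    bytes. The report itself does not state the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


# Example with a placeholder body, not the real file contents.
sample = b"User-agent: *\nDisallow: /\n"
digest = body_digest(sample)
print(len(digest), digest)
```

If the recomputed digest of a fresh download differs from the recorded Hash, either the file changed since the scan or the assumption about the algorithm or the hashed bytes does not hold.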

Groups

yeti

Rule Path
Allow /

googlebot
gptbot
gpt-4
chatgpt-user
gpt
instructgpt
bingbot

Rule Path
Allow /
Disallow /*.mp4$
Disallow /*.cab$
Disallow /*.exe$
Disallow /*.hwp$
Disallow /*.pdf$
Disallow /*.doc$
Disallow /*.docx$
Disallow /*.xls$
Disallow /*.xlsx$
Disallow /data/
Disallow /doc/
Disallow /tmp
Disallow /temp
Disallow /private
Disallow /admin/
Disallow /ebook/
Disallow /inc/
Disallow /lms/
Disallow /manage/
Disallow /mp3/
Disallow /ncadmin/
Disallow /fs/
Disallow /checkout/
Disallow /board/
Disallow /bin/
Disallow /bemypage/
Disallow /cm/
Disallow /adm/
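
The group above mixes a blanket `Allow /` with more specific Disallow patterns, some using the `*` wildcard and the `$` end anchor. A rough sketch of how a crawler might evaluate them, assuming the longest-match rule described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). Only a subset of the listed rules is included, and the names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical:

```python
import re

# Subset of the rules listed for the googlebot/gptbot/bingbot group.
RULES = [
    ("allow", "/"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/data/"),
    ("disallow", "/tmp"),
    ("disallow", "/admin/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow.

    Paths that match no rule are allowed. Empty patterns are skipped,
    which is an assumption of this sketch.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if not pattern:
            continue
        # re.match anchors at the start of the path, as robots.txt does.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in ["/index.html", "/files/report.pdf", "/data/x", "/tmpfile"]:
    print(path, is_allowed(path))
```

Note that `/tmp`, `/temp` and `/private` have no trailing slash, so under prefix matching they would also cover paths such as `/tmpfile`. Python's standard `urllib.robotparser` does not interpret `*` or `$` as wildcards, which is why this sketch uses its own matcher.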

*

Rule Path
Disallow /
Disallow
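
The three groups above apply to different crawlers: yeti gets full access, the named googlebot/gptbot/bingbot group gets access with exceptions, and every other agent falls through to `*`. A small sketch of that selection step, assuming an exact, case-insensitive match on the crawler's product token with `*` as the fallback. The `GROUPS` table and `select_group` are hypothetical names, and the summaries are shorthand for the rule tables shown above:

```python
# User-agent tokens per group, copied from the scan; values are short summaries.
GROUPS = {
    ("yeti",): "allow all",
    (
        "googlebot", "gptbot", "gpt-4", "chatgpt-user",
        "gpt", "instructgpt", "bingbot",
    ): "allow / with listed disallows",
    ("*",): "disallow all",
}


def select_group(product_token):
    """Return the summary of the group that applies to a crawler token.

    Assumption: tokens are compared exactly and case-insensitively;
    crawlers not named in any group use the '*' group.
    """
    token = product_token.lower()
    fallback = None
    for agents, summary in GROUPS.items():
        if token in agents:
            return summary
        if "*" in agents:
            fallback = summary
    return fallback


for agent in ["Yeti", "GPTBot", "SomeOtherBot"]:
    print(agent, "->", select_group(agent))
```

Real crawlers may match tokens more loosely than this (for example by prefix), so the exact-match rule here is a simplification.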

Comments

  • Disallow:/*.gif$
  • Disallow:/*.png$
  • Disallow:/*.jpg$
  • Disallow:/*.js$