itpln.ac.id
robots.txt

Robots Exclusion Standard data for itpln.ac.id

Resource Scan

Scan Details

Site Domain itpln.ac.id
Base Domain itpln.ac.id
Scan Status Ok
Last Scan2025-10-03T19:24:19+00:00
Next Scan 2025-10-10T19:24:19+00:00

Last Scan

Scanned2025-10-03T19:24:19+00:00
URL https://itpln.ac.id/robots.txt
Domain IPs 104.26.10.116, 104.26.11.116, 172.67.69.143, 2606:4700:20::681a:a74, 2606:4700:20::681a:b74, 2606:4700:20::ac43:458f
Response IP 172.67.69.143
Found Yes
Hash abd8e2f97ae36de974de841a29a91518b28b07b6ed34efbcbda2466806d7cffe
SimHash 3bae69c1a733

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /readme.html
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Allow /wp-content/uploads/

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

mazbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

Other Records

Field Value
sitemap https://itpln.ac.id/sitemap_index.xml

Comments

  • Allow crawling uploads media (gambar, video, dokumen)
  • Blok crawler AI / data-scraping tertentu
  • Tautkan sitemap agar mudah ditemukan mesin pencari

Warnings

  • 1 invalid line.