jurnallugas.com
robots.txt

Robots Exclusion Standard data for jurnallugas.com

Resource Scan

Scan Details

Site Domain jurnallugas.com
Base Domain jurnallugas.com
Scan Status Ok
Last Scan 2026-02-10T06:14:36+00:00
Next Scan 2026-02-17T06:14:36+00:00

Last Scan

Scanned 2026-02-10T06:14:36+00:00
URL https://jurnallugas.com/robots.txt
Domain IPs 104.21.95.189, 172.67.147.24, 2606:4700:3031::ac43:9318, 2606:4700:3032::6815:5fbd
Response IP 172.67.147.24
Found Yes
Hash ddccf4d5299339eb20428da342dba220b2a241995372836993416ffe4810dee1
SimHash 6d104bf066b1

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/*.php$
Disallow /wp-content/themes/*.php$
Disallow /cgi-bin/
Disallow /xmlrpc.php
Disallow /*/feed/
Disallow /comments/
Disallow /*?replytocom
Disallow /private/
Disallow /temp/
Disallow /drafts/
Allow /
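
The `*` group mixes plain path prefixes with `*` wildcards and `$` end anchors. The following is a minimal sketch of how such patterns are commonly evaluated, assuming the longest-match semantics described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie). The helper names `pattern_to_regex` and `is_allowed` are hypothetical and not part of the scanned site or of any library.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/wp-admin/"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-content/plugins/*.php$"),
    ("disallow", "/wp-content/themes/*.php$"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/*/feed/"),
    ("disallow", "/comments/"),
    ("disallow", "/*?replytocom"),
    ("disallow", "/private/"),
    ("disallow", "/temp/"),
    ("disallow", "/drafts/"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed semantics: the longest matching pattern wins, Allow beats
    Disallow on equal length, and no match at all means allowed.
    """
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]
```

Under these assumptions, `/wp-content/plugins/foo/bar.php` is blocked by the `$`-anchored rule, while the same path with a query string appended falls through to `Allow /`, because the `$` anchor no longer matches the end of the URL.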

googlebot

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googleother

Rule Path
Allow /

google-extended

Rule Path
Allow /

bingbot

Rule Path
Allow /

bingpreview

Rule Path
Allow /

msnbot

Rule Path
Allow /

gptbot-microsoft

Rule Path
Allow /

perplexitybot

Rule Path
Allow /news/
Disallow /private/
Disallow /drafts/
Disallow /temp/
Disallow /wp-admin/
Disallow /wp-includes/

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

openai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

moreover

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

paqlebot

Rule Path
Disallow /
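
Each crawler obeys the group whose user-agent token matches its own name and falls back to the `*` group otherwise. The sketch below checks that behaviour with Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is only a partial reconstruction from the groups listed above; the real file's layout and group order are assumptions, and the wildcard rules are left out because this parser treats paths as plain prefixes. The `check` helper and the `SomeOtherBot` name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction of the scanned robots.txt (layout is an assumption).
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin/
Disallow: /private/
Allow: /

User-agent: googlebot
Allow: /

User-agent: perplexitybot
Allow: /news/
Disallow: /private/
Disallow: /drafts/
Disallow: /temp/
Disallow: /wp-admin/
Disallow: /wp-includes/

User-agent: gptbot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def check(agent, path):
    """Return True if `agent` may fetch `path` on jurnallugas.com."""
    return parser.can_fetch(agent, "https://jurnallugas.com" + path)


for agent, path in [
    ("Googlebot", "/wp-admin/"),
    ("GPTBot", "/news/story/"),
    ("PerplexityBot", "/news/story/"),
    ("PerplexityBot", "/drafts/x"),
    ("SomeOtherBot", "/wp-admin/"),
]:
    print(agent, path, check(agent, path))
```

Note that a path with no matching rule is allowed by default, so under this parser the `perplexitybot` group as listed blocks only the five disallowed directories; it does not by itself confine the crawler to `/news/`.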

Other Records

Field Value
sitemap https://jurnallugas.com/index.php/sitemap_index.xml
sitemap https://jurnallugas.com/index.php/news-sitemap.xml

Comments

  • robots.txt for JurnalLugas.com
  • Allow Google, Bing, Gemini, Copilot
  • Allow limited access for Perplexity (public news only)
  • General Settings
  • Allow Google Crawlers (Search, News, Ads, Images, Videos, Gemini)
  • Gemini (Google AI)
  • Allow Microsoft Bots (Bing Search & Copilot)
  • Microsoft Copilot AI
  • Allow Perplexity (only for public news content)
  • Block Specific Non-Google or Unsafe Bots
  • Sitemap Locations
  • End of robots.txt