republika.co.id
robots.txt

Robots Exclusion Standard data for republika.co.id

Resource Scan

Scan Details

Site Domain republika.co.id
Base Domain republika.co.id
Scan Status Ok
Last Scan2024-09-24T06:19:06+00:00
Next Scan 2024-10-01T06:19:06+00:00

Last Scan

Scanned2024-09-24T06:19:06+00:00
URL https://republika.co.id/robots.txt
Domain IPs 104.18.8.234, 104.18.9.234, 2606:4700::6812:8ea, 2606:4700::6812:9ea
Response IP 104.18.9.234
Found Yes
Hash 1f24c2e28df419d6e8ca7c867bfb2443ce747a7ac4ffb865e7df1bde29db6008
SimHash 15117c78eef3

Groups

*

Rule Path
Disallow /search/?q=
Disallow /komentar/*
Disallow /copy/*
Disallow /cron/*
Disallow /curl/*
Disallow /feed/*
Disallow /tag/*
Disallow /post/*
Disallow /ajax/*
Disallow *?utm_source=*
Disallow *?source=*

gptbot

Rule Path
Disallow /

openai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.republika.co.id/files/xml/sitemap.xml