terkini.id
robots.txt

Robots Exclusion Standard data for terkini.id

Resource Scan

Scan Details

Site Domain terkini.id
Base Domain terkini.id
Scan Status Ok
Last Scan 2024-09-23T20:50:34+00:00
Next Scan 2024-09-30T20:50:34+00:00

Last Scan

Scanned 2024-09-23T20:50:34+00:00
URL https://terkini.id/robots.txt
Domain IPs 104.26.10.178, 104.26.11.178, 172.67.73.155, 2606:4700:20::681a:ab2, 2606:4700:20::681a:bb2, 2606:4700:20::ac43:499b
Response IP 104.26.11.178
Found Yes
Hash c64ff9afa5e36f2b0bacc41dcaa72d2caa436ed6e1201e32af6c9c024303c834
SimHash de43c27894b1

Groups

*

Rule Path
Disallow /search$
Disallow /search?*
Disallow /search/
Disallow /app/
Disallow */komentar$
Disallow */komentar?*
Disallow */komentar/
Disallow /mitra_wp/
Disallow /mitra_wp/*
Disallow /api2024/
Disallow /api2024/*
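
The paths in the * group use two pattern characters: * and a trailing $. The sketch below shows one way to check a URL path against these rules. It assumes the common convention that * matches any run of characters and a trailing $ anchors the end of the path; the function names are illustrative and not part of the scan. Because this group has no Allow rules, any matching Disallow rule is treated as blocking.

```python
import re

# Disallow paths copied from the "*" group in the scan above.
DISALLOW_RULES = [
    "/search$", "/search?*", "/search/", "/app/",
    "*/komentar$", "*/komentar?*", "*/komentar/",
    "/mitra_wp/", "/mitra_wp/*", "/api2024/", "/api2024/*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    Assumption: '*' matches any run of characters, a trailing '$' anchors
    the end of the path, and every other character is literal.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    # Escape the literal pieces and join them with '.*' for each wildcard.
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + pattern + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in DISALLOW_RULES)
```

Under these assumptions, /search and /search?q=berita are blocked, while /searching is not, because /search$ only matches the exact path. Likewise, */komentar$ blocks any path that ends in /komentar regardless of the directory it sits in.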

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

nuclei

Rule Path
Disallow /

wikido

Rule Path
Disallow /

riddler

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

node/simplecrawler

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /
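
Each named group above blocks the whole site for that crawler, while any crawler without its own group falls back to the * group. The sketch below illustrates that selection step. It assumes a simple exact, case-insensitive comparison against the group names as the scan lists them; real parsers may normalize names differently (for example, names carrying a version such as dotbot/1.0), and select_group is a hypothetical helper.

```python
# Group names copied from the scan above (listed there in lowercase).
NAMED_GROUPS = {
    "chatgpt-user", "openai", "ccbot", "gptbot", "nuclei", "wikido",
    "riddler", "petalbot", "zoominfobot", "go-http-client",
    "node/simplecrawler", "cazoodlebot", "dotbot/1.0", "gigabot",
    "barkrowler", "blexbot", "magpie-crawler",
}


def select_group(crawler_name: str) -> str:
    """Pick the group a crawler should obey (illustrative only).

    Assumption: exact, case-insensitive match on the crawler name;
    anything without a named group uses the "*" group.
    """
    name = crawler_name.strip().lower()
    return name if name in NAMED_GROUPS else "*"


def is_fully_blocked(crawler_name: str) -> bool:
    """Every named group in this scan contains only 'Disallow /'."""
    return select_group(crawler_name) != "*"
```

For instance, a crawler identifying itself as GPTBot selects the gptbot group and is blocked from every path, whereas a crawler with no matching group is subject only to the path rules in the * group.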

Other Records

Field Value
sitemap https://terkini.id/sitemap.xml