maluku.inews.id
robots.txt

Robots Exclusion Standard data for maluku.inews.id

Resource Scan

Scan Details

Site Domain maluku.inews.id
Base Domain inews.id
Scan Status Ok
Last Scan2024-09-26T20:52:58+00:00
Next Scan 2024-10-10T20:52:58+00:00

Last Scan

Scanned2024-09-26T20:52:58+00:00
URL https://maluku.inews.id/robots.txt
Domain IPs 104.18.12.203, 104.18.13.203, 2606:4700::6812:ccb, 2606:4700::6812:dcb
Response IP 104.18.13.203
Found Yes
Hash e47b21fef6b0fdd70ff7c1f14a364acef460047db52eb3a1d2971a87a0ff5eb2
SimHash 6838c0778b32

Groups

*

Rule Path
Allow /
Disallow /getwidget
Disallow /getwidget-mobile
Disallow /getnews

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://maluku.inews.id/sitemap.xml