detikinet.com
robots.txt

Robots Exclusion Standard data for detikinet.com

Resource Scan

Scan Details

Site Domain detikinet.com
Base Domain detikinet.com
Scan Status Ok
Last Scan 2024-04-28T10:59:57+00:00
Next Scan 2024-05-28T10:59:57+00:00

Last Scan

Scanned 2024-04-28T10:59:57+00:00
URL http://detikinet.com/robots.txt
Redirect https://inet.detik.com/robots.txt
Redirect Domain inet.detik.com
Redirect Base detik.com
Domain IPs 103.49.221.103, 203.190.242.103
Redirect IPs 103.49.221.103, 203.190.242.103
Response IP 103.49.221.103
Found Yes
Hash 9bffadfec3ded985d83a2dcf570def7e551a9656e80a19ab047068d583674b16
SimHash 78009371df23

Groups

googlebot

Rule Path
Disallow */komentar$
Disallow */komentar?*
Disallow */komentar/
Disallow /ajax/
Disallow /api/
Disallow */read/2011
Disallow */read/2012
Disallow */read/2013
Disallow */read/2014
Disallow */read/2015
Disallow */read/2016
Disallow *?mpiinet
Disallow */indeksfokus
Disallow *?utm_source
Disallow *?query-string
Disallow *?tag_from
Disallow *?date=
Disallow *?_ga
Disallow *%26sortby
Disallow *?device=desktop
Disallow *%26device%3Ddesktop
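
The googlebot rules above rely on two pattern extensions: `*` matches any run of characters, and a trailing `$` anchors the end of the URL. A minimal sketch of how such patterns could be evaluated follows. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, only a subset of the listed rules is included, and Allow/Disallow precedence is ignored because this group contains only Disallow rules.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' becomes '.*'; a trailing '$' anchors the end of the path.
    Everything else (including '?' and '%26') is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_disallowed(path: str, rules: list[str]) -> bool:
    """Return True if any Disallow pattern matches the path (plus query string)."""
    return any(rule_to_regex(rule).match(path) for rule in rules)

# A subset of the googlebot Disallow rules listed in the scan.
GOOGLEBOT_DISALLOW = [
    "*/komentar$",
    "*/komentar?*",
    "/ajax/",
    "*/read/2011",
    "*?utm_source",
]

if __name__ == "__main__":
    # Example paths are made up for illustration.
    for path in ["/berita/komentar", "/berita/komentar-lain",
                 "/read/2011/01/x", "/read/2020/x", "/news?utm_source=x"]:
        print(path, is_disallowed(path, GOOGLEBOT_DISALLOW))
```

Note that a rule such as `*?utm_source` only matches when `utm_source` is the first query parameter, since the `?` is literal.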

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /
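
The groups above block four named crawlers outright and allow everything for all other agents. As a rough illustration, the sketch below feeds a reconstruction of those groups to Python's standard `urllib.robotparser`. The robots.txt text is an assumption rebuilt from the scan data, not the file's actual bytes, and the googlebot group is left out because `urllib.robotparser` does not interpret `*` and `$` wildcards. `allowed` and `ExampleBot` are hypothetical names.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the simple groups reported by the scan;
# the real file's ordering and whitespace are unknown.
RECONSTRUCTED = """\
User-agent: chatgpt-user
Disallow: /

User-agent: openai
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: *
Allow: /
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED.splitlines())

def allowed(agent: str, url: str = "https://inet.detik.com/") -> bool:
    """Check whether the given user agent may fetch the URL under these groups."""
    return parser.can_fetch(agent, url)

if __name__ == "__main__":
    for agent in ["GPTBot", "CCBot", "ChatGPT-User", "ExampleBot"]:
        print(agent, allowed(agent))
```

Agent names are compared case-insensitively, so `GPTBot` falls under the `gptbot` group, while an agent with no named group falls through to `*`.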

Other Records

Field Value
sitemap https://inet.detik.com/sitemap.xml