detik.com
robots.txt

Robots Exclusion Standard data for detik.com

Resource Scan

Scan Details

Site Domain detik.com
Base Domain detik.com
Scan Status Ok
Last Scan 2024-11-14T06:35:41+00:00
Next Scan 2024-11-21T06:35:41+00:00

Last Scan

Scanned 2024-11-14T06:35:41+00:00
URL https://detik.com/robots.txt
Redirect https://www.detik.com/robots.txt
Redirect Domain www.detik.com
Redirect Base detik.com
Domain IPs 103.49.221.211, 203.190.242.211
Redirect IPs 103.49.221.211, 203.190.242.211
Response IP 103.49.221.211
Found Yes
Hash 26f06c6b9002edbf4765ca008b7efc21257422aaa7afe9004039e505d299e37c
SimHash 69191b73e193

Groups

googlebot

Rule Path
Disallow */komentar$
Disallow */main$
Disallow */main/*
Disallow /ajax/
Disallow /api/
Disallow /search/*
Disallow /tag/news/*
Disallow /tag/foto/*
Disallow *?_ga
Disallow *%26sortby
Disallow *%26device%3Ddesktop
Disallow */?
Disallow *edu/pov/d-*
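
The googlebot paths above use the two wildcard characters commonly supported in robots.txt: `*` for any run of characters and a trailing `$` to anchor the end of the URL. The following is a minimal sketch of how such patterns could be evaluated, assuming those semantics (as described in RFC 9309). The names `robots_pattern_to_regex` and `is_disallowed` are hypothetical helpers, not part of any library, and the sketch ignores percent-encoding normalization and Allow/Disallow precedence.

```python
import re
from typing import Iterable

# Disallow paths copied from the googlebot group in this report.
GOOGLEBOT_DISALLOW = [
    "*/komentar$", "*/main$", "*/main/*", "/ajax/", "/api/",
    "/search/*", "/tag/news/*", "/tag/foto/*", "*?_ga",
    "*%26sortby", "*%26device%3Ddesktop", "*/?", "*edu/pov/d-*",
]

def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the URL; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns: Iterable[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(robots_pattern_to_regex(p).match(path) for p in patterns)

if __name__ == "__main__":
    # Example paths are made up for illustration.
    for path in ["/news/d-123/komentar", "/news/d-123/komentar/2",
                 "/ajax/load", "/?utm=1"]:
        print(path, is_disallowed(path, GOOGLEBOT_DISALLOW))
```

Under these assumptions, `*/komentar$` blocks a URL that ends in `/komentar` but not one that continues past it, while `/ajax/` behaves as a plain prefix match.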

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /
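
A crawler applies only one of the groups above: the one whose user-agent token matches its own, falling back to `*` when none does. The sketch below illustrates that selection with the groups from this report, assuming a simple case-insensitive exact match on the product token. `GROUPS` and `select_group` are hypothetical names introduced for illustration.

```python
from typing import Dict, List, Tuple

Rule = Tuple[str, str]  # (rule type, path)

# Groups as listed in this report, keyed by lowercased user-agent token.
GROUPS: Dict[str, List[Rule]] = {
    "googlebot": [
        ("Disallow", p) for p in [
            "*/komentar$", "*/main$", "*/main/*", "/ajax/", "/api/",
            "/search/*", "/tag/news/*", "/tag/foto/*", "*?_ga",
            "*%26sortby", "*%26device%3Ddesktop", "*/?", "*edu/pov/d-*",
        ]
    ],
    "chatgpt-user": [("Disallow", "/")],
    "ccbot": [("Disallow", "/")],
    "gptbot": [("Disallow", "/")],
    "*": [("Allow", "/")],
}

def select_group(product_token: str, groups: Dict[str, List[Rule]]) -> List[Rule]:
    """Pick the rules for a crawler, falling back to the '*' group.

    Assumption: tokens are compared case-insensitively and must match
    exactly; real crawlers may apply looser matching.
    """
    return groups.get(product_token.lower(), groups.get("*", []))

if __name__ == "__main__":
    print(select_group("GPTBot", GROUPS))   # rules from the gptbot group
    print(select_group("bingbot", GROUPS))  # no own group, so the '*' rules
```

With this selection, the three AI-related agents are blocked from the whole site, googlebot gets its own path restrictions, and every other crawler falls under the permissive `*` group.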

Other Records

Field Value
sitemap https://www.detik.com/sitemap.xml