news.detik.com
robots.txt

Robots Exclusion Standard data for news.detik.com

Resource Scan

Scan Details

Site Domain news.detik.com
Base Domain detik.com
Scan Status Ok
Last Scan2024-04-27T21:25:19+00:00
Next Scan 2024-05-27T21:25:19+00:00

Last Scan

Scanned2024-04-27T21:25:19+00:00
URL https://news.detik.com/robots.txt
Domain IPs 103.49.221.187, 203.190.242.187
Response IP 203.190.242.187
Found Yes
Hash 5cbea365328b4fdfc37d39c0fdb10571af0dda67c638a6b83d71f97265f54a20
SimHash 79008952cd13

Groups

googlebot

Rule Path
Disallow */komentar$
Disallow */komentar?*
Disallow */komentar/
Disallow /ajax/
Disallow /api/
Disallow *?tag_from
Disallow *?_ga
Disallow *%26sortby
Disallow *?device=desktop
Disallow *%26device%3Ddesktop
Disallow */jawabarat/
Disallow */jawatengah/
Disallow */jawatimur/
Disallow */msite/

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://news.detik.com/sitemap.xml