waspada.id
robots.txt

Robots Exclusion Standard data for waspada.id

Resource Scan

Scan Details

Site Domain waspada.id
Base Domain waspada.id
Scan Status Ok
Last Scan 2024-11-14T18:53:00+00:00
Next Scan 2024-11-21T18:53:00+00:00

Last Scan

Scanned 2024-11-14T18:53:00+00:00
URL https://waspada.id/robots.txt
Redirect https://www.waspada.id/robots.txt
Redirect Domain www.waspada.id
Redirect Base waspada.id
Domain IPs 104.21.7.243, 172.67.188.26, 2606:4700:3031::ac43:bc1a, 2606:4700:3036::6815:7f3
Redirect IPs 104.21.7.243, 172.67.188.26, 2606:4700:3031::ac43:bc1a, 2606:4700:3036::6815:7f3
Response IP 172.67.188.26
Found Yes
Hash 670d911ac9c89063bea8c6516abdac0ff973ea25660f18a94cbc1b9e0f9e243d
SimHash 7830b882e933

Groups

*
googlebot

Rule Path
Allow /
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /search/?q=
Disallow /komentar/*
Disallow /copy/*
Disallow *?jxrecoid=*
Disallow *?utm_source=*
Disallow *?source=*
Disallow *?PageSpeed=noscript*
Disallow *?fb_comment_id=*
Disallow *?amp=1*
Disallow *?cat=*
Disallow *?penci_spp_count=*
Disallow *?noamp=mobile*
Disallow *?ajax-request=jnews*
Disallow /indeks/?id=
Disallow *?filter_by=*
Disallow *?page=*
Disallow *?ajax-request=*
Disallow *?s&PageSpeed=noscript*
Disallow /embed/*
Disallow *?s*
Disallow *?s=*
Disallow /feed/*
Disallow *?paged=*
Disallow *?utm_source=rss&utm_medium=rss&utm_campaign=*
Disallow *?utm_source=*
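
The rules above mix plain prefixes (/wp-admin/) with wildcard patterns (*?utm_source=*). As a rough illustration of how a crawler might evaluate them, the sketch below assumes the conventions described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` and the sample paths are hypothetical, and only a subset of the rules is included; real crawlers may differ in detail.

```python
import re

# A subset of the rules listed for the "*" / "googlebot" group.
RULES = [
    ("allow", "/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/search/?q="),
    ("disallow", "*?utm_source=*"),
    ("disallow", "*?s=*"),
    ("disallow", "/feed/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be fetched.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/nasional/", "/wp-admin/", "/wp-admin/admin-ajax.php",
                   "/news/story/?utm_source=rss", "/feed/"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, /wp-admin/admin-ajax.php stays fetchable because its Allow pattern is longer than the /wp-admin/ Disallow, while any path carrying a ?utm_source= query string is blocked.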

chatgpt-user

Rule Path
Disallow /

openai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
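
The four groups above (chatgpt-user, openai, ccbot, gptbot) each carry a single Disallow / rule, which blocks the whole site for those agents. A minimal sketch of checking this with Python's standard `urllib.robotparser` follows. The robots.txt text is rebuilt here from the groups in this report rather than fetched, the catch-all group is reduced to `Allow: /`, and the user-agent strings passed to `can_fetch` are illustrative. The standard-library parser does not interpret `*` wildcards inside paths, so the wildcard rules of the first group are left out on purpose.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (assumption: simplified,
# wildcard rules of the "*" group omitted).
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /",
    "",
    "User-agent: chatgpt-user",
    "Disallow: /",
    "",
    "User-agent: openai",
    "Disallow: /",
    "",
    "User-agent: ccbot",
    "Disallow: /",
    "",
    "User-agent: gptbot",
    "Disallow: /",
]


def build_parser(lines=ROBOTS_LINES):
    """Parse robots.txt lines that are already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


if __name__ == "__main__":
    rp = build_parser()
    home = "https://www.waspada.id/"
    for agent in ["GPTBot", "CCBot", "ChatGPT-User", "openai", "ExampleBrowser"]:
        print(agent, rp.can_fetch(agent, home))
```

Agent names are compared case-insensitively by the parser, so "GPTBot" matches the gptbot group; an agent with no group of its own falls back to the `*` group.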

Other Records

Field Value
sitemap https://www.waspada.id/sitemap_index.xml
sitemap https://www.waspada.id/post-sitemap.xml