cakrawali.com
robots.txt

Robots Exclusion Standard data for cakrawali.com

Resource Scan

Scan Details

Site Domain cakrawali.com
Base Domain cakrawali.com
Scan Status Ok
Last Scan 2026-01-01T07:25:30+00:00
Next Scan 2026-01-08T07:25:30+00:00

Last Scan

Scanned 2026-01-01T07:25:30+00:00
URL https://cakrawali.com/robots.txt
Domain IPs 84.17.46.49
Response IP 84.17.46.49
Found Yes
Hash 1a1ab29a3315f58459707afba11dd4bb22250ff5a121ff264d7a0b2054afc253
SimHash 691ccd8a66a9

Groups

googlebot-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads

googlebot

Rule Path
Allow /wp-content/uploads

mediapartners-google

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

bingbot

Rule Path
Allow /

bingpreview

Rule Path
Allow /

*

Rule Path
Allow /wp-content/uploads
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/themes
Disallow /comments
Disallow /readme.html
Disallow /comments/feed/
Disallow /trackback/
Disallow /feed/
Disallow /index.php
Disallow /xmlrpc.php
Disallow /search/

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

zookabot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cakrawali.com/sitemap.xml
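
The groups above can be checked against concrete URLs. The following is a minimal sketch, assuming the rules are rebuilt into robots.txt syntax from this report (the original file's exact layout and casing are not shown here) and evaluated with Python's standard urllib.robotparser. The agent name "examplecrawler" and the sample URLs are hypothetical, chosen only to exercise the "*" group.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the Groups section of the report. The directive layout
# is a reconstruction (assumption), not the original file.
ALLOW_ALL = [("Allow", "/")]
BLOCK_ALL = [("Disallow", "/")]
UPLOADS_ONLY = [("Allow", "/wp-content/uploads")]

WILDCARD_DISALLOWS = [
    "/cgi-bin/", "/wp-admin/", "/wp-includes", "/wp-content/plugins",
    "/wp-content/themes", "/comments", "/readme.html", "/comments/feed/",
    "/trackback/", "/feed/", "/index.php", "/xmlrpc.php", "/search/",
]

GROUPS = {
    "googlebot-mobile": ALLOW_ALL,
    "googlebot-image": UPLOADS_ONLY,
    "googlebot": UPLOADS_ONLY,
    "mediapartners-google": ALLOW_ALL,
    "adsbot-google": ALLOW_ALL,
    "adsbot-google-mobile": ALLOW_ALL,
    "bingbot": ALLOW_ALL,
    "bingpreview": ALLOW_ALL,
    "*": UPLOADS_ONLY + [("Disallow", p) for p in WILDCARD_DISALLOWS],
    "ahrefsbot": BLOCK_ALL,
    "semrushbot": BLOCK_ALL,
    "semrushbot-sa": BLOCK_ALL,
    "dotbot": BLOCK_ALL,
    "zookabot": BLOCK_ALL,
    "httrack": BLOCK_ALL,
}


def build_robots_lines(groups):
    """Render the groups as robots.txt lines, one blank line per group."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # a blank line ends the group
    lines.append("Sitemap: https://cakrawali.com/sitemap.xml")
    return lines


rp = RobotFileParser()
rp.parse(build_robots_lines(GROUPS))

BASE = "https://cakrawali.com"
for agent, path in [
    ("ahrefsbot", "/"),
    ("bingbot", "/wp-admin/"),
    ("googlebot", "/wp-admin/"),
    ("examplecrawler", "/wp-admin/"),  # hypothetical agent, falls to "*"
    ("examplecrawler", "/wp-content/uploads/a.png"),
]:
    verdict = "allowed" if rp.can_fetch(agent, BASE + path) else "blocked"
    print(f"{agent:16} {path:28} {verdict}")
```

Two things are worth noting when reading the results. A crawler that matches its own group does not inherit the "*" rules, so under this reconstruction googlebot is not blocked from /wp-admin/ because its group contains only an Allow line. Also, urllib.robotparser applies the first matching rule in a group, whereas RFC 9309 specifies the longest matching path; other parsers may therefore differ on edge cases.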