ciptadesa.com
robots.txt

Robots Exclusion Standard data for ciptadesa.com

Resource Scan

Scan Details

Site Domain ciptadesa.com
Base Domain ciptadesa.com
Scan Status Ok
Last Scan2024-11-17T21:53:04+00:00
Next Scan 2024-11-24T21:53:04+00:00

Last Scan

Scanned2024-11-17T21:53:04+00:00
URL https://ciptadesa.com/robots.txt
Domain IPs 5.181.216.83
Response IP 5.181.216.83
Found Yes
Hash ed4cc369ab04bbe2df4fede803514ff2ded74ab0218bb36338395c2c60d0eb1c
SimHash 6d30b335d793

Groups

adsbot-google

Rule Path
Allow /

googlebot-mobile

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /search
Allow *?s=
Allow *%26s%3D
Disallow /?hl=ar
Disallow /tag/
Disallow /category/
Disallow /page/
Disallow */page/
Disallow */embed$
Disallow */xmlrpc.php
Disallow *utm*%3D
Disallow *openstat%3D
Disallow */download/

Other Records

Field Value
sitemap https://ciptadesa.com/sitemap_index.xml