ciptadesa.com
robots.txt
Robots Exclusion Standard data for ciptadesa.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | ciptadesa.com |
Base Domain | ciptadesa.com |
Scan Status | Ok |
Last Scan | 2024-11-17T21:53:04+00:00 |
Next Scan | 2024-11-24T21:53:04+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-17T21:53:04+00:00 |
URL | https://ciptadesa.com/robots.txt |
Domain IPs | 5.181.216.83 |
Response IP | 5.181.216.83 |
Found | Yes |
Hash | ed4cc369ab04bbe2df4fede803514ff2ded74ab0218bb36338395c2c60d0eb1c |
SimHash | 6d30b335d793 |
Groups
googlebot-mobile
Rule | Path |
---|---|
Allow | / |
Disallow | /cgi-bin/ |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /search |
Allow | *?s= |
Allow | *%26s%3D |
Disallow | /?hl=ar |
Disallow | /tag/ |
Disallow | /category/ |
Disallow | /page/ |
Disallow | */page/ |
Disallow | */embed$ |
Disallow | */xmlrpc.php |
Disallow | *utm*%3D |
Disallow | *openstat%3D |
Disallow | */download/ |
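The rules above mix plain prefixes with `*` wildcards and a `$` end anchor, so their combined effect on a given path is not obvious from the table alone. The following is a minimal sketch of how a crawler following the longest-match convention of RFC 9309 might evaluate them. The function names `pattern_to_regex` and `is_allowed` are hypothetical, the sketch does not normalize percent-encoding, and actual crawlers may resolve edge cases differently.

```python
import re

# Rules copied from the googlebot-mobile group above, in listed order.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/search"),
    ("allow", "*?s="),
    ("allow", "*%26s%3D"),
    ("disallow", "/?hl=ar"),
    ("disallow", "/tag/"),
    ("disallow", "/category/"),
    ("disallow", "/page/"),
    ("disallow", "*/page/"),
    ("disallow", "*/embed$"),
    ("disallow", "*/xmlrpc.php"),
    ("disallow", "*utm*%3D"),
    ("disallow", "*openstat%3D"),
    ("disallow", "*/download/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    (Hypothetical helper, not part of any library.)
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    The longest matching pattern wins; on a tie, allow wins.
    A path that matches no rule is allowed by default.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as rules are prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, allowed = length, kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ["/", "/wp-admin/", "/wp-admin/admin-ajax.php",
              "/blog/page/2/", "/post/embed", "/?s=desa"]:
        print(p, "allowed" if is_allowed(p) else "blocked")
```

Under this reading, `/wp-admin/admin-ajax.php` stays fetchable because its Allow pattern is longer than the `/wp-admin/` Disallow, while `/search?s=...` remains blocked because `/search` is longer than `*?s=`.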
Other Records
Field | Value |
---|---|
sitemap | https://ciptadesa.com/sitemap_index.xml |
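To relate the tables back to the file they were parsed from, the sketch below serializes one group plus the sitemap record into robots.txt syntax. It is a reconstruction under assumptions: the original file's field-name casing, blank lines, and comments are not shown in this report, `render_robots` is a hypothetical helper, and only the first few rules of the group are included to keep the example short.

```python
def render_robots(user_agent, rules, sitemaps):
    """Render one user-agent group and sitemap records as robots.txt text.

    (Hypothetical helper; the layout is a conventional one, not
    necessarily the layout of the scanned file.)
    """
    lines = [f"User-agent: {user_agent}"]
    lines += [f"{kind.capitalize()}: {path}" for kind, path in rules]
    lines.append("")  # blank line between the group and the other records
    lines += [f"Sitemap: {url}" for url in sitemaps]
    return "\n".join(lines) + "\n"


# Excerpt of the googlebot-mobile group from the Groups table.
GROUP_RULES_EXCERPT = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]

if __name__ == "__main__":
    print(render_robots(
        "googlebot-mobile",
        GROUP_RULES_EXCERPT,
        ["https://ciptadesa.com/sitemap_index.xml"],
    ))
```

The sitemap record is written outside the group because it applies to the whole site rather than to a single user agent.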