clikalia.com
robots.txt

Robots Exclusion Standard data for clikalia.com

Resource Scan

Scan Details

Site Domain clikalia.com
Base Domain clikalia.com
Scan Status Ok
Last Scan2024-09-16T06:35:47+00:00
Next Scan 2024-10-16T06:35:47+00:00

Last Scan

Scanned2024-09-16T06:35:47+00:00
URL https://clikalia.com/robots.txt
Domain IPs 13.74.249.69
Response IP 13.74.249.69
Found Yes
Hash 9482a4926be48796f126f99994505213f7bee66483bce1552de6f1c4f70e2199
SimHash a900d62fd419

Groups

*

Rule Path
Disallow /contacto
Disallow /aviso-legal
Disallow /politica-de-cookies
Disallow /politica-de-privacidad
Disallow /terminos-y-condiciones
Disallow /formulario-venta
Disallow *?
Disallow /tag
Disallow /author

adsbot-google

Rule Path
Allow /campaign.clikalia.com

orthogaffe disallow: /
ubicrawler disallow: /
doc disallow: /
zao disallow: /
zealbot disallow: /
msiecrawler disallow: /
sitesnagger disallow: /
webstripper disallow: /
webcopier disallow: /
fetch disallow: /
offline explorer disallow: /
teleport disallow: /
teleportpro disallow: /
webzip disallow: /
linko disallow: /
httrack disallow: /
microsoft.url.control disallow: /
xenu disallow: /
larbin disallow: /
libwww disallow: /
zyborg disallow: /
download ninja disallow: /
wget disallow: /
grub-client disallow: /

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.clikalia.com/directory_sitemaps.xml