aceitunaslou.com
robots.txt

Robots Exclusion Standard data for aceitunaslou.com

Resource Scan

Scan Details

Site Domain aceitunaslou.com
Base Domain aceitunaslou.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2026-01-24T18:40:41+00:00
Next Scan 2026-03-25T18:40:41+00:00

Last Successful Scan

Scanned 2025-11-02T16:26:28+00:00
URL https://aceitunaslou.com/robots.txt
Domain IPs 213.158.84.89
Response IP 213.158.84.89
Found Yes
Hash d90cb22f96a3d9d2d73388fd9c55876ff916d285d31e137ba7de29c312c62cda
SimHash ab9c59c1c6b3

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-content/themes
Allow /wp-content/plugins/
Allow /wp-includes/css/
Allow /wp-includes/js/
Disallow /wp-admin/
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/attachment/
Disallow /author/
Disallow /varnish/
Disallow /smartbox/*
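
As a rough illustration of how the rules in the * group might be evaluated, the sketch below feeds a few of them to Python's standard urllib.robotparser. The robots.txt text is reconstructed from this report and its exact layout is an assumption; the allowed helper and the examplebot agent name are hypothetical. The wildcard paths (/*/feed/ and similar) are left out because this parser matches by plain path prefix.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report (assumed layout,
# wildcard rules omitted because robotparser only does prefix matching).
ROBOTS_LINES = [
    "User-agent: *",
    "Allow: /wp-content/uploads/",
    "Allow: /wp-includes/js/",
    "Disallow: /wp-admin/",
    "Disallow: /author/",
    "Disallow: /varnish/",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)


def allowed(agent: str, path: str) -> bool:
    """Hypothetical helper: may `agent` fetch `path` on the scanned host?"""
    return parser.can_fetch(agent, "https://aceitunaslou.com" + path)


if __name__ == "__main__":
    for path in ("/", "/wp-admin/", "/wp-content/uploads/logo.png", "/author/lou/"):
        print(path, allowed("examplebot", path))
```

Paths that match no rule, such as /, fall through to the default of being allowed.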

googlebot-image

Rule Path
Allow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /
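
The named groups above take precedence over the * group for the agents they name. The sketch below shows this with Python's standard urllib.robotparser, using only three of the groups from this report; the robots.txt layout is reconstructed and therefore an assumption, and examplebot is a hypothetical agent that has no group of its own.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of three groups listed in this report.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /wp-admin/",
    "",
    "User-agent: googlebot-image",
    "Allow: /",
    "",
    "User-agent: wget",
    "Disallow: /",
]

rules = RobotFileParser()
rules.parse(ROBOTS_LINES)

BASE = "https://aceitunaslou.com"  # host from the scanned URL

if __name__ == "__main__":
    for agent in ("examplebot", "googlebot-image", "wget"):
        for path in ("/", "/wp-admin/"):
            print(agent, path, rules.can_fetch(agent, BASE + path))
```

An agent with its own group is judged by that group alone, so googlebot-image is not bound by the /wp-admin/ rule in the * group, while wget is blocked everywhere.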

Other Records

Field Value
sitemap http://www.aceitunaslou.com/sitemap_index.xml
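
The sitemap record points at a different scheme and host (http, www.) than the URL that was scanned (https, no www.). A minimal sketch of how a client might read the record and notice the difference, assuming Python 3.8 or later for site_maps(); the single-line robots.txt input is reconstructed from this report.

```python
from urllib.parse import urlsplit
from urllib.robotparser import RobotFileParser

SCANNED_URL = "https://aceitunaslou.com/robots.txt"  # from "Last Successful Scan"

# Assumed reconstruction of the sitemap record shown above.
sitemap_parser = RobotFileParser()
sitemap_parser.parse(["Sitemap: http://www.aceitunaslou.com/sitemap_index.xml"])

# site_maps() returns a list of URLs, or None when no record is present.
sitemaps = sitemap_parser.site_maps() or []

scanned = urlsplit(SCANNED_URL)
mismatches = [
    url
    for url in sitemaps
    if (urlsplit(url).scheme, urlsplit(url).hostname)
    != (scanned.scheme, scanned.hostname)
]

if __name__ == "__main__":
    print("sitemaps:", sitemaps)
    print("scheme/host differs from scanned URL:", mismatches)
```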