guidavalencia.com
robots.txt

Robots Exclusion Standard data for guidavalencia.com

Resource Scan

Scan Details

Site Domain guidavalencia.com
Base Domain guidavalencia.com
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-12T05:00:52+00:00
Next Scan 2025-10-26T05:00:52+00:00

Last Successful Scan

Scanned 2025-09-27T00:08:53+00:00
URL https://guidavalencia.com/robots.txt
Domain IPs 185.201.65.222
Response IP 185.201.65.222
Found Yes
Hash 439eb4338a69ef583b5c20ac1c240a5d1977d0e7c0fa6fa249533e3810c89c15
SimHash e8d85d1056f4

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Allow /wp-content/uploads/
Disallow /feed/$
Disallow /wp-
Disallow /wp-content/
Disallow /trackback/
Disallow /feed/
Disallow /?s=
Disallow /search/
Disallow /archives/
Disallow /*?
Disallow /*.php$
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow */feed/
Disallow */trackback/
Disallow /turismo/
Disallow /pieroviajero/
Disallow /*.sql$
Disallow /*.tgz$
Disallow /*.gz$
Disallow /*.tar$
Disallow /*.svn$
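
The `*` group above mixes plain prefixes, `*` wildcards, `$` end anchors, and one Allow rule nested inside a disallowed prefix, so it may help to see how a crawler could resolve them. The following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the common "longest matching pattern wins, Allow wins ties" convention, uses only a subset of the rules listed above, and the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical.

```python
import re

# A subset of the '*' group rules listed above, as (directive, path) pairs.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-content/uploads/"),
    ("Disallow", "/wp-content/"),
    ("Disallow", "/wp-"),
    ("Disallow", "/*?"),
    ("Disallow", "/*.php$"),
    ("Disallow", "/feed/$"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under the given rules.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for directive, path in rules:
        if pattern_to_regex(path).match(url_path):
            is_allow = directive == "Allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len, allowed = len(path), is_allow
    return allowed


uploads_ok = is_allowed("/wp-content/uploads/photo.jpg")   # Allow is the longest match
themes_ok = is_allowed("/wp-content/themes/style.css")     # only Disallow rules match
search_ok = is_allowed("/?s=valencia")                      # matches /*?
```

Under these assumptions the uploads path is allowed because `/wp-content/uploads/` is longer than both `/wp-content/` and `/wp-`, while the theme stylesheet and the query URL are blocked.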

arianna

Rule Path
Disallow /

arianna news

Rule Path
Disallow /

arianna web

Rule Path
Disallow /

arianna robot

Rule Path
Disallow /

webnews arianna

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 15

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
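
Groups such as noxtrumbot, msnbot, bingbot, and slurp carry only a crawl-delay and no path rules, which means every path is allowed but requests should be spaced out. As a rough illustration, Python's standard `urllib.robotparser` can read such records. The text in `ROBOTS_FRAGMENT` is a hand-written reconstruction from the groups in this report, not the site's raw file, and is an assumption.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): the raw robots.txt is not shown in this report.
ROBOTS_FRAGMENT = """\
User-agent: bingbot
Crawl-delay: 15

User-agent: slurp
Crawl-delay: 10

User-agent: httrack
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_FRAGMENT.splitlines())

# Delay in seconds requested for each agent, or None if the group sets none.
bing_delay = parser.crawl_delay("bingbot")
slurp_delay = parser.crawl_delay("slurp")

# A group with only a crawl-delay leaves all paths fetchable;
# a group with "Disallow: /" blocks everything.
bing_ok = parser.can_fetch("bingbot", "/")
httrack_ok = parser.can_fetch("httrack", "/")
```

Note that `urllib.robotparser` does plain prefix matching and does not interpret the `*` and `$` patterns used in the `*` group, so it is only suitable here for the simple per-agent groups.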

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

Comments

  • Disallow: /page/
  • Disallow: /tag/
  • Disallow: /category/
  • Do not index backups
  • Rules for known bots