josepguasch.com
robots.txt

Robots Exclusion Standard data for josepguasch.com

Resource Scan

Scan Details

Site Domain josepguasch.com
Base Domain josepguasch.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2025-11-28T07:39:29+00:00
Next Scan 2025-12-05T07:39:29+00:00

Last Successful Scan

Scanned 2025-10-28T07:38:55+00:00
URL https://josepguasch.com/robots.txt
Domain IPs 213.158.84.112
Response IP 213.158.84.112
Found Yes
Hash bcc0d02a9a6d0fed99d7597c805c004e09ec35c8a627746d1f0c07e6cca310f7
SimHash 402c48512919
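
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body (an assumption, not something the report confirms), a later copy of the file could be compared against the recorded value roughly like this. The function names are hypothetical.

```python
import hashlib

# Hash recorded for the last successful scan (copied from the report).
RECORDED_HASH = "bcc0d02a9a6d0fed99d7597c805c004e09ec35c8a627746d1f0c07e6cca310f7"


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value (case-insensitive)."""
    return body_digest(body) == recorded.lower()
```

If the scanner normalizes the file before hashing (line endings, trailing whitespace), the digests will differ even when the rules are identical.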

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /wp-login
Disallow /wp-admin
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/attachment/
Disallow /author/
Disallow /*/page/
Disallow /*/feed/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /*?s=
Disallow /*/*/*/feed.xml
Disallow /?attachment_id*
Disallow /WEB-ANTIGUA/
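
To see how the rules above interact, here is a minimal sketch of a matcher for the `*` group. It assumes the matching behavior described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching rule wins, and Allow wins a tie. The names `RULES` and `is_allowed` are illustrative and not part of the scan data. Python's built-in `urllib.robotparser` is not used here because it does not expand `*` wildcards.

```python
import re

# Rules transcribed from the "*" group in the table above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login"),
    ("disallow", "/wp-admin"),
    ("disallow", "/*/feed/"),
    ("disallow", "/*/trackback/"),
    ("disallow", "/*/attachment/"),
    ("disallow", "/author/"),
    ("disallow", "/*/page/"),
    ("disallow", "/tag/*/page/"),
    ("disallow", "/tag/*/feed/"),
    ("disallow", "/page/"),
    ("disallow", "/comments/"),
    ("disallow", "/xmlrpc.php"),
    ("disallow", "/*?s="),
    ("disallow", "/*/*/*/feed.xml"),
    ("disallow", "/?attachment_id*"),
    ("disallow", "/WEB-ANTIGUA/"),
]


def _to_regex(rule_path):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape literal segments and join them with ".*" for each "*".
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under the given rules."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, rule_path in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if _to_regex(rule_path).match(url_path):
            length = len(rule_path)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because the Allow rule is longer than either `/wp-admin` Disallow rule, while any other path under `/wp-admin/` is blocked. Individual crawlers may resolve conflicts differently, so this is an approximation rather than a statement of how any particular bot behaves.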

Other Records

Field Value
sitemap https://www.josepguasch.com/sitemap.xml

Warnings

  • 2 invalid lines.