juandharma.com
robots.txt

Robots Exclusion Standard data for juandharma.com

Resource Scan

Scan Details

Site Domain juandharma.com
Base Domain juandharma.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2026-01-11T07:58:01+00:00
Next Scan 2026-04-11T07:58:01+00:00

Last Successful Scan

Scanned2024-03-23T02:50:26+00:00
URL https://juandharma.com/robots.txt
Domain IPs 217.76.150.94
Response IP 217.76.150.94
Found Yes
Hash 57038ed3e79c3df3097748b7c2f5796e5d464b6f1ddfa6226d2f696cb68770fb
SimHash 680408726ba8

Groups

scrapy

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*/feed/
Disallow /*/trackback/
Disallow /*/attachment/
Disallow /author/
Disallow *?replytocom
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /*?s=
Disallow /*/*/*/feed.xml
Disallow /?attachment_id*
Disallow /search

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap http://juandharma.com/sitemap_index.xml