harshalpublications.in
robots.txt

Robots Exclusion Standard data for harshalpublications.in

Resource Scan

Scan Details

Site Domain harshalpublications.in
Base Domain harshalpublications.in
Scan Status Ok
Last Scan2025-09-29T19:26:32+00:00
Next Scan 2025-10-29T19:26:32+00:00

Last Scan

Scanned2025-09-29T19:26:32+00:00
URL https://harshalpublications.in/robots.txt
Redirect https://www.harshalpublications.in/robots.txt
Redirect Domain www.harshalpublications.in
Redirect Base harshalpublications.in
Domain IPs 108.170.41.74
Redirect IPs 108.170.41.74
Response IP 108.170.41.74
Found Yes
Hash 2cebb69670e27dd7fc158c4d1e0e7c64332cfefc38b828671c1bc7333ca136ff
SimHash e10188024f93

Groups

*

Rule Path
Disallow /wp-content/uploads/wc-logs/
Disallow /wp-content/uploads/woocommerce_transient_files/
Disallow /wp-content/uploads/woocommerce_uploads/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

*

Rule Path
Disallow /wp-content/uploads/wpo/wpo-plugins-tables-list.json