dinegurumi.com
robots.txt
Robots Exclusion Standard data for dinegurumi.com
Resource Scan
Scan Details
Site Domain | dinegurumi.com |
Base Domain | dinegurumi.com |
Scan Status | Ok |
Last Scan | 2024-09-07T16:05:04+00:00 |
Next Scan | 2024-10-07T16:05:04+00:00 |
Last Scan
Scanned | 2024-09-07T16:05:04+00:00 |
URL | https://dinegurumi.com/robots.txt |
Redirect | https://www.dinegurumi.com/robots.txt |
Redirect Domain | www.dinegurumi.com |
Redirect Base | dinegurumi.com |
Domain IPs | 2001:8d8:100f:f000::26e, 217.160.0.231 |
Redirect IPs | 104.26.6.135, 104.26.7.135, 172.67.72.179, 2606:4700:20::681a:687, 2606:4700:20::681a:787, 2606:4700:20::ac43:48b3 |
Response IP | 172.67.72.179 |
Found | Yes |
Hash | d5c6b98640ca7f169e4a5a037eac8d1292070bf77945dfeccd21ab1e87c966f7 |
SimHash | e9898882ecbb |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-content/uploads/wc-logs/ |
Disallow | /wp-content/uploads/woocommerce_transient_files/ |
Disallow | /wp-content/uploads/woocommerce_uploads/ |
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
*
Rule | Path |
---|---|
Disallow |
Other Records
Field | Value |
---|---|
sitemap | https://www.dinegurumi.com/sitemap_index.xml |
Comments