cleaninghow.to
robots.txt
Robots Exclusion Standard data for cleaninghow.to
Resource Scan
Scan Details
Site Domain | cleaninghow.to |
Base Domain | cleaninghow.to |
Scan Status | Ok |
Last Scan | 2025-10-10T15:28:17+00:00 |
Next Scan | 2025-10-17T15:28:17+00:00 |
Last Scan
Scanned | 2025-10-10T15:28:17+00:00 |
URL | https://cleaninghow.to/robots.txt |
Domain IPs | 104.21.47.218, 172.67.172.209, 2606:4700:3031::6815:2fda, 2606:4700:3037::ac43:acd1 |
Response IP | 172.67.172.209 |
Found | Yes |
Hash | a581746d7f2a7bbe2795db63ff10c6ed0893edd503168082a7068370793fe518 |
SimHash | 4909c840e291 |
Groups
*
Rule | Path |
---|---|
Allow | /wp-content/uploads/ |
Disallow | /wp-content/plugins/ |
Disallow | /wp-admin/ |
Disallow | /readme.html |
Disallow | /refer/ |
Disallow | /*add-to-cart%3D* |
Disallow | /*add_to_wishlist%3D* |
Other Records
Field | Value |
---|---|
sitemap | https://cleaninghow.to/sitemap_index.xml |