cleanclothes.org
robots.txt
Robots Exclusion Standard data for cleanclothes.org
Resource Scan
Scan Details
Site Domain | cleanclothes.org |
Base Domain | cleanclothes.org |
Scan Status | Ok |
Last Scan | 2025-05-03T17:52:29+00:00 |
Next Scan | 2025-06-02T17:52:29+00:00 |
Last Scan
Scanned | 2025-05-03T17:52:29+00:00 |
URL | https://cleanclothes.org/robots.txt |
Domain IPs | 95.211.57.76 |
Response IP | 95.211.57.76 |
Found | Yes |
Hash | e04f66a62ec145568b57fab76dd6ffab5266c8df5f96198cbecd8e476804c501 |
SimHash | ad51ab554d61 |
Groups
*
Rule | Path |
---|---|
Disallow |
googlebot
Rule | Path |
---|---|
Disallow | /*? |
Disallow | /*atct_album_view$ |
Disallow | /*folder_factories$ |
Disallow | /*folder_summary_view$ |
Disallow | /*login_form$ |
Disallow | /*mail_password_form$ |
Disallow | /%40%40search |
Disallow | /*search_rss$ |
Disallow | /*sendto_form$ |
Disallow | /*summary_view$ |
Disallow | /*thumbnail_view$ |
Disallow | /*view$ |
Other Records
Field | Value |
---|---|
sitemap | https://cleanclothes.org/sitemap.xml.gz |
Comments