sh-man.com
robots.txt
Robots Exclusion Standard data for sh-man.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | sh-man.com |
| Base Domain | sh-man.com |
| Scan Status | Ok |
| Last Scan | 2025-11-21T09:53:49+00:00 |
| Next Scan | 2025-12-05T09:53:49+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-21T09:53:49+00:00 |
| URL | https://sh-man.com/robots.txt |
| Domain IPs | 104.21.39.167, 172.67.146.198, 2606:4700:3033::ac43:92c6, 2606:4700:3035::6815:27a7 |
| Response IP | 104.21.39.167 |
| Found | Yes |
| Hash | 845beaf837db77aa64334c4646a4ec03c065b4c2ff604c247bb1c5732b94a988 |
| SimHash | 63476f730ba7 |
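The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response body; the helper names `sha256_hex` and `matches_reported` are illustrative and not part of the scanner.

```python
import hashlib

# Hash value copied from the scan table above.
REPORTED_HASH = "845beaf837db77aa64334c4646a4ec03c065b4c2ff604c247bb1c5732b94a988"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a fetched body against the reported hash (assumed SHA-256)."""
    return sha256_hex(body) == reported.lower()


# Example (not executed here): fetch the file and compare.
# import urllib.request
# body = urllib.request.urlopen("https://sh-man.com/robots.txt").read()
# print(matches_reported(body))
```

A mismatch would not necessarily mean the file changed; the scanner may hash a normalized form of the content rather than the raw bytes.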
Groups
User-agent: *
No rules defined. All paths allowed.
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
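A crawl-delay of 10 is commonly read as a request to wait 10 seconds between successive fetches, though the directive is not part of the core Robots Exclusion Protocol and crawlers interpret it differently. A minimal sketch of a throttle that honors it, assuming the value is in seconds; the class name `CrawlDelayThrottle` is hypothetical.

```python
import time


class CrawlDelayThrottle:
    """Block so that consecutive requests are at least `delay_seconds` apart."""

    def __init__(self, delay_seconds=10.0, clock=time.monotonic, sleep=time.sleep):
        self._delay = delay_seconds
        self._clock = clock      # injectable for testing
        self._sleep = sleep      # injectable for testing
        self._last = None        # time of the previous request, if any

    def wait(self):
        """Sleep for whatever remains of the delay, then record the request time."""
        if self._last is not None:
            remaining = self._delay - (self._clock() - self._last)
            if remaining > 0:
                self._sleep(remaining)
        self._last = self._clock()
```

Calling `wait()` before each request is enough; the first call returns immediately and later calls sleep only for the remaining part of the interval.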
User-agent: *
| Rule | Path |
|---|---|
| Allow | / |
| Disallow | /*%26amp%3Blt%3Biframe |
| Disallow | /*?currency= |
| Disallow | /*/p*?page=* |
| Disallow | /*/page-*?page=* |
| Disallow | /cart |
| Disallow | */redirect |
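The rules above rely on `*` wildcards, so plain prefix matching is not enough to evaluate them. The sketch below assumes the matching conventions described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end, the longest matching pattern wins, and Allow wins a tie. Real crawlers may differ. The pattern `*/redirect` does not start with `/`, and this sketch simply treats its leading `*` as a wildcard. The function names are illustrative.

```python
import re

# Rules copied from the table above, in order.
RULES = [
    ("allow", "/"),
    ("disallow", "/*%26amp%3Blt%3Biframe"),
    ("disallow", "/*?currency="),
    ("disallow", "/*/p*?page=*"),
    ("disallow", "/*/page-*?page=*"),
    ("disallow", "/cart"),
    ("disallow", "*/redirect"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed
```

For example, `is_allowed("/cart")` is False because `/cart` is a longer match than `/`, while `is_allowed("/products/widget")` is True because only `Allow: /` matches.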
Other Records
| Field | Value |
|---|---|
| sitemap | https://sh-man.com/sitemap.xml |
Warnings
- 10 invalid lines.
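The report does not say which lines were counted as invalid or what criteria the scanner uses. As a rough way to locate candidates, the sketch below flags any non-blank, non-comment line that is not of the form `field: value` with a commonly used field name. The field list and the function name `find_suspect_lines` are assumptions, and the result may not match the scanner's count.

```python
# Field names commonly seen in robots.txt files (an assumed list).
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def find_suspect_lines(text):
    """Return (line_number, raw_line) pairs that do not look like valid records."""
    suspects = []
    for number, raw in enumerate(text.splitlines(), start=1):
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if not line:
            continue
        field, sep, _value = line.partition(":")
        if not sep or field.strip().lower() not in KNOWN_FIELDS:
            suspects.append((number, raw))
    return suspects
```

Running this on the fetched file body (decoded as text) would list the lines worth inspecting by hand.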