catherineweitzman.com
robots.txt
Robots Exclusion Standard data for catherineweitzman.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | catherineweitzman.com |
| Base Domain | catherineweitzman.com |
| Scan Status | Ok |
| Last Scan | 2025-10-08T14:41:32+00:00 |
| Next Scan | 2025-10-22T14:41:32+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-10-08T14:41:32+00:00 |
| URL | https://catherineweitzman.com/robots.txt |
| Domain IPs | 67.20.113.46 |
| Response IP | 67.20.113.46 |
| Found | Yes |
| Hash | 7433e724350c0862ca3acbdda9aeee810bae653d8889fcd0ee5730081f1aebe3 |
| SimHash | 210ef8622691 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /catalog/ |
| Disallow | /backups/ |
| Disallow | /email_templates/ |
| Disallow | /Flash/ |
| Disallow | /footer-test/ |
| Disallow | /images/ |
| Disallow | /includes/ |
| Disallow | /newsletter/ |
| Disallow | /proofs/ |
| Disallow | /Templates/ |
| Disallow | /test_zenstore1/ |
| Disallow | /test_zenstore3/ |
| Disallow | /zen-stage/ |
| Allow | / |
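To see how a crawler would read the group above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `RULES` text is a reconstruction from the table, not the original file (the scan reports one invalid line that is not shown here), and the `is_allowed` helper and the `ExampleBot` agent name are hypothetical. Python's parser applies the first matching rule in file order, while RFC 9309 prefers the longest match; for this group both approaches should agree, assuming the table preserves file order, because every `Disallow` path precedes and is longer than `Allow: /`.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan table; the real file may differ in detail.
RULES = """\
User-agent: *
Disallow: /catalog/
Disallow: /backups/
Disallow: /email_templates/
Disallow: /Flash/
Disallow: /footer-test/
Disallow: /images/
Disallow: /includes/
Disallow: /newsletter/
Disallow: /proofs/
Disallow: /Templates/
Disallow: /test_zenstore1/
Disallow: /test_zenstore3/
Disallow: /zen-stage/
Allow: /
"""

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` may fetch `path` under the reconstructed rules."""
    return parser.can_fetch(agent, "https://catherineweitzman.com" + path)


if __name__ == "__main__":
    # Paths chosen for illustration only.
    for path in ("/about", "/catalog/item", "/Flash/intro.swf", "/flash/"):
        print(path, is_allowed(path))
```

Path matching is case-sensitive, so `/Flash/` is blocked while a lowercase `/flash/` falls through to `Allow: /`.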
Other Records
| Field | Value |
|---|---|
| sitemap | http://cdn.attracta.com/sitemap/1832981.xml.gz |
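The sitemap record is independent of any user-agent group. As a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available), it can be read back from a robots.txt line like this; the `SITEMAP_LINE` text is reconstructed from the table above rather than copied from the original file.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "Other Records" table; not the original file text.
SITEMAP_LINE = "Sitemap: http://cdn.attracta.com/sitemap/1832981.xml.gz"

sitemap_parser = RobotFileParser()
sitemap_parser.parse([SITEMAP_LINE])

# site_maps() returns a list of sitemap URLs, or None when none are declared.
sitemaps = sitemap_parser.site_maps()

if __name__ == "__main__":
    print(sitemaps)
```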
Warnings
- 1 invalid line.