toucheatoutblog.canalblog.com
robots.txt
Robots Exclusion Standard data for toucheatoutblog.canalblog.com
Resource Scan
Scan Details
| Field | Value | 
|---|---|
| Site Domain | toucheatoutblog.canalblog.com | 
| Base Domain | canalblog.com | 
| Scan Status | Ok | 
| Last Scan | 2025-10-01T13:10:40+00:00 | 
| Next Scan | 2025-10-31T13:10:40+00:00 | 
Last Scan
| Field | Value | 
|---|---|
| Scanned | 2025-10-01T13:10:40+00:00 | 
| URL | https://toucheatoutblog.canalblog.com/robots.txt | 
| Domain IPs | 185.128.239.110, 185.128.239.111 | 
| Response IP | 185.128.239.111 | 
| Found | Yes | 
| Hash | 2a23329c907c2f4a1329381afcca1a0f2e15bd564d00c7a2201c9053c72e9ed3 | 
| SimHash | 6b14d0554773 | 
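The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Under that assumption, a fetched copy of the file could be compared against the recorded value roughly as follows. The helper name `matches_recorded_hash` is hypothetical, and it is also an assumption that the digest was taken over the raw response body.

```python
import hashlib

# Value copied from the "Hash" row of the scan report.
RECORDED_HASH = "2a23329c907c2f4a1329381afcca1a0f2e15bd564d00c7a2201c9053c72e9ed3"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded_hash(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Hypothetical helper: True if the body hashes to the recorded value.

    Assumes the report hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

If the assumption holds, a mismatch would suggest the file changed since the last scan; if it does not hold, the comparison says nothing either way.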
Groups
User-agent: *
| Rule | Path | 
|---|---|
| Allow | / | 
| Disallow | /contact | 
| Disallow | /mail/subscribe | 
| Disallow | /mail/valid-* | 
| Disallow | /api/* | 
| Disallow | /search | 
| Disallow | /search/* | 
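How these rules combine depends on the crawler. The sketch below assumes the longest-match evaluation described in RFC 9309: the matching rule with the longest pattern wins, an Allow rule wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data.

```python
import re

# Rules copied from the "*" group in the table above.
RULES = [
    ("allow", "/"),
    ("disallow", "/contact"),
    ("disallow", "/mail/subscribe"),
    ("disallow", "/mail/valid-*"),
    ("disallow", "/api/*"),
    ("disallow", "/search"),
    ("disallow", "/search/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" for each "*" wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern decides; Allow wins when lengths tie."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len = len(pattern)
                best_allow = allow
    return best_allow
```

Under this reading, `Allow: /` only covers paths that no longer Disallow pattern matches, so `is_allowed("/search/foo")` is false while `is_allowed("/archives/2020")` is true. A parser that instead stops at the first matching rule would allow every path here, because `Allow: /` is listed first.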
Other Records
| Field | Value | 
|---|---|
| sitemap | https://toucheatoutblog.canalblog.com/sitemap-news.xml | 
| sitemap | https://toucheatoutblog.canalblog.com/sitemap.xml |
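For reference, the records above can be rendered back into robots.txt syntax. This is a minimal sketch: the report does not show the file's original line order, spacing, comments, or line endings, so the output is an approximation and would not be expected to reproduce the recorded hash. `GROUPS`, `SITEMAPS`, and `render_robots_txt` are illustrative names.

```python
# Data copied from the Groups and Other Records tables of the report.
GROUPS = {
    "*": [
        ("Allow", "/"),
        ("Disallow", "/contact"),
        ("Disallow", "/mail/subscribe"),
        ("Disallow", "/mail/valid-*"),
        ("Disallow", "/api/*"),
        ("Disallow", "/search"),
        ("Disallow", "/search/*"),
    ],
}
SITEMAPS = [
    "https://toucheatoutblog.canalblog.com/sitemap-news.xml",
    "https://toucheatoutblog.canalblog.com/sitemap.xml",
]


def render_robots_txt(groups, sitemaps):
    """Render parsed groups and sitemap records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line separates a group from what follows
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    print(render_robots_txt(GROUPS, SITEMAPS), end="")
```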