cheriesblog.canalblog.com
robots.txt
Robots Exclusion Standard data for cheriesblog.canalblog.com
Resource Scan
Scan Details
Site Domain | cheriesblog.canalblog.com |
Base Domain | canalblog.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-18T17:22:03+00:00 |
Next Scan | 2024-12-17T17:22:03+00:00 |
Last Successful Scan
Scanned | 2024-01-30T17:06:01+00:00 |
URL | https://cheriesblog.canalblog.com/robots.txt |
Redirect | http://cheriesblog.canalblog.com/robots.txt |
Domain IPs | 185.128.239.110, 185.128.239.111 |
Response IP | 185.128.239.111 |
Found | Yes |
Hash | 395ceb3c750c8e2c36f1bc4cbb94884e5c8d0fea4b21edae0f5c169e0f52480a |
SimHash | 630d5471c215 |
Groups
*
Rule | Path |
---|---|
Disallow | /cf/fe/remote/ffads.cfm |
*
Rule | Path |
---|---|
Disallow | /cf/fe/remote/ffads.cfm |
Other Records
Field | Value |
---|---|
sitemap | http://cheriesblog.canalblog.com/rss.xml |