cristau-de-hauguerne.net
robots.txt
Robots Exclusion Standard data for cristau-de-hauguerne.net
Resource Scan
Scan Details
Site Domain | cristau-de-hauguerne.net |
Base Domain | cristau-de-hauguerne.net |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-09-28T19:27:14+00:00 |
Next Scan | 2024-10-05T19:27:14+00:00 |
Last Successful Scan
Scanned | 2024-09-20T19:26:50+00:00 |
URL | https://cristau-de-hauguerne.net/robots.txt |
Domain IPs | 185.128.239.112 |
Response IP | 185.128.239.112 |
Found | Yes |
Hash | d4fd589640c1c3d27735d65c8c42ade1460d6d301f22dbf72ea9e2da2d2a8adc |
SimHash | 6208c057c731 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /contact |
Disallow | /mail/subscribe |
Disallow | /mail/valid-* |
Disallow | /api/* |
Other Records
Field | Value |
---|---|
sitemap | https://cristau-de-hauguerne.net/sitemap-news.xml |
sitemap | https://cristau-de-hauguerne.net/sitemap.xml |