liberation.com
robots.txt
Robots Exclusion Standard data for liberation.com
Resource Scan
Scan Details
Site Domain | liberation.com |
Base Domain | liberation.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-05-10T15:14:54+00:00 |
Next Scan | 2024-07-09T15:14:54+00:00 |
Last Successful Scan
Scanned | 2024-04-10T14:56:47+00:00 |
URL | https://liberation.com/robots.txt |
Redirect | https://www.liberation.fr/robots.txt |
Redirect Domain | www.liberation.fr |
Redirect Base | liberation.fr |
Domain IPs | 143.204.231.124, 143.204.231.59, 143.204.231.73, 143.204.231.84 |
Redirect IPs | 2600:1413:b000:14::b857:c149, 2600:1413:b000:14::b857:c154, 72.247.127.210, 72.247.127.234 |
Response IP | 42.99.140.201 |
Found | Yes |
Hash | 730f7052572e61e7f4b54576d25181327a26d622a6413fb509f7457dc29e172e |
SimHash | e15c36252783 |
Groups
*
Rule | Path |
---|---|
Disallow | /recherche |
Disallow | /search |
Disallow | */link- |
Disallow | /evenements/ |
Disallow | /direct/ |
Disallow | /elections/ |
Disallow | */undefined?d= |
Disallow | */undefined/?d= |
Disallow | */pf/undefined?d= |
Disallow | /syndication/google/currents/diaporama/ |
Disallow | /apps/*.csv |
Disallow | /apps/*.tsv |
Disallow | /apps/*.mst |
Other Records
Field | Value |
---|---|
sitemap | https://www.liberation.fr/arc/outboundfeeds/sitemap_news.xml?outputType=xml |
sitemap | https://www.liberation.fr/arc/outboundfeeds/sitemap.xml?outputType=xml |
sitemap | https://www.liberation.fr/arc/outboundfeeds/sections-sitemap.xml?outputType=xml |
Comments