liberation.fr
robots.txt
Robots Exclusion Standard data for liberation.fr
Resource Scan
Scan Details
Site Domain | liberation.fr |
Base Domain | liberation.fr |
Scan Status | Ok |
Last Scan | 2024-11-14T19:05:00+00:00 |
Next Scan | 2024-11-21T19:05:00+00:00 |
Last Scan
Scanned | 2024-11-14T19:05:00+00:00 |
URL | https://liberation.fr/robots.txt |
Redirect | https://www.liberation.fr/robots.txt |
Redirect Domain | www.liberation.fr |
Redirect Base | liberation.fr |
Domain IPs | 99.86.91.29, 99.86.91.5, 99.86.91.7, 99.86.91.96 |
Redirect IPs | 23.46.230.130, 23.46.230.132, 2600:1413:b000:13::b857:c188, 2600:1413:b000:13::b857:c1a1 |
Response IP | 23.45.207.177 |
Found | Yes |
Hash | bc7cf9efd84c26bbbc454269605bd475f2e52faac9b37a3e026e215e768fc315 |
SimHash | e15c36056781 |
Groups
*
Rule | Path |
---|---|
Disallow | /recherche |
Disallow | /search |
Disallow | */link- |
Disallow | /evenements/ |
Disallow | /direct/ |
Disallow | /elections/ |
Disallow | */undefined?d= |
Disallow | */undefined/?d= |
Disallow | */pf/undefined?d= |
Disallow | /syndication/google/currents/diaporama/ |
Disallow | /apps/*.csv |
Disallow | /apps/*.tsv |
Disallow | /apps/*.mst |
Other Records
Field | Value |
---|---|
sitemap | https://statics.liberation.fr/datasource/elections/elections-results-sitemap.xml |
sitemap | https://www.liberation.fr/arc/outboundfeeds/sitemap_news.xml?outputType=xml |
sitemap | https://www.liberation.fr/arc/outboundfeeds/sitemap.xml?outputType=xml |
sitemap | https://www.liberation.fr/arc/outboundfeeds/sections-sitemap.xml?outputType=xml |
Comments