pathe.fr
robots.txt
Robots Exclusion Standard data for pathe.fr
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pathe.fr |
Base Domain | pathe.fr |
Scan Status | Ok |
Last Scan | 2024-11-08T21:52:51+00:00 |
Next Scan | 2024-11-22T21:52:51+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-08T21:52:51+00:00 |
URL | https://pathe.fr/robots.txt |
Redirect | https://www.pathe.fr/robots.txt |
Redirect Domain | www.pathe.fr |
Redirect Base | pathe.fr |
Domain IPs | 217.70.184.55 |
Redirect IPs | 184.50.85.162, 184.50.85.164, 2600:1413:b000:1b::17d7:708, 2600:1413:b000:1b::17d7:71a |
Response IP | 184.50.85.164 |
Found | Yes |
Hash | 7e6e3f59221e5cf7816552fa1eb5c690d8e42cba97130e35ae83f23b9e3d9a76 |
SimHash | bc685f48a5b2 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | *utm_ |
Disallow | *actId |
Disallow | *mc_cid |
Disallow | *err%3D |
Disallow | *language%3D |
Disallow | *idfilm%3D |
Disallow | */reserver/* |
Disallow | */film/* |
Disallow | /en/*/*/video/ |
Disallow | */redirect/* |
Disallow | */l/* |
Disallow | */filters/* |
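
The wildcard rules above can be checked against a URL path with a small matcher. The following is a minimal sketch, assuming RFC 9309 pattern semantics (`*` matches any run of characters, a trailing `$` anchors the end of the URL, and everything else, including percent-encoded sequences such as `%3D`, is compared literally). The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical and not part of the scan data; a real crawler may normalize URLs differently.

```python
import re

# Disallow patterns copied from the "*" user-agent group in this report.
DISALLOW = [
    "*utm_", "*actId", "*mc_cid", "*err%3D", "*language%3D", "*idfilm%3D",
    "*/reserver/*", "*/film/*", "/en/*/*/video/",
    "*/redirect/*", "*/l/*", "*/filters/*",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the end of the URL; all other characters are literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path, patterns=DISALLOW):
    """Return True if the path (including any query string) matches a rule."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

For example, `is_disallowed("/film/123")` and `is_disallowed("/cinemas?utm_source=x")` both return `True`, while `is_disallowed("/cinemas")` returns `False`. The group contains no Allow rules, so the sketch skips the longest-match precedence step that would otherwise be needed to choose between competing Allow and Disallow rules.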
Other Records
Field | Value |
---|---|
sitemap | https://www.pathe.fr/sitemaps/sitemap.xml |