pathe.ma
robots.txt
Robots Exclusion Standard data for pathe.ma
Resource Scan
Scan Details
Site Domain | pathe.ma |
Base Domain | pathe.ma |
Scan Status | Ok |
Last Scan | 2024-11-08T16:51:41+00:00 |
Next Scan | 2024-11-22T16:51:41+00:00 |
Last Scan
Scanned | 2024-11-08T16:51:41+00:00 |
URL | https://pathe.ma/robots.txt |
Redirect | https://www.pathe.ma/robots.txt |
Redirect Domain | www.pathe.ma |
Redirect Base | pathe.ma |
Domain IPs | 217.70.184.55 |
Redirect IPs | 23.215.7.12, 23.215.7.30, 2600:1413:b000:1b::17d7:70c, 2600:1413:b000:1b::17d7:71e |
Response IP | 23.215.7.26 |
Found | Yes |
Hash | 7e6e3f59221e5cf7816552fa1eb5c690d8e42cba97130e35ae83f23b9e3d9a76 |
SimHash | bc685f48a5b2 |
Groups
*
Rule | Path |
---|---|
Disallow | *utm_ |
Disallow | *actId |
Disallow | *mc_cid |
Disallow | *err%3D |
Disallow | *language%3D |
Disallow | *idfilm%3D |
Disallow | */reserver/* |
Disallow | */film/* |
Disallow | /en/*/*/video/ |
Disallow | */redirect/* |
Disallow | */l/* |
Disallow | */filters/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.pathe.fr/sitemaps/sitemap.xml |
Comments