pathe.tn
robots.txt
Robots Exclusion Standard data for pathe.tn
Resource Scan
Scan Details
Site Domain | pathe.tn |
Base Domain | pathe.tn |
Scan Status | Ok |
Last Scan | 2024-11-07T03:20:29+00:00 |
Next Scan | 2024-11-21T03:20:29+00:00 |
Last Scan
Scanned | 2024-11-07T03:20:29+00:00 |
URL | https://pathe.tn/robots.txt |
Redirect | https://www.pathe.tn/robots.txt |
Redirect Domain | www.pathe.tn |
Redirect Base | pathe.tn |
Domain IPs | 217.70.184.55 |
Redirect IPs | 23.215.7.17, 23.215.7.5, 2600:1413:b000:1b::17d7:705, 2600:1413:b000:1b::17d7:711 |
Response IP | 23.32.29.17 |
Found | Yes |
Hash | 7e6e3f59221e5cf7816552fa1eb5c690d8e42cba97130e35ae83f23b9e3d9a76 |
SimHash | bc685f48a5b2 |
Groups
*
Rule | Path |
---|---|
Disallow | *utm_ |
Disallow | *actId |
Disallow | *mc_cid |
Disallow | *err%3D |
Disallow | *language%3D |
Disallow | *idfilm%3D |
Disallow | */reserver/* |
Disallow | */film/* |
Disallow | /en/*/*/video/ |
Disallow | */redirect/* |
Disallow | */l/* |
Disallow | */filters/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.pathe.fr/sitemaps/sitemap.xml |
Comments