pathe.sn
robots.txt
Robots Exclusion Standard data for pathe.sn
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | pathe.sn |
Base Domain | pathe.sn |
Scan Status | Ok |
Last Scan | 2024-09-27T07:04:29+00:00 |
Next Scan | 2024-10-11T07:04:29+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-27T07:04:29+00:00 |
URL | https://pathe.sn/robots.txt |
Redirect | https://www.pathe.sn/robots.txt |
Redirect Domain | www.pathe.sn |
Redirect Base | pathe.sn |
Domain IPs | 217.70.184.55 |
Redirect IPs | 23.215.7.27, 23.215.7.29, 2600:1413:b000:1b::17d7:71b, 2600:1413:b000:1b::17d7:71d |
Response IP | 23.32.29.107 |
Found | Yes |
Hash | b58447a0602112aed98be1d2d38af79b2fc74cb88ce20c292e4d663394dbbaf0 |
SimHash | ac000b48adb6 |
Groups
*
Rule | Path |
---|---|
Disallow | *utm_ |
Disallow | *actId |
Disallow | *mc_cid |
Disallow | *err%3D |
Disallow | *language%3D |
Disallow | *idfilm%3D |
Disallow | */reserver/* |
Disallow | */film/* |
Disallow | /en/*/*/video/ |
Disallow | */redirect/* |
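
To make the wildcard rules above easier to read, here is a minimal Python sketch of how a crawler might test a URL path against them. It assumes the common robots.txt wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end, patterns match from the start of the path) and compares raw strings without any percent-encoding normalization. The function names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical, chosen for illustration only. Because the group contains no Allow rules, the sketch does not implement longest-match precedence.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "*utm_", "*actId", "*mc_cid", "*err%3D", "*language%3D",
    "*idfilm%3D", "*/reserver/*", "*/film/*", "/en/*/*/video/",
    "*/redirect/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; every other character is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of path."""
    # re.match anchors at the beginning of the string, as prefix rules require.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical paths, not taken from the scanned site.
    for sample in ["/film/12345/some-title", "/cinemas?utm_source=x", "/cinemas/dakar"]:
        print(sample, is_disallowed(sample))
```

Note that a leading `*` makes a rule match anywhere in the path or query string, so `*utm_` blocks any URL carrying a `utm_` tracking parameter, while `/en/*/*/video/` only applies to paths that begin with `/en/`.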
Other Records
Field | Value |
---|---|
sitemap | https://www.pathe.fr/sitemaps/sitemap.xml |