pathe.be
robots.txt
Robots Exclusion Standard data for pathe.be
Resource Scan
Scan Details
Site Domain | pathe.be |
Base Domain | pathe.be |
Scan Status | Ok |
Last Scan | 2024-11-04T01:11:05+00:00 |
Next Scan | 2024-11-18T01:11:05+00:00 |
Last Scan
Scanned | 2024-11-04T01:11:05+00:00 |
URL | https://pathe.be/robots.txt |
Redirect | https://www.pathe.be/robots.txt |
Redirect Domain | www.pathe.be |
Redirect Base | pathe.be |
Domain IPs | 217.70.184.55 |
Redirect IPs | 96.17.180.43, 96.17.180.46 |
Response IP | 96.17.180.46 |
Found | Yes |
Hash | b58447a0602112aed98be1d2d38af79b2fc74cb88ce20c292e4d663394dbbaf0 |
SimHash | ac000b48adb6 |
Groups
*
Rule | Path |
---|---|
Disallow | *utm_ |
Disallow | *actId |
Disallow | *mc_cid |
Disallow | *err%3D |
Disallow | *language%3D |
Disallow | *idfilm%3D |
Disallow | */reserver/* |
Disallow | */film/* |
Disallow | /en/*/*/video/ |
Disallow | */redirect/* |
Other Records
Field | Value |
---|---|
sitemap | https://www.pathe.fr/sitemaps/sitemap.xml |
Comments