cinema.sapo.pt
robots.txt

Robots Exclusion Standard data for cinema.sapo.pt

Resource Scan

Scan Details

Site Domain cinema.sapo.pt
Base Domain sapo.pt
Scan Status Ok
Last Scan2024-11-16T14:32:10+00:00
Next Scan 2024-11-23T14:32:10+00:00

Last Scan

Scanned2024-11-16T14:32:10+00:00
URL https://cinema.sapo.pt/robots.txt
Domain IPs 213.13.145.216
Response IP 213.13.145.216
Found Yes
Hash 83ef47f0252a2ee1d570c1991eea27d4a1066baf86a221544b60db675d438a2e
SimHash 3151c8746fb0

Groups

*

Rule Path
Disallow /404
Disallow /500
Disallow /v1/app/
Disallow /pesquisar?q=*
Disallow /assets/static/
Allow *

Other Records

Field Value
sitemap https://mag.sapo.pt/sitemap.xml
sitemap https://mag.sapo.pt/trailersitemap.xml
sitemap https://mag.sapo.pt/sitemap-index.xml.gz