noticias.sapo.pt
robots.txt

Robots Exclusion Standard data for noticias.sapo.pt

Resource Scan

Scan Details

Site Domain noticias.sapo.pt
Base Domain sapo.pt
Scan Status Ok
Last Scan2024-11-11T23:27:51+00:00
Next Scan 2024-11-18T23:27:51+00:00

Last Scan

Scanned2024-11-11T23:27:51+00:00
URL https://noticias.sapo.pt/robots.txt
Redirect https://www.sapo.pt/robots.txt
Redirect Domain www.sapo.pt
Redirect Base sapo.pt
Domain IPs 213.13.145.216
Redirect IPs 213.13.146.142
Response IP 213.13.146.142
Found Yes
Hash 9c72428ef7060864ab3290cd38a944e1ad332862b68fa3d47aa81150ff666126
SimHash 310c8ce26fb0

Groups

*

Rule Path
Disallow /dev
Disallow /services
Disallow /newsletter
Disallow /prime/config
Disallow /prime/comprados
Disallow /404
Disallow /500
Disallow /v1/app/
Disallow /assets/static/
Disallow /pesquisa
Allow *

Other Records

Field Value
sitemap https://www.sapo.pt/categories_sitemap.xml
sitemap https://www.sapo.pt/sitemap/sitemap_articles.xml