publico.pt
robots.txt

Robots Exclusion Standard data for publico.pt

Resource Scan

Scan Details

Site Domain publico.pt
Base Domain publico.pt
Scan Status Ok
Last Scan2024-05-17T14:11:11+00:00
Next Scan 2024-05-24T14:11:11+00:00

Last Scan

Scanned2024-05-17T14:11:11+00:00
URL https://publico.pt/robots.txt
Redirect https://www.publico.pt/robots.txt
Redirect Domain www.publico.pt
Redirect Base publico.pt
Domain IPs 13.225.4.28, 13.225.4.5, 13.225.4.60, 13.225.4.8
Redirect IPs 13.225.4.28, 13.225.4.5, 13.225.4.60, 13.225.4.8
Response IP 13.225.4.5
Found Yes
Hash 0717d1991e1282541bd14f3e7a009629b6708f3bd8af6a9655b3bc020ec0c80c
SimHash c9049060c733

Groups

googlebot-bard

Rule Path
Disallow /

openai-crawler

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

*

Rule Path
Allow *

Other Records

Field Value
sitemap https://www.publico.pt/sitemaps/news.xml
sitemap https://www.publico.pt/sitemaps/sitemapindex.xml