negocios.pt
robots.txt

Robots Exclusion Standard data for negocios.pt

Resource Scan

Scan Details

Site Domain negocios.pt
Base Domain negocios.pt
Scan Status Ok
Last Scan2024-11-15T09:31:34+00:00
Next Scan 2024-11-22T09:31:34+00:00

Last Scan

Scanned2024-11-15T09:31:34+00:00
URL https://negocios.pt/robots.txt
Redirect https://www.jornaldenegocios.pt/robots.txt
Redirect Domain www.jornaldenegocios.pt
Redirect Base jornaldenegocios.pt
Domain IPs 195.23.36.47
Redirect IPs 88.157.217.147
Response IP 88.157.217.147
Found Yes
Hash 64e46a32325f274ce232ecb03d70eab2c7bfbc2024b7228df1d920e7ad4aeb79
SimHash 5910194dcdd3

Groups

openai-crawler

Rule Path
Disallow /

googlebot-bard

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

*

Rule Path
Disallow /Error/
Disallow /Views/
Disallow /Content
Disallow /embed
Disallow /fonts
Disallow /scripts
Disallow /comentarios
Disallow /ssosubscriptions/
Disallow /AjaxCalls/
Disallow /informacaofinanceira/
Disallow /Bundles/
Disallow /pesquisa

Other Records

Field Value
sitemap https://www.jornaldenegocios.pt/sitemap