larepublica.co
robots.txt

Robots Exclusion Standard data for larepublica.co

Resource Scan

Scan Details

Site Domain larepublica.co
Base Domain larepublica.co
Scan Status Ok
Last Scan2024-11-02T18:47:29+00:00
Next Scan 2024-11-09T18:47:29+00:00

Last Scan

Scanned2024-11-02T18:47:29+00:00
URL https://larepublica.co/robots.txt
Redirect https://www.larepublica.co/robots.txt
Redirect Domain www.larepublica.co
Redirect Base larepublica.co
Domain IPs 13.35.238.104, 13.35.238.107, 13.35.238.37, 13.35.238.73
Redirect IPs 151.101.1.91, 151.101.129.91, 151.101.193.91, 151.101.65.91, 2a04:4e42:200::347, 2a04:4e42:400::347, 2a04:4e42:600::347, 2a04:4e42::347
Response IP 199.232.45.91
Found Yes
Hash 4aa34855b6b4cba1327ca829a8de277b35dd5d804afa902d2be065001c77846c
SimHash 4945df506503

Groups

googlebot

Rule Path
Allow /

googlebot

Rule Path
Disallow /buscar
Disallow /vista-previa/

googlebot-news

Rule Path
Allow /

googlebot-news

Rule Path
Disallow /vista-previa/

*

Rule Path
Allow /

*

Rule Path
Disallow /vista-previa/
Disallow /pagos/
Disallow /buscar

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.larepublica.co/sitemapindex
sitemap https://www.larepublica.co/sitemapnews