guiasaude.org
robots.txt

Robots Exclusion Standard data for guiasaude.org

Resource Scan

Scan Details

Site Domain guiasaude.org
Base Domain guiasaude.org
Scan Status Ok
Last Scan 2025-04-13T15:26:52+00:00
Next Scan 2025-04-20T15:26:52+00:00

Last Scan

Scanned 2025-04-13T15:26:52+00:00
URL https://guiasaude.org/robots.txt
Domain IPs 104.21.83.223, 172.67.182.97, 2606:4700:3033::ac43:b661, 2606:4700:3035::6815:53df
Response IP 104.21.83.223
Found Yes
Hash 799eb94b4ecd46cc2b61e4fdf9b0a883b5306515a4cf3dbdea5ade149597567b
SimHash ecf54d5d5df5

Groups

googlebot

Rule Path
Disallow /*.js$
Disallow /*.inc$
Disallow /*.css$
Disallow /*.wmv$
Disallow /*.cgi$
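
The googlebot rules above rely on two pattern characters: `*`, which matches any run of characters, and a trailing `$`, which anchors the pattern to the end of the URL. The following is a minimal sketch of how such patterns could be evaluated, not the matcher any particular crawler uses. The helper names and the sample paths are hypothetical and chosen only for illustration.

```python
import re

# Disallow patterns copied from the googlebot group in this report.
GOOGLEBOT_DISALLOW = ["/*.js$", "/*.inc$", "/*.css$", "/*.wmv$", "/*.cgi$"]


def robots_pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex.

    '*' becomes '.*'; a trailing '$' anchors the match at the end of the URL.
    Every other character is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then rejoin the pieces with a wildcard.
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any googlebot Disallow pattern matches the path."""
    # re.match anchors at the start of the path, as robots.txt rules require.
    return any(robots_pattern_to_regex(p).match(path) for p in GOOGLEBOT_DISALLOW)


if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the scanned site.
    for sample in ["/wp-includes/js/jquery.js", "/style.css?ver=1", "/index.html"]:
        print(sample, is_disallowed(sample))
```

Because of the `$` anchor, a path such as `/style.css?ver=1` is not matched by `/*.css$` in this sketch: the URL does not end in `.css` once the query string is included.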

mediapartners-google*

Rule Path
Disallow
Allow /*
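
The mediapartners-google group pairs an empty Disallow with `Allow /*`. Under the usual reading of the Robots Exclusion Protocol, an empty Disallow matches nothing, the longest matching pattern wins, and an Allow wins a tie, so every URL remains crawlable for that agent. The sketch below illustrates that evaluation order under those assumptions. The function names and sample paths are hypothetical, and real crawlers may differ in detail.

```python
import re

# Rules copied from the mediapartners-google group in this report.
MEDIAPARTNERS_RULES = [("disallow", ""), ("allow", "/*")]


def pattern_matches(pattern: str, path: str) -> bool:
    """Check one robots.txt pattern against a path ('*' wildcard, trailing '$' anchor)."""
    if pattern == "":
        return False  # an empty rule value matches no URL
    anchored = pattern.endswith("$")
    core = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(piece) for piece in core.split("*"))
    return re.match(regex + (r"\Z" if anchored else ""), path) is not None


def is_allowed(rules, path: str) -> bool:
    """Apply longest-match precedence; Allow wins ties; no match means allowed."""
    best = None  # (pattern length, rule is an allow)
    for kind, pattern in rules:
        if pattern_matches(pattern, path):
            candidate = (len(pattern), kind == "allow")
            # Tuple comparison: longer pattern first, then Allow over Disallow.
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical sample paths, not taken from the scanned site.
    print(is_allowed(MEDIAPARTNERS_RULES, "/any/page.html"))
    print(is_allowed([("disallow", "/private/")], "/private/notes"))
```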

googlebot-image

Rule Path
Allow /wp-content/uploads/

Other Records

Field Value
sitemap http://www.guiasaude.org/sitemap.xml
sitemap http://www.guiasaude.org/sitemap.xml.gz

Comments

  • remove the directories
  • remove scripts, CSS, and the like
  • allow AdSense on any URL
  • Sitemap

Warnings

  • 8 invalid lines.