old.cnpgc.embrapa.br
robots.txt

Robots Exclusion Standard data for old.cnpgc.embrapa.br

Resource Scan

Scan Details

Site Domain old.cnpgc.embrapa.br
Base Domain embrapa.br
Scan Status Ok
Last Scan2025-07-02T16:35:27+00:00
Next Scan 2025-08-01T16:35:27+00:00

Last Scan

Scanned2025-07-02T16:35:27+00:00
URL https://old.cnpgc.embrapa.br/robots.txt
Domain IPs 200.129.254.60
Response IP 200.129.254.60
Found Yes
Hash b10f69bffe13d3f220b8f416ebb2cbee0d7ead2ca5650ae71a07813417324142
SimHash cc4268513742

Groups

googlebot

Rule Path
Disallow /*?
Disallow /cgi-bin/
Disallow /_notes/
Disallow /atendimento/webalizer/
Disallow /atendimento/webalizer_rautu/
Disallow /css/
Disallow /digits/
Disallow /denied/
Disallow /icons/
Disallow /img/
Disallow /imagens/
Disallow /image/
Disallow /images/
Disallow /locale/
Disallow /logos/
Disallow /private/
Disallow /proxy/
Disallow /temp/
Disallow /templates/
Disallow /Templates/
Disallow /tmp/
Disallow /teste/

*

Rule Path
Disallow /