old.cnpgc.embrapa.br
robots.txt
Robots Exclusion Standard data for old.cnpgc.embrapa.br
Resource Scan
Scan Details
Site Domain | old.cnpgc.embrapa.br |
Base Domain | embrapa.br |
Scan Status | Ok |
Last Scan | 2025-07-02T16:35:27+00:00 |
Next Scan | 2025-08-01T16:35:27+00:00 |
Last Scan
Scanned | 2025-07-02T16:35:27+00:00 |
URL | https://old.cnpgc.embrapa.br/robots.txt |
Domain IPs | 200.129.254.60 |
Response IP | 200.129.254.60 |
Found | Yes |
Hash | b10f69bffe13d3f220b8f416ebb2cbee0d7ead2ca5650ae71a07813417324142 |
SimHash | cc4268513742 |
Groups
googlebot
Rule | Path |
---|---|
Disallow | /*? |
Disallow | /cgi-bin/ |
Disallow | /_notes/ |
Disallow | /atendimento/webalizer/ |
Disallow | /atendimento/webalizer_rautu/ |
Disallow | /css/ |
Disallow | /digits/ |
Disallow | /denied/ |
Disallow | /icons/ |
Disallow | /img/ |
Disallow | /imagens/ |
Disallow | /image/ |
Disallow | /images/ |
Disallow | /locale/ |
Disallow | /logos/ |
Disallow | /private/ |
Disallow | /proxy/ |
Disallow | /temp/ |
Disallow | /templates/ |
Disallow | /Templates/ |
Disallow | /tmp/ |
Disallow | /teste/ |
*
Rule | Path |
---|---|
Disallow | / |