anpec.org.br
robots.txt
Robots Exclusion Standard data for anpec.org.br
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | anpec.org.br |
| Base Domain | anpec.org.br |
| Scan Status | Ok |
| Last Scan | 2026-02-16T14:32:48+00:00 |
| Next Scan | 2026-03-18T14:32:48+00:00 |
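The two timestamps above imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative, and reading the gap as a fixed scan schedule is an assumption rather than something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2026-02-16T14:32:48+00:00")
next_scan = datetime.fromisoformat("2026-03-18T14:32:48+00:00")

# Gap between the recorded scan and the planned one.
interval = next_scan - last_scan
print(interval.days)
```

For these values the gap is a whole number of days, with no leftover seconds.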
Last Scan
| Field | Value |
|---|---|
| Scanned | 2026-02-16T14:32:48+00:00 |
| URL | https://anpec.org.br/robots.txt |
| Domain IPs | 104.21.1.174, 172.67.129.170, 2606:4700:3030::ac43:81aa, 2606:4700:3035::6815:1ae |
| Response IP | 172.67.129.170 |
| Found | Yes |
| Hash | 6f81a28c1e24529d07f85800be1795bc9dbf27e96434fa20461c44c08e496161 |
| SimHash | 7df46850e312 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /cgi-bin/ |
| Disallow | /exame2007/ |
| Disallow | /exame2008/ |
| Disallow | /exame2009/ |
| Disallow | /exame2010/ |
| Disallow | /imagens/ |
| Disallow | /portal/ |
| Disallow | /revista2/ |
| Disallow | **/*/files_NI/ |
| Disallow | **/*/files_I/ |
| Disallow | **/*/arquivos_identificados/ |
| Disallow | **/*/arquivos_nao_identificados/ |
| Disallow | **/*/folha_de_resposta/ |
| Disallow | **/*/comissao/ |
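To see how the rules above apply to a concrete URL path, here is a minimal sketch of a matcher. It assumes the wildcard semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and rules match from the start of the path). The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical, and real crawlers may treat patterns that do not begin with `/` differently.

```python
import re

# Disallow rules copied from the table for the "*" group.
DISALLOW = [
    "/cgi-bin/",
    "/exame2007/",
    "/exame2008/",
    "/exame2009/",
    "/exame2010/",
    "/imagens/",
    "/portal/",
    "/revista2/",
    "**/*/files_NI/",
    "**/*/files_I/",
    "**/*/arquivos_identificados/",
    "**/*/arquivos_nao_identificados/",
    "**/*/folha_de_resposta/",
    "**/*/comissao/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' means "any run of characters" and a trailing '$'
    anchors the end of the path; everything else is literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the start of `path`."""
    # The group has no Allow rules, so no precedence handling is needed.
    return any(pattern_to_regex(rule).match(path) for rule in DISALLOW)


if __name__ == "__main__":
    for sample in ["/cgi-bin/test", "/exame2011/", "/exame2015/files_NI/a.pdf"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, the plain-prefix rules block only the listed top-level directories, while the wildcard rules block the named subdirectories when they sit at least one level below the root.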