portaldaindustria.com.br
robots.txt
Robots Exclusion Standard data for portaldaindustria.com.br
Resource Scan
Scan Details
Site Domain | portaldaindustria.com.br |
Base Domain | portaldaindustria.com.br |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-28T10:43:33+00:00 |
Next Scan | 2024-10-12T10:43:33+00:00 |
Last Successful Scan
Scanned | 2024-08-21T09:15:57+00:00 |
URL | https://portaldaindustria.com.br/robots.txt |
Redirect | https://www.portaldaindustria.com.br/robots.txt |
Redirect Domain | www.portaldaindustria.com.br |
Redirect Base | portaldaindustria.com.br |
Domain IPs | 20.62.210.67 |
Redirect IPs | 20.62.210.67 |
Response IP | 20.62.210.67 |
Found | Yes |
Hash | 40c41249b35ad7593cd3966298f592c1edf63d5f7e0cd9a1c203413b9e161d31 |
SimHash | 215564ac59b3 |
Groups
*
Rule | Path |
---|---|
Disallow | /busca/* |
Disallow | /agenciacni/busca/* |
Disallow | /iel/busca/* |
Disallow | /portal/busca/* |
Disallow | */?edit%2F* |
Disallow | */admin/* |
Disallow | */login/* |
Disallow | */rss/* |
Disallow | /resultado_busca/* |
Disallow | */lista-temas/* |
Disallow | /inovatalentos/acessar/* |
Disallow | /inoavatalentos/submissao/* |
Disallow | */gerar-pdf/* |
Other Records
Field | Value |
---|---|
sitemap | http://www.portaldaindustria.com.br/sitemap.xml |