www.infoteca.cnptia.embrapa.br
robots.txt
Robots Exclusion Standard data for www.infoteca.cnptia.embrapa.br
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | www.infoteca.cnptia.embrapa.br |
Base Domain | embrapa.br |
Scan Status | Ok |
Last Scan | 2024-11-03T12:37:22+00:00 |
Next Scan | 2024-12-03T12:37:22+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-03T12:37:22+00:00 |
URL | https://www.infoteca.cnptia.embrapa.br/robots.txt |
Domain IPs | 200.0.70.2, 2801:80:1400:128:8::4 |
Response IP | 200.0.70.2 |
Found | Yes |
Hash | b0ccd9448f980ea66924c48d9f4fbd4ef014f882c07e611b0b78eeeb4d9fd533 |
SimHash | a4944d15c5b5 |
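The Hash value is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not state the algorithm or which bytes were hashed, so the sketch below only assumes SHA-256 over the raw response body; the helper name `sha256_hex` and the sample body are hypothetical.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex.

    Assumption: the scanner hashes the raw bytes of robots.txt.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical body, not the real file served by the site.
sample_body = b"User-agent: *\nDisallow: /login\n"
digest = sha256_hex(sample_body)

# A SHA-256 hex digest always has 64 characters, like the Hash field above.
print(len(digest))
```

Comparing a digest computed this way against the stored Hash would show whether the file changed between scans, provided the assumption about the algorithm holds.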
Groups
*
Rule | Path |
---|---|
Disallow | /discover |
Disallow | /simple-search |
Disallow | /retrieve |
Disallow | /browse |
Disallow | /statistics |
Disallow | /contact |
Disallow | /feedback |
Disallow | /feed |
Disallow | /forgot |
Disallow | /login |
Disallow | /register |
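To see how a crawler might apply the group above, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is reconstructed from the table, not copied from the live file, and the user agent `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups table; the live file may differ in
# ordering, whitespace, or comments.
ROBOTS_TXT = """\
User-agent: *
Disallow: /discover
Disallow: /simple-search
Disallow: /retrieve
Disallow: /browse
Disallow: /statistics
Disallow: /contact
Disallow: /feedback
Disallow: /feed
Disallow: /forgot
Disallow: /login
Disallow: /register
"""

BASE = "https://www.infoteca.cnptia.embrapa.br"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Check whether `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


# Hypothetical paths chosen only to exercise the rules.
for path in ("/", "/login", "/browse?type=author", "/some/item"):
    print(path, allowed(path))
```

Rules match by path prefix, so `/browse?type=author` is blocked by `Disallow: /browse`, while paths that do not start with any listed prefix stay crawlable.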
Other Records
Field | Value |
---|---|
sitemap | http://www.infoteca.cnptia.embrapa.br/sitemap |
sitemap | http://www.infoteca.cnptia.embrapa.br/htmlmap |
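The sitemap records can be read with the same standard-library parser. This is a sketch that assumes the records appear in the file as ordinary `Sitemap:` lines and that Python 3.8 or later is available for `RobotFileParser.site_maps()`. Note that both URLs use `http`, while the robots.txt itself was fetched over `https`.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Other Records table; exact casing and position
# of these lines in the live file are assumptions.
SITEMAP_LINES = [
    "Sitemap: http://www.infoteca.cnptia.embrapa.br/sitemap",
    "Sitemap: http://www.infoteca.cnptia.embrapa.br/htmlmap",
]

parser = RobotFileParser()
parser.parse(SITEMAP_LINES)

# site_maps() returns a list of URLs, or None when no records exist.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```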