up.edu.br
robots.txt
Robots Exclusion Standard data for up.edu.br
Resource Scan
Scan Details
Site Domain | up.edu.br |
Base Domain | up.edu.br |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-03T18:19:43+00:00 |
Next Scan | 2025-01-01T18:19:43+00:00 |
Last Successful Scan
Scanned | 2024-05-14T18:12:19+00:00 |
URL | https://up.edu.br/robots.txt |
Redirect | https://www.up.edu.br//robots.txt |
Redirect Domain | www.up.edu.br |
Redirect Base | up.edu.br |
Domain IPs | 2600:1413:b000:1e::17d1:2e5c, 2600:1413:b000:1e::17d1:2e61, 96.17.72.65 |
Redirect IPs | 23.209.46.71, 23.209.46.92, 2600:1413:b000:1e::17d1:2e47, 2600:1413:b000:1e::17d1:2e5c |
Response IP | 23.202.33.161 |
Found | Yes |
Hash | 0fd42e81835d33a3e635ff2bd81678785ef4aaabea060b8d1cc8a14e8b70c296 |
SimHash | 0954d1355313 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /search/* |
Disallow | /generica |
Disallow | /pagina-nao-encontrada |
Disallow | /teste-cpa |
Disallow | /pdp-extensao |
Disallow | /pdp-pos-graduacao |
Disallow | /sala-de-imprensa/detalhe-release |
Disallow | /pdp |
Disallow | /dados-gerais |
Disallow | /modulos |
Disallow | /robo |
Other Records
Field | Value |
---|---|
sitemap | https://www.up.edu.br/sitemap_index.xml |