incor.usp.br
robots.txt

Robots Exclusion Standard data for incor.usp.br

Resource Scan

Scan Details

Site Domain incor.usp.br
Base Domain usp.br
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2025-03-20T07:15:47+00:00
Next Scan 2025-06-18T07:15:47+00:00

Last Successful Scan

Scanned2024-01-31T19:42:31+00:00
URL https://incor.usp.br/robots.txt
Domain IPs 200.9.95.2
Response IP 200.9.95.2
Found Yes
Hash 4d9b0ab583fc7f20e683d352ebb9abe0a985aa41ea1edfad606a8ac9be259516
SimHash 7544c0748f62

Groups

*

Rule Path
Disallow /manual/
Disallow /doc/
Disallow /gif/
Disallow /intranet/
Disallow /intranetfz/
Disallow /intranet.joomla/
Disallow /phpMyAdmin/
Disallow /downloads/
Disallow /desenv/
Disallow /fundacao_ftp/

susedig

Rule Path
Disallow

stress-agent

Rule Path
Disallow /

Comments

  • exclude help system from robots
  • but allow htdig to index our doc-tree
  • disallow stress test