novaroma.edu.br
robots.txt

Robots Exclusion Standard data for novaroma.edu.br

Resource Scan

Scan Details

Site Domain novaroma.edu.br
Base Domain novaroma.edu.br
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2024-09-17T09:26:35+00:00
Next Scan 2024-10-01T09:26:35+00:00

Last Successful Scan

Scanned 2024-09-02T03:34:31+00:00
URL http://novaroma.edu.br/robots.txt
Domain IPs 142.44.199.112
Response IP 142.44.199.112
Found Yes
Hash 52c4f07eb60d36e63364a97b4fb972b5def6fe077d88fabd00884c41b977e993
SimHash 850f0c108e56

Groups

*

Rule Path
Disallow

*

Rule Path
Disallow /admin/

*

Rule Path
Disallow user/login/
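
As a rough illustration of how a crawler might evaluate the three groups above, the sketch below merges them into a single rule list for the `*` user agent and applies plain longest-prefix matching. The merging step, the `RULES` list, and the `is_allowed` helper are assumptions made for this example; they are not part of the scan output, and the sketch ignores wildcards and percent-encoding. Real crawlers may treat the `user/login/` rule differently, since its path has no leading slash.

```python
# Rules as recorded in the scan, merged into one group for "*".
# Merging is an assumption of this sketch, not something the scan reports.
RULES = [
    ("disallow", ""),             # empty value: nothing is disallowed
    ("disallow", "/admin/"),
    ("disallow", "user/login/"),  # no leading slash, exactly as recorded
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under longest-prefix matching."""
    best_len = -1
    allowed = True  # with no matching rule, access is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # an empty rule value matches no path
        if not path.startswith(pattern):
            continue
        # Prefer the longest matching pattern; on a tie, "allow" wins.
        longer = len(pattern) > best_len
        tie_allow = len(pattern) == best_len and kind == "allow"
        if longer or tie_allow:
            best_len = len(pattern)
            allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/", "/admin/settings", "/user/login/"):
        print(p, is_allowed(p))
```

Under these assumptions, `/admin/settings` is blocked, while `/user/login/` is still allowed: URL paths begin with `/`, so a rule path written as `user/login/` never matches as a prefix.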

Other Records

Field Value
sitemap https://curtlink.com/sitemap.xml

Comments

  • This is the robots.txt file for the CurtLink site
  • Allow all bots to access the entire site
  • Block all bots from accessing the administration page
  • Block all bots from accessing the login page
  • Site sitemap