cn.org.br
robots.txt

Robots Exclusion Standard data for cn.org.br

Resource Scan

Scan Details

Site Domain cn.org.br
Base Domain cn.org.br
Scan Status Ok
Last Scan2025-12-25T17:55:32+00:00
Next Scan 2026-01-24T17:55:32+00:00

Last Scan

Scanned2025-12-25T17:55:32+00:00
URL https://cn.org.br/robots.txt
Domain IPs 104.21.67.26, 172.67.211.147, 2606:4700:3030::6815:431a, 2606:4700:3031::ac43:d393
Response IP 104.21.67.26
Found Yes
Hash a4d0fe3a76a2e5a3598996dd2b0881378a887a8cddbaad7d51e1a5dc3d081f34
SimHash 4d39dd480591

Groups

*

Rule Path
Allow /portal/

*

Rule Path
Disallow /40anosbrasilia/
Disallow /50anos/
Disallow /app_ressuscitou/
Disallow /assets/
Disallow /cantos/
Disallow /cgi-local/
Disallow /cssHotsiteCarmen/
Disallow /dist/
Disallow /downloads/
Disallow /encontrofortaleza/
Disallow /headerSite/
Disallow /pedidos/
Disallow /prototipos/
Disallow /Ressuscitou_MP3/
Disallow /videos/
Disallow /informativo/
Disallow /wordpress-5.9-pt_BR/
Disallow /cn.org.br/portal/videoconvivencia2023/
Disallow /cn.org.br/portal/icone-kiko-2022-sao-pelagio/
Disallow /cn.org.br/portal/canto-anuncio-quaresma-22/
Disallow /cn.org.br/portal/videoconvivencia2022/
Disallow /cn.org.br/portal/video-anuncio-advento-21-22
Disallow /cn.org.br/portal/marca-pagina-abertura-causa-carmen-4-11/

googlebot

Rule Path
Disallow /*.pdf$

Other Records

Field Value
sitemap https://cn.org.br/portal/sitemap.xml
sitemap https://cn.org.br/sitemap.xml

Comments

  • robots.txt generated by cn.org.br