cwi.com.br
robots.txt

Robots Exclusion Standard data for cwi.com.br

Resource Scan

Scan Details

Site Domain cwi.com.br
Base Domain cwi.com.br
Scan Status Ok
Last Scan2025-09-17T13:18:26+00:00
Next Scan 2025-10-17T13:18:26+00:00

Last Scan

Scanned2025-09-17T13:18:26+00:00
URL https://cwi.com.br/robots.txt
Domain IPs 104.26.10.212, 104.26.11.212, 172.67.68.213, 2606:4700:20::681a:ad4, 2606:4700:20::681a:bd4, 2606:4700:20::ac43:44d5
Response IP 172.67.68.213
Found Yes
Hash ba20c4ec806cecccc4941a2b5c560c8d3d76e965159239dd738138f7612bf00a
SimHash 7d26d22202f6

Groups

*

Rule Path Comment
Disallow /login -
Disallow /esqueci-a-senha -
Disallow /cadastro -
Disallow /candidatura-confirmada -
Disallow /editar-perfil -
Disallow /newsletter -
Disallow /perfil -
Disallow /trocar-senha -
Disallow /wp-admin/ -
Disallow /technology/ -
Disallow /banner/ -
Disallow /home-banner/ -
Disallow /home_highlight/ -
Disallow /partner/ -
Disallow /client/ -
Disallow /qr-code -
Disallow /employee_review/ -
Disallow /agradecimento -
Allow / -
Allow /sobre -
Allow /talentos/formacao/ crescer
Allow /talentos/formacao/ lets-code
Allow /talentos/formacao/ reset
Allow /talentos/oportunidades -
Allow /blog -
Allow /cases -
Allow /verticais -
Allow /contato -
Allow /politica-de-privacidade -
Allow /wp-admin/admin-ajax.php -

Other Records

Field Value
sitemap https://cwi.com.br/sitemap_index.xml