noticiasdeosascoeregiao.com.br
robots.txt

Robots Exclusion Standard data for noticiasdeosascoeregiao.com.br

Resource Scan

Scan Details

Site Domain noticiasdeosascoeregiao.com.br
Base Domain noticiasdeosascoeregiao.com.br
Scan Status Ok
Last Scan2024-10-01T08:25:16+00:00
Next Scan 2024-10-08T08:25:16+00:00

Last Scan

Scanned2024-10-01T08:25:16+00:00
URL https://noticiasdeosascoeregiao.com.br/robots.txt
Domain IPs 187.1.136.124, 2804:10:8015::136:124
Response IP 187.1.136.124
Found Yes
Hash 057debf1602d201a92a62fc5c19e88add6f0e1dcbf0b7021c4488a8f1bb2a08e
SimHash 7151760547d5

Groups

googlebot

Rule Path
Allow *

feedfetcher-google

Rule Path
Allow *

bingbot

Rule Path
Allow *

slurp

Rule Path
Allow *

facebot

Rule Path
Allow *

facebookexternalhit

Rule Path
Allow *

ia_archiver

Rule Path
Allow *

twitterbot

Rule Path
Allow *

*

Rule Path
Disallow /cgi-bin/
Disallow /Painel/
Disallow /arquivos/
Disallow /erros/
Disallow /includes/
Disallow /uploads/
Disallow /instalar/
Disallow /_UpdateIM/
Disallow /wusage
Disallow /vendor/