crcnews.com.br
robots.txt

Robots Exclusion Standard data for crcnews.com.br

Resource Scan

Scan Details

Site Domain crcnews.com.br
Base Domain crcnews.com.br
Scan Status Ok
Last Scan 2026-02-23T03:50:56+00:00
Next Scan 2026-03-02T03:50:56+00:00

Last Scan

Scanned 2026-02-23T03:50:56+00:00
URL https://crcnews.com.br/robots.txt
Domain IPs 108.167.151.68
Response IP 108.167.151.68
Found Yes
Hash 074e7c940712fbe3c473223bf7e3bb5cf9b0b03425290ae6801ac8e0b1bcb465
SimHash 4226c9c6e891

Groups

*

Rule Path Comment
Disallow /wp-admin/ -
Allow /wp-admin/admin-ajax.php -
Disallow /wp-login.php -
Disallow /wp-includes/ -
Disallow /wp-content/plugins/ -
Disallow /*/*?s=* -
Disallow /search/ -
Disallow *?s=* -
Disallow /*?s= -
Disallow /?filter_tamanhos= -
Disallow /?filter_numeracao= -
Disallow /?filter_cor -
Disallow /?filtering= -
Disallow /?filtering=1&filter_product_brands= -
Disallow /author/ block access to author pages
Disallow /404-error/ block access to 404 page
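
The rules above mix plain prefixes with `*` wildcards, and one Allow rule carves an exception out of a Disallow. The following is a minimal sketch of how a crawler might evaluate them. It assumes the longest-match convention described in RFC 9309, where the most specific matching pattern wins and Allow wins a tie. The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical and not part of the scan data, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-login.php"),
    ("disallow", "/wp-includes/"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/*/*?s=*"),
    ("disallow", "/search/"),
    ("disallow", "*?s=*"),
    ("disallow", "/*?s="),
    ("disallow", "/?filter_tamanhos="),
    ("disallow", "/?filter_numeracao="),
    ("disallow", "/?filter_cor"),
    ("disallow", "/?filtering="),
    ("disallow", "/?filtering=1&filter_product_brands="),
    ("disallow", "/author/"),
    ("disallow", "/404-error/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, starting at the path's beginning.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: the longest matching pattern wins, and Allow beats
    Disallow when two matching patterns have the same length.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path matched by no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for sample in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/?s=sapatos"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays fetchable because the Allow pattern is longer than `/wp-admin/`, while internal search URLs containing `?s=` are blocked by several overlapping rules. The sample paths are illustrative and were not taken from the site.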

Other Records

Field Value
sitemap https://crcnews.com.br/sitemap_index.xml

Comments

  • Sitemap