cmriindia.org
robots.txt

Robots Exclusion Standard data for cmriindia.org

Resource Scan

Scan Details

Site Domain cmriindia.org
Base Domain cmriindia.org
Scan Status Ok
Last Scan2024-10-08T06:20:40+00:00
Next Scan 2024-10-15T06:20:40+00:00

Last Scan

Scanned2024-10-08T06:20:40+00:00
URL https://cmriindia.org/robots.txt
Redirect https://www.cmriindia.org/robots.txt
Redirect Domain www.cmriindia.org
Redirect Base cmriindia.org
Domain IPs 104.21.8.230, 172.67.188.206, 2606:4700:3032::ac43:bcce, 2606:4700:3035::6815:8e6
Redirect IPs 104.21.8.230, 172.67.188.206, 2606:4700:3032::ac43:bcce, 2606:4700:3035::6815:8e6
Response IP 104.21.8.230
Found Yes
Hash 022b5e4f3dcbfc6fa926d8343aaa2c32c1cf6d5c444df220684339c8abc1ad0a
SimHash 79ec5c4c0ab3

Groups

*

Rule Path
Disallow /wp-login.php
Disallow /wp-register.php
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-content/cache/
Disallow /wp-content/backups/
Disallow */trackback/
Disallow /xmlrpc.php
Disallow /readme.html
Disallow /search/
Disallow *?replytocom
Disallow */comment-page-
Disallow /tag/

Other Records

Field Value
sitemap https://www.cmriindia.org/sitemap_index.xml