cmarix.com
robots.txt
Robots Exclusion Standard data for cmarix.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | cmarix.com |
Base Domain | cmarix.com |
Scan Status | Ok |
Last Scan | 2024-10-17T19:49:03+00:00 |
Next Scan | 2024-11-16T19:49:03+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-17T19:49:03+00:00 |
URL | https://cmarix.com/robots.txt |
Redirect | https://www.cmarix.com/robots.txt |
Redirect Domain | www.cmarix.com |
Redirect Base | cmarix.com |
Domain IPs | 50.18.112.111 |
Redirect IPs | 13.224.163.122, 13.224.163.20, 13.224.163.64, 13.224.163.9 |
Response IP | 13.227.254.111 |
Found | Yes |
Hash | f634f401dd5fe1701e26b9a13ba82fd592357ee35d7c7f14f26e4e8e328a7d44 |
SimHash | 800548840f1b |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | *.js |
Allow | *.css |
Disallow | /wp-admin/ |
Disallow | /thank-you.html |
Disallow | /*?s= |
Disallow | /404.html |
Disallow | /blog/tag/ |
Disallow | /blog/author/atman/* |
Disallow | /blog/page/* |
Disallow | /blog/author/jeegnasa-mudsa/* |
Disallow | /blog/author/sunny-patel/* |
Disallow | /blog/amp/* |
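
To see how the rules in this group interact, the following is a minimal sketch of a path matcher, assuming the matching behaviour described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan data. Individual crawlers may interpret these rules differently.

```python
import re

# Rules copied from the "*" group in the table above, in listed order.
RULES = [
    ("allow", "*.js"),
    ("allow", "*.css"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/thank-you.html"),
    ("disallow", "/*?s="),
    ("disallow", "/404.html"),
    ("disallow", "/blog/tag/"),
    ("disallow", "/blog/author/atman/*"),
    ("disallow", "/blog/page/*"),
    ("disallow", "/blog/author/jeegnasa-mudsa/*"),
    ("disallow", "/blog/author/sunny-patel/*"),
    ("disallow", "/blog/amp/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end,
    as in RFC 9309. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


for sample in ["/assets/app.js", "/wp-admin/load.js", "/blog/page/2", "/?s=foo", "/blog/"]:
    print(sample, is_allowed(sample))
```

Under these assumptions, `/wp-admin/load.js` is still blocked, because `/wp-admin/` is a longer pattern than `*.js` and therefore takes precedence over the Allow rule.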
Other Records
Field | Value |
---|---|
sitemap | https://www.cmarix.com/sitemap.xml |
sitemap | https://www.cmarix.com/blog/sitemap_index.xml |
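
The group and sitemap records above can be turned back into robots.txt syntax. The sketch below is an approximation only: `GROUPS`, `SITEMAPS`, and `render_robots_txt` are hypothetical names, and the original file's comments, blank lines, and letter casing are not recoverable from this report, so the output is not expected to reproduce the Hash value listed in the scan.

```python
# Records transcribed from the Groups and Other Records tables above.
GROUPS = {
    "*": [
        ("Allow", "*.js"),
        ("Allow", "*.css"),
        ("Disallow", "/wp-admin/"),
        ("Disallow", "/thank-you.html"),
        ("Disallow", "/*?s="),
        ("Disallow", "/404.html"),
        ("Disallow", "/blog/tag/"),
        ("Disallow", "/blog/author/atman/*"),
        ("Disallow", "/blog/page/*"),
        ("Disallow", "/blog/author/jeegnasa-mudsa/*"),
        ("Disallow", "/blog/author/sunny-patel/*"),
        ("Disallow", "/blog/amp/*"),
    ],
}

SITEMAPS = [
    "https://www.cmarix.com/sitemap.xml",
    "https://www.cmarix.com/blog/sitemap_index.xml",
]


def render_robots_txt(groups, sitemaps):
    """Render parsed groups and sitemap URLs as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


print(render_robots_txt(GROUPS, SITEMAPS))
```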