capterra.in
robots.txt
Robots Exclusion Standard data for capterra.in
Resource Scan
Scan Details
Site Domain | capterra.in |
Base Domain | capterra.in |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-08-07T04:01:20+00:00 |
Next Scan | 2025-11-05T04:01:20+00:00 |
Last Successful Scan
Scanned | 2023-12-24T00:12:15+00:00 |
URL | https://capterra.in/robots.txt |
Redirect | https://www.capterra.in/robots.txt |
Redirect Domain | www.capterra.in |
Redirect Base | capterra.in |
Domain IPs | 172.66.40.43, 172.66.43.213, 2606:4700:3108::ac42:282b, 2606:4700:3108::ac42:2bd5 |
Redirect IPs | 172.66.40.43, 172.66.43.213, 2606:4700:3108::ac42:282b, 2606:4700:3108::ac42:2bd5 |
Response IP | 172.66.43.213 |
Found | Yes |
Hash | cb283f670e1c1cda7a30f3f09251d12f39e2ff8f99552bf975b265dfbe38480e |
SimHash | a2b591a3ee61 |
Groups
User-agent: *
Rule | Path |
---|---|
Allow | /*?vsn=d$ |
Allow | /sitemap/*?page= |
Allow | /directory/*?page= |
Allow | /blog?page= |
Disallow | /*?* |
Disallow | /cdn-cgi/ |
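
The rules in this group can be evaluated with a short sketch. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and Allow wins a tie); some crawlers may resolve conflicts differently. The helper names `pattern_to_regex` and `is_allowed` are hypothetical and are not part of the scan data.

```python
import re

# Rules copied from the "*" group above, as (rule, path pattern) pairs.
RULES = [
    ("allow", "/*?vsn=d$"),
    ("allow", "/sitemap/*?page="),
    ("allow", "/directory/*?page="),
    ("allow", "/blog?page="),
    ("disallow", "/*?*"),
    ("disallow", "/cdn-cgi/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the URL. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt requires.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


if __name__ == "__main__":
    for sample in ["/blog?page=2", "/blog?utm=x", "/cdn-cgi/trace", "/app.js?vsn=d"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, most URLs carrying a query string are blocked by `/*?*`, while pagination URLs under `/blog`, `/sitemap/`, and `/directory/` and URLs ending in `?vsn=d` remain crawlable because their Allow patterns are longer.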
Comments