capterra.com
robots.txt
Robots Exclusion Standard data for capterra.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | capterra.com |
Base Domain | capterra.com |
Scan Status | Ok |
Last Scan | 2024-06-02T10:01:39+00:00 |
Next Scan | 2024-06-09T10:01:39+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-02T10:01:39+00:00 |
URL | https://capterra.com/robots.txt |
Redirect | https://www.capterra.com/robots.txt |
Redirect Domain | www.capterra.com |
Redirect Base | capterra.com |
Domain IPs | 34.206.19.51, 44.213.52.154 |
Redirect IPs | 104.18.16.169, 104.18.17.169 |
Response IP | 104.18.16.169 |
Found | Yes |
Hash | 521a315209e491861510673e33c947c806993de3012018b18a84e9f64249f059 |
SimHash | 66da39fbce42 |
Groups
*
Rule | Path |
---|---|
Disallow | /*?* |
Disallow | /compare/*/* |
Disallow | /compare/*/*-vs-*-vs-* |
Allow | /compare/*/*-vs-* |
Allow | /*-software/*?page=* |
Disallow | /external_click |
Disallow | /external_slp_click |
Disallow | /external_click_sa |
Disallow | /external_click_ga |
Disallow | /sem-combo |
Disallow | /sem/ |
Disallow | /sem-compare/ |
Disallow | /search |
Disallow | *?preview |
Disallow | *?exp= |
Disallow | *?variant= |
Disallow | *sort_options%3D |
Disallow | /resources/preview/* |
Disallow | /resources/_next/*.json |
Disallow | /resources/_next/*.js |
Disallow | /*-software/*?*account_campaign_id=* |
Disallow | /auth/login |
Disallow | */rest/ |
Disallow | */fit-finder/ |
Disallow | */glossaryletter/ |
Disallow | */p/*/reviews/*/ |
Disallow | /workspace/ |
Disallow | /sem-services/ |
Disallow | /sem-compare-services/ |
Disallow | /sem-ppl/ |
Disallow | /ai-assistant/ |
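
Several of the rules above overlap (for example, the three `/compare/` entries), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309: `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, `RULES` is only a subset of the table, and real crawlers may resolve these rules differently.

```python
import re

# Subset of the "*" group above, copied from the table (not the full rule set).
RULES = [
    ("Disallow", "/*?*"),
    ("Disallow", "/compare/*/*"),
    ("Disallow", "/compare/*/*-vs-*-vs-*"),
    ("Allow", "/compare/*/*-vs-*"),
    ("Allow", "/*-software/*?page=*"),
    ("Disallow", "/search"),
    ("Disallow", "/workspace/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' is a wildcard and a trailing '$' anchors the end,
    as in RFC 9309. Everything else is matched literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is allowed by default.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as path prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "Allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    # The example paths below are made up for illustration.
    print(is_allowed("/compare/123/foo-vs-bar"))     # True
    print(is_allowed("/compare/123/a-vs-b-vs-c"))    # False
    print(is_allowed("/compare/123/foo"))            # False
    print(is_allowed("/crm-software/?page=2"))       # True
    print(is_allowed("/crm-software/?sort=x"))       # False
```

Under these assumptions, a two-way comparison page stays crawlable because `/compare/*/*-vs-*` is longer than `/compare/*/*`, while a three-way comparison is blocked by the still longer `/compare/*/*-vs-*-vs-*`. Likewise, paginated category pages escape the blanket `/*?*` query-string block through `/*-software/*?page=*`.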
Other Records
Field | Value |
---|---|
sitemap | https://www.capterra.com/sitemap.xml |