capterra.com
robots.txt

Robots Exclusion Standard data for capterra.com

Resource Scan

Scan Details

Site Domain	capterra.com
Base Domain	capterra.com
Scan Status	Ok
Last Scan	2024-11-04T16:18:12+00:00
Next Scan	2024-11-18T16:18:12+00:00

Last Scan

Scanned	2024-11-04T16:18:12+00:00
URL	https://capterra.com/robots.txt
Redirect	https://www.capterra.com/robots.txt
Redirect Domain	www.capterra.com
Redirect Base	capterra.com
Domain IPs	35.175.90.160, 50.16.210.22
Redirect IPs	104.18.40.158, 172.64.147.98
Response IP	104.18.40.158
Found	Yes
Hash	d7aff2dfaa5a68f2ee3162623cf387e3a90a0b26dda9036a65634dca8b25bda5
SimHash	63dab9fbce42

Groups

*

Rule	Path
Disallow	/*?*
Disallow	/compare/*/*-vs-*-vs-*
Allow	/compare/*/*-vs-*
Allow	/*-software/*?page=*
Allow	*/_next/image/?url=*
Disallow	/external_click
Disallow	/external_slp_click
Disallow	/external_click_sa
Disallow	/external_click_ga
Disallow	/sem-combo
Disallow	/sem/
Disallow	/sem-b/
Disallow	/sem-compare/
Disallow	/sem-compare-b/
Disallow	/search
Disallow	*?preview
Disallow	*?exp=
Disallow	*?variant=
Disallow	*sort_options%3D
Disallow	/resources/preview/*
Disallow	/resources/_next/*.json
Disallow	/resources/_next/*.js
Disallow	/*-software/*?*account_campaign_id=*
Disallow	/auth/login
Disallow	*/rest/
Disallow	*/fit-finder/
Disallow	*/glossaryletter/
Disallow	*/p/*/reviews/*/
Disallow	/workspace/
Disallow	/sem-services/
Disallow	/sem-compare-services/
Disallow	/sem-ppl/
Disallow	/ai-assistant/
Disallow	/discover/solutions/
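
The wildcard rules in this group overlap, so a longer, more specific pattern can override a shorter one. The sketch below is a minimal illustration, assuming the longest-match precedence described in RFC 9309, with Allow winning ties. The names RULES, pattern_to_regex, and is_allowed are hypothetical, only a handful of the rules above are included, and percent-encoding normalization is omitted. Individual crawlers may resolve conflicts differently.

```python
import re

# A few rules copied from the "*" group above (not the full list).
RULES = [
    ("Disallow", "/*?*"),
    ("Disallow", "/compare/*/*-vs-*-vs-*"),
    ("Allow", "/compare/*/*-vs-*"),
    ("Allow", "/*-software/*?page=*"),
    ("Disallow", "/search"),
    ("Disallow", "/workspace/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if path may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "Allow"
            # The longer pattern wins; on a tie, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Hypothetical example paths, chosen only to exercise the rules.
print(is_allowed("/compare/1/a-vs-b"))       # True: two-way comparison
print(is_allowed("/compare/1/a-vs-b-vs-c"))  # False: three-way comparison
print(is_allowed("/crm-software/?page=2"))   # True: Allow outranks /*?*
print(is_allowed("/search?q=crm"))           # False
```

The three-way comparison path is blocked because its Disallow pattern is longer than the two-way Allow pattern, while the paginated listing is permitted because its Allow pattern is longer than the blanket query-string Disallow.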

adidxbot

No rules defined. All paths allowed.

Other Records

Field	Value
crawl-delay	10

ubicrawler

Rule	Path
Disallow	/

bubing

Rule	Path
Disallow	/

doc

Rule	Path
Disallow	/

zao

Rule	Path
Disallow	/

sitecheck.internetseer.com

Rule	Path
Disallow	/

zealbot

Rule	Path
Disallow	/

msiecrawler

Rule	Path
Disallow	/

sitesnagger

Rule	Path
Disallow	/

webstripper

Rule	Path
Disallow	/

webcopier

Rule	Path
Disallow	/

fetch

Rule	Path
Disallow	/

offline explorer

Rule	Path
Disallow	/

teleport

Rule	Path
Disallow	/

teleportpro

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

linko

Rule	Path
Disallow	/

httrack

Rule	Path
Disallow	/

microsoft.url.control

Rule	Path
Disallow	/

xenu

Rule	Path
Disallow	/

larbin

Rule	Path
Disallow	/

libwww

Rule	Path
Disallow	/

zyborg

Rule	Path
Disallow	/

download ninja

Rule	Path
Disallow	/

wget

Rule	Path
Disallow	/

grub-client

Rule	Path
Disallow	/

k2spider

Rule	Path
Disallow	/

npbot

Rule	Path
Disallow	/

webreaper

Rule	Path
Disallow	/

psbot

Rule	Path
Disallow	/

exabot

Rule	Path
Disallow	/

speedy

Rule	Path
Disallow	/

dotbot

Rule	Path
Disallow	/

bloglines/3.1

Rule	Path
Disallow	/

jyxobot/1

Rule	Path
Disallow	/

cityreview

Rule	Path
Disallow	/

crazywebcrawler-spider

Rule	Path
Disallow	/

domain re-animator bot

Rule	Path
Disallow	/

semrushbot-sa

Rule	Path
Disallow	/

vegi

Rule	Path
Disallow	/

rogerbot

Rule	Path
Disallow	/

ntentbot

Rule	Path
Disallow	/

brandverity

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://www.capterra.com/sitemaps/sitemap.xml
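
The per-agent groups, the crawl-delay record, and the sitemap record can be read back with Python's standard urllib.robotparser module. The sketch below is illustrative only: ROBOTS_TXT is a hand-written fragment mirroring three records from this scan, not the full file, and site_maps() assumes Python 3.8 or later. The standard-library parser does not interpret `*` wildcards inside paths, so the wildcard rules of the `*` group are left out here.

```python
from urllib.robotparser import RobotFileParser

# Hand-written fragment mirroring three records from the scan above;
# the real file contains many more groups.
ROBOTS_TXT = """\
User-agent: adidxbot
Crawl-delay: 10

User-agent: wget
Disallow: /

Sitemap: https://www.capterra.com/sitemaps/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse from memory, no network access

# adidxbot has no path rules, only a crawl delay of 10 seconds.
delay = parser.crawl_delay("adidxbot")
adidx_ok = parser.can_fetch("adidxbot", "https://www.capterra.com/")

# wget is blocked from the whole site by "Disallow: /".
wget_ok = parser.can_fetch("wget", "https://www.capterra.com/")

# Sitemap records apply to the file as a whole, not to a single group.
sitemaps = parser.site_maps()

print(delay, adidx_ok, wget_ok, sitemaps)
```

Parsing from a string keeps the example offline; calling set_url() and read() on the parser would fetch a live file instead.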

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
