crm.capient.de
robots.txt

Robots Exclusion Standard data for crm.capient.de

Resource Scan

Scan Details

Site Domain	crm.capient.de
Base Domain	capient.de
Scan Status	Ok
Last Scan	2025-09-11T19:48:16+00:00
Next Scan	2025-09-25T19:48:16+00:00

Last Scan

Scanned	2025-09-11T19:48:16+00:00
URL	https://crm.capient.de/robots.txt
Domain IPs	94.130.131.41
Response IP	94.130.131.41
Found	Yes
Hash	3e6ff632993737ca5799d3bac249f277f37f6ab036975f1166b37a932749c2e2
SimHash	8a4f66c461d2

Groups

teleport*

Rule	Path
Disallow	/

webwhacker

Rule	Path
Disallow	/

webdevil

Rule	Path
Disallow	/

webzip

Rule	Path
Disallow	/

net attache

Rule	Path
Disallow	/

sitesnagger

Rule	Path
Disallow	/

wx_mail/2.000

Rule	Path
Disallow	/

emailcollector

Rule	Path
Disallow	/

whowhere

Rule	Path
Disallow	/

roverbot

Rule	Path
Disallow	/

activeagent

Rule	Path
Disallow	/

emailsiphon

Rule	Path
Disallow	/

ia_archiver

Rule	Path
Disallow	/

websitewiki

Rule	Path
Disallow	/

Comments

Robot Exclusion File -- robots.txt
digital//m 2019
Keep Rover from grabbing Email addresses from our site
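
The groups listed above can be checked programmatically. The following is a minimal sketch, assuming the scanned file consists of one group per listed agent, each with a single "Disallow: /" rule. The report does not show the raw file, so the directive order, casing, and comment placement in the reconstruction are guesses. It uses Python's standard urllib.robotparser module. The helper names (build_robots_txt, make_parser, is_allowed) and the agent name examplebot are hypothetical and chosen for illustration only.

```python
from urllib.robotparser import RobotFileParser

# Agents listed under "Groups" in the scan report.
# Each group carries a single "Disallow: /" rule.
BLOCKED_AGENTS = [
    "teleport*",
    "webwhacker",
    "webdevil",
    "webzip",
    "net attache",
    "sitesnagger",
    "wx_mail/2.000",
    "emailcollector",
    "whowhere",
    "roverbot",
    "activeagent",
    "emailsiphon",
    "ia_archiver",
    "websitewiki",
]


def build_robots_txt(agents):
    """Rebuild an approximate robots.txt (assumed layout, not the original file)."""
    groups = [f"User-agent: {agent}\nDisallow: /" for agent in agents]
    # Blank lines separate groups, as is conventional in robots.txt files.
    return "\n\n".join(groups) + "\n"


def make_parser(robots_txt):
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


ROBOTS_TXT = build_robots_txt(BLOCKED_AGENTS)
PARSER = make_parser(ROBOTS_TXT)


def is_allowed(agent, url="https://crm.capient.de/"):
    """Return True if the given agent may fetch the URL under the rebuilt rules."""
    return PARSER.can_fetch(agent, url)


if __name__ == "__main__":
    # "examplebot" is a hypothetical agent that appears in no group.
    for agent in ("ia_archiver", "emailsiphon", "examplebot"):
        print(agent, is_allowed(agent))
```

Because the report shows no catch-all "*" group, agents that are not listed remain unrestricted. Note that urllib.robotparser compares agent names by simple substring matching, so entries such as teleport* and wx_mail/2.000 may not match the way a wildcard-aware or version-aware crawler would interpret them.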
