claconnect.com
robots.txt

Robots Exclusion Standard data for claconnect.com

Resource Scan

Scan Details

Site Domain claconnect.com
Base Domain claconnect.com
Scan Status Ok
Last Scan2025-09-25T18:44:11+00:00
Next Scan 2025-10-25T18:44:11+00:00

Last Scan

Scanned2025-09-25T18:44:11+00:00
URL https://claconnect.com/robots.txt
Redirect https://www.claconnect.com/robots.txt
Redirect Domain www.claconnect.com
Redirect Base claconnect.com
Domain IPs 23.100.43.208
Redirect IPs 104.18.32.2, 172.64.155.254
Response IP 172.64.155.254
Found Yes
Hash 89c7c6912934933995a7854a3b268db32ffbfc65da44bf96210dc190efc0492c
SimHash cc138c008dd3

Groups

hubspot page fetcher/1.0 http://www.hubspot.com web-crawlers@hubspot.com

Rule Path
Disallow

*

Rule Path
Disallow /ftn-emails/
Disallow /-/media/noin/*

Other Records

Field Value
sitemap https://www.claconnect.com/en/SiteInfo/Main/Sitemaps/Index.ashx

Comments

  • Hi! I'm DynamicRobot 1.0.0.0. You'll be happy to know I'm installed properly.
  • I'm serving up robots__www.claconnect.com.txt, because it matched the request's host "www.claconnect.com".
  • STANDARD

Warnings

  • 21 invalid lines.