conjoint.ly
robots.txt

Robots Exclusion Standard data for conjoint.ly

Resource Scan

Scan Details

Site Domain conjoint.ly
Base Domain conjoint.ly
Scan Status Ok
Last Scan2024-10-22T01:40:16+00:00
Next Scan 2024-11-21T01:40:16+00:00

Last Scan

Scanned2024-10-22T01:40:16+00:00
URL https://conjoint.ly/robots.txt
Redirect https://conjointly.com/robots.txt
Redirect Domain conjointly.com
Redirect Base conjointly.com
Domain IPs 172.66.40.133, 172.66.43.123, 2606:4700:3108::ac42:2885, 2606:4700:3108::ac42:2b7b
Redirect IPs 108.156.133.15, 108.156.133.38, 108.156.133.69, 108.156.133.85
Response IP 108.156.133.15
Found Yes
Hash 03eaf01c61a140dd004100c626611285dfd8071c63a6eb0e50f0c4a68a08f25c
SimHash 6050195947b5

Groups

ai2bot

Rule Path
Disallow /

ai2bot-dolma

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

icc-crawler

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

youbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

img2dataset

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /booking-confirmed.html
Disallow /booking-confirmed/
Disallow /googled979b0cc4477a394.html
Disallow /googled979b0cc4477a394/
Disallow /analysis/

Other Records

Field Value
sitemap https://conjointly.com/sitemap.xml
sitemap https://conjointly.com/image-sitemap.xml

Comments

  • Disallow data scraping and usage of website content for AI model training or prompting.
  • Explicit opt-out from certain crawlers is not an invitation for others to train AI models on our content.
  • Data scraping and model training must be opt-in, not opt-out.
  • Demand consent, credit, and compensation.
  • CreateDontScrape
  • LLM training/data selling bots.

Warnings

  • `host` is not a known field.