captivea.com
robots.txt

Robots Exclusion Standard data for captivea.com

Resource Scan

Scan Details

Site Domain captivea.com
Base Domain captivea.com
Scan Status Ok
Last Scan2025-12-17T06:31:22+00:00
Next Scan 2026-01-16T06:31:22+00:00

Last Scan

Scanned2025-12-17T06:31:22+00:00
URL https://captivea.com/robots.txt
Redirect https://www.captivea.com/robots.txt
Redirect Domain www.captivea.com
Redirect Base captivea.com
Domain IPs 172.66.40.121, 172.66.43.135, 2606:4700:3108::ac42:2879, 2606:4700:3108::ac42:2b87
Redirect IPs 172.66.40.121, 172.66.43.135, 2606:4700:3108::ac42:2879, 2606:4700:3108::ac42:2b87
Response IP 172.66.40.121
Found Yes
Hash c27af654d7bd12cbfb8f149f2b0a9afbf93d7f898ff3ecc223e709a624611b31
SimHash 34ad0911c5f3

Groups

*
*
ai2bot

Rule Path
Allow /

ai2bot-dolma

Rule Path
Allow /

amazonbot

Rule Path
Allow /

applebot

Rule Path
Allow /

applebot-extended

Rule Path
Allow /

bytespider

Rule Path
Allow /

ccbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

claude-web

Rule Path
Allow /

claudebot

Rule Path
Allow /

diffbot

Rule Path
Allow /

friendlycrawler

Rule Path
Allow /

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

googleother

Rule Path
Allow /

googleother-image

Rule Path
Allow /

googleother-video

Rule Path
Allow /

icc-crawler

Rule Path
Allow /

imagesiftbot

Rule Path
Allow /

meta-externalagent

Rule Path
Allow /

meta-externalfetcher

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

piplbot

Rule Path
Allow /

timpibot

Rule Path
Allow /

webzio-extended

Rule Path
Allow /

youbot

Rule Path
Allow /

anthropic-ai

Rule Path
Allow /

cohere-ai

Rule Path
Allow /

iaskspider/2.0

Rule Path
Allow /

img2dataset

Rule Path
Allow /

scrapy

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

*

Rule Path Comment
Allow /social_instagram/ Allow access to Instagram social pages
Disallow /web Admin and login section
Disallow /website/info Internal pages not useful for SEO
Disallow /web/login User login page
Disallow /web?db=* URLs with specific Odoo database parameters
Disallow /mail Email management pages
Disallow /calendar/ Calendar-related content
Disallow /page/ All pages containing "/page/"
Disallow /profile/ User profiles
Disallow /jobs/apply/ Job application forms
Disallow /case-studies/ Exclude if not optimized
Disallow /thank-you Exclude if not optimized
Disallow /static/ Static files (CSS, JS, etc.)
Disallow /portal/ Client portal, not relevant for SEO
Disallow /shop Block the shop and all subpages under /shop
Disallow /shop/ Explicitly block all subpages under /shop
Disallow /*?* Block URLs with query parameters (duplication)
Disallow /*%26* Block URLs with multiple parameters
Allow /sitemap.xml -
Allow /robots.txt -

*

Rule Path
Allow /social_instagram/

Other Records

Field Value
sitemap https://www.captivea.com/sitemap.xml
sitemap https://www.captivea.com/sitemap.xml

Comments

  • custom
  • Allow specific AI bots full access
  • Block unwanted bots entirely
  • General rules for all other bots
  • Allow important files