intalio.com
robots.txt

Robots Exclusion Standard data for intalio.com

Resource Scan

Scan Details

Site Domain intalio.com
Base Domain intalio.com
Scan Status Ok
Last Scan2025-11-09T03:36:37+00:00
Next Scan 2025-12-09T03:36:37+00:00

Last Scan

Scanned2025-11-09T03:36:37+00:00
URL https://intalio.com/robots.txt
Domain IPs 160.153.0.194
Response IP 160.153.0.194
Found Yes
Hash aba2a3e2130a30178bd38a5f0638d15c4fb961e9a4596d2ca81e6c2eb89d63f4
SimHash 71549850a7e0

Groups

gptbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

claude-web

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

google-extended

Rule Path
Allow /

amazonbot

Rule Path
Allow /

qwenbot

Rule Path
Allow /

*

Rule Path
Disallow /*tag*
Disallow /*2018*
Disallow /*2020*
Disallow /*2021*
Disallow /*author*
Disallow /*category*
Disallow /*product_brief_category*
Disallow /*?s=*
Disallow /*?tx_product_brief_category=*

Other Records

Field Value
sitemap https://www.intalio.com/sitemap.xml

Comments

  • ===============================
  • AI Visibility Configuration
  • ===============================
  • Allow OpenAI GPTBot (ChatGPT)
  • Allow Anthropic Claude bots
  • Allow Perplexity AI crawlers
  • Allow Google’s AI crawler for AI Overviews
  • Allow Amazonbot (for AI and Alexa integrations)
  • Allow Alibabas QwenBot