humanloop.com
robots.txt

Robots Exclusion Standard data for humanloop.com

Resource Scan

Scan Details

Site Domain humanloop.com
Base Domain humanloop.com
Scan Status Ok
Last Scan2025-09-22T22:57:19+00:00
Next Scan 2025-10-22T22:57:19+00:00

Last Scan

Scanned2025-09-22T22:57:19+00:00
URL https://humanloop.com/robots.txt
Domain IPs 76.76.21.21
Response IP 76.76.21.21
Found Yes
Hash db2c0656cdc727f2d556b0e425ba1aae2c8388f533597eb3810a762ab80c4f5d
SimHash 75569876e621

Groups

*

Rule Path
Allow /

gptbot

Rule Path
Allow /

chatgpt-user

Rule Path
Allow /

oai-searchbot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

perplexity-user

Rule Path
Allow /

claude-web

Rule Path
Allow /

google-extended

Rule Path
Allow /

gemini-user

Rule Path
Allow /

bingbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://humanloop.com/sitemap-index.xml

Comments

  • AI Bot Configurations (guidance from https://app.athenahq.ai/action-center)
  • OpenAI GPTBot configuration (for AI model training)
  • ChatGPT user browsing configuration
  • OpenAI search configuration
  • Perplexity search configuration
  • Perplexity user browsing configuration
  • Anthropic Claude configuration
  • Google Gemini configuration
  • Gemini user browsing configuration
  • Microsoft Bing/Copilot configuration