academy.alegra.com
robots.txt

Robots Exclusion Standard data for academy.alegra.com

Resource Scan

Scan Details

Site Domain academy.alegra.com
Base Domain alegra.com
Scan Status Ok
Last Scan2025-05-24T00:19:45+00:00
Next Scan 2025-06-23T00:19:45+00:00

Last Scan

Scanned2025-05-24T00:19:45+00:00
URL https://academy.alegra.com/robots.txt
Domain IPs 138.197.3.188
Response IP 138.197.3.188
Found Yes
Hash cefa06b5b5f6f91932ff53ae42c00e1e1a67a450a74c760a0b258eb97982fb33
SimHash 4330013af177

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /*?s=
Disallow /tag/
Disallow /author/
Disallow /cdn-cgi/
Disallow /*/comments/
Disallow /wp-comments-post.php
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /*?comments

gptbot
claudebot
google-extended
chatgpt-user
anthropic-ai
cohere-ai
perplexitybot
oai-searchbot
perplexity-ai
anthropicbot
bard
openai-api
deepseekbot
huggingfacebot
ai21bot
ai2bot
aisearchbot
anthropic-crawler
ccbot/2.0
claude-webbo
cohere-crawler
coherebot
google-generative-ai-crawler
groqbot
img2dataset
llama-control
meta ai bot
midjourneybot
omgilibot/1.0
stabilitybot
petalbot
aihitbot
duplexweb-google
mlbot
bytespider
pixray-seeker
aipbot
aibot
youbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://academy.alegra.com/sitemap_index.xml

Comments

  • Sitemap files
  • Allow IA to index the website