owis.org
robots.txt

Robots Exclusion Standard data for owis.org

Resource Scan

Scan Details

Site Domain owis.org
Base Domain owis.org
Scan Status Ok
Last Scan2025-10-23T21:07:47+00:00
Next Scan 2025-11-06T21:07:47+00:00

Last Scan

Scanned2025-10-23T21:07:47+00:00
URL https://www.owis.org/robots.txt
Domain IPs 104.26.10.37, 104.26.11.37, 172.67.75.117
Response IP 172.67.75.117
Found Yes
Hash b8b7e13c7f62cc3f2e94a7d5ed94c0111c1136a039f433671f19f00611b56534
SimHash 1596f2004507

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /private/
Allow /

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

yandex

Rule Path
Allow /

baiduspider

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

applebot

Rule Path
Allow /

gptbot

Rule Path
Allow /

claudebot

Rule Path
Allow /

perplexitybot

Rule Path
Allow /

google-extended

Rule Path
Allow /

bingai

Rule Path
Allow /

ccbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://owis.org/sa/sitemap_index.xml
sitemap https://owis.org/in/sitemap_index.xml
sitemap https://owis.org/sg/sitemap_index.xml
sitemap https://owis.org/jp/sitemap_index.xml

Comments

  • Major Search Engine Bots
  • Google Search
  • Bing Search
  • Yahoo Search
  • Yandex Search
  • Baidu Search
  • DuckDuckGo Bot
  • Apple Search Bot
  • Major AI Bots
  • OpenAI GPTBot (used for ChatGPT / GPT training)
  • Anthropic ClaudeBot
  • Perplexity AI Bot
  • Google AI Crawler (Gemini/Bard training)
  • Microsoft AI Bot (Bing AI / Copilot)
  • Common Crawl (used by many AI models)