internationalschoolcommunity.com
robots.txt

Robots Exclusion Standard data for internationalschoolcommunity.com

Resource Scan

Scan Details

Site Domain internationalschoolcommunity.com
Base Domain internationalschoolcommunity.com
Scan Status Ok
Last Scan 2025-10-11T09:32:46+00:00
Next Scan 2025-11-10T09:32:46+00:00

Last Scan

Scanned 2025-10-11T09:32:46+00:00
URL https://internationalschoolcommunity.com/robots.txt
Domain IPs 192.124.249.7
Response IP 192.124.249.7
Found Yes
Hash 536a205311f98352d2a7111ae0f59e4fe056b184d3a0ae31a3f838c233984acc
SimHash 51248b40a532
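
The recorded Hash is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The scan does not state the algorithm or exactly which bytes were hashed, so the sketch below assumes SHA-256 over the raw response body; the helper names sha256_hex and matches_recorded are hypothetical. It shows how a locally fetched copy of robots.txt could be compared against the recorded value.

```python
import hashlib

# Hash value recorded in the scan above.
RECORDED_HASH = "536a205311f98352d2a7111ae0f59e4fe056b184d3a0ae31a3f838c233984acc"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True when the body hashes to the recorded value.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would mean either that the file changed after the scan or that the assumption about the algorithm or the hashed bytes does not hold.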

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /product/

baiduspider
yisouspider
petalbot
amazonbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
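
The groups above can be evaluated with a small matcher. The following is a minimal sketch, not the scanner's own logic: it assumes the merging and longest-match behaviour described in RFC 9309 (rules from all groups naming the same user agent are combined, the longest matching path wins, and an allow rule wins a tie). It ignores wildcards, which none of the listed paths use, and reproduces only a subset of the single-agent groups. The names GROUPS, rules_for and is_allowed are hypothetical.

```python
# Groups as listed in the scan above: (user agents, [(rule, path), ...]).
# Only a subset of the single-agent "Disallow /" groups is reproduced here.
GROUPS = [
    (["*"], [("allow", "/")]),
    (["*"], [("disallow", "/product/")]),
    (["baiduspider", "yisouspider", "petalbot", "amazonbot"], [("disallow", "/")]),
    (["gptbot"], [("disallow", "/")]),
    (["claudebot"], [("disallow", "/")]),
]


def rules_for(agent: str):
    """Merge rules from every group naming the agent; fall back to '*' groups."""
    agent = agent.lower()
    specific = [r for agents, rules in GROUPS if agent in agents for r in rules]
    if specific:
        return specific
    return [r for agents, rules in GROUPS if "*" in agents for r in rules]


def is_allowed(agent: str, path: str) -> bool:
    """Longest matching path prefix wins; an allow rule wins a tie."""
    best_len, allowed = -1, True  # no matching rule means the path is allowed
    for kind, prefix in rules_for(agent):
        if not path.startswith(prefix):
            continue
        if len(prefix) > best_len or (len(prefix) == best_len and kind == "allow"):
            best_len, allowed = len(prefix), kind == "allow"
    return allowed
```

Under these assumptions, an unlisted crawler such as a hypothetical "examplebot" may fetch /about but not /product/123, because the two * groups are merged and the longer /product/ rule takes precedence over Allow /. Parsers that keep only the first * group would reach a different result for /product/ paths.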

Other Records

Field Value
sitemap https://internationalschoolcommunity.com/sitemap.xml

Comments

  • Block problem bots
  • Block OpenAI
  • Block Google Bard AI
  • Block Common Crawl AI scraper
  • Block Perplexity AI
  • Block other misc AI scrapers

Warnings

  • 1 invalid line.