telanganaboard.com
robots.txt

Robots Exclusion Standard data for telanganaboard.com

Resource Scan

Scan Details

Site Domain telanganaboard.com
Base Domain telanganaboard.com
Scan Status Ok
Last Scan2025-09-12T09:56:57+00:00
Next Scan 2025-09-19T09:56:57+00:00

Last Scan

Scanned2025-09-12T09:56:57+00:00
URL https://telanganaboard.com/robots.txt
Redirect https://www.telanganaboard.com/robots.txt
Redirect Domain www.telanganaboard.com
Redirect Base telanganaboard.com
Domain IPs 104.21.11.198, 172.67.150.61, 2606:4700:3037::6815:bc6, 2606:4700:3037::ac43:963d
Redirect IPs 104.21.11.198, 172.67.150.61, 2606:4700:3037::6815:bc6, 2606:4700:3037::ac43:963d
Response IP 172.67.150.61
Found Yes
Hash 1409f7f0f78fd07e5fb0408907fc176ef59de00b989f47fd66e7ff99689eb4f1
SimHash 4991fb42f091

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

*

Rule Path
Disallow /

anthropic-web-crawler

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

grokbot

Rule Path
Disallow /

xai-bot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Comments

  • Allow Google Search engine crawlers
  • Disallow all other bots, including AI tools
  • Explicitly disallow known AI crawlers