ggsipuonline.com
robots.txt

Robots Exclusion Standard data for ggsipuonline.com

Resource Scan

Scan Details

Site Domain ggsipuonline.com
Base Domain ggsipuonline.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-13T15:26:07+00:00
Next Scan 2025-09-20T15:26:07+00:00

Last Successful Scan

Scanned2025-09-05T15:25:13+00:00
URL https://ggsipuonline.com/robots.txt
Redirect https://www.ggsipuonline.com/robots.txt
Redirect Domain www.ggsipuonline.com
Redirect Base ggsipuonline.com
Domain IPs 104.21.75.242, 172.67.184.9, 2606:4700:3035::ac43:b809, 2606:4700:3036::6815:4bf2
Redirect IPs 104.21.75.242, 172.67.184.9, 2606:4700:3035::ac43:b809, 2606:4700:3036::6815:4bf2
Response IP 172.67.184.9
Found Yes
Hash 1409f7f0f78fd07e5fb0408907fc176ef59de00b989f47fd66e7ff99689eb4f1
SimHash 4991fb42f091

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

*

Rule Path
Disallow /

anthropic-web-crawler

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

grokbot

Rule Path
Disallow /

xai-bot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Comments

  • Allow Google Search engine crawlers
  • Disallow all other bots, including AI tools
  • Explicitly disallow known AI crawlers