rtmnuonline.com
robots.txt

Robots Exclusion Standard data for rtmnuonline.com

Resource Scan

Scan Details

Site Domain rtmnuonline.com
Base Domain rtmnuonline.com
Scan Status Ok
Last Scan2025-11-01T21:50:48+00:00
Next Scan 2025-11-08T21:50:48+00:00

Last Scan

Scanned2025-11-01T21:50:48+00:00
URL https://rtmnuonline.com/robots.txt
Redirect https://www.rtmnuonline.com/robots.txt
Redirect Domain www.rtmnuonline.com
Redirect Base rtmnuonline.com
Domain IPs 104.21.44.172, 172.67.201.154, 2606:4700:3033::ac43:c99a, 2606:4700:3035::6815:2cac
Redirect IPs 104.21.44.172, 172.67.201.154, 2606:4700:3033::ac43:c99a, 2606:4700:3035::6815:2cac
Response IP 104.21.44.172
Found Yes
Hash 1409f7f0f78fd07e5fb0408907fc176ef59de00b989f47fd66e7ff99689eb4f1
SimHash 4991fb42f091

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

googlebot-video

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

*

Rule Path
Disallow /

anthropic-web-crawler

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

grokbot

Rule Path
Disallow /

xai-bot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Comments

  • Allow Google Search engine crawlers
  • Disallow all other bots, including AI tools
  • Explicitly disallow known AI crawlers