mixuai.com
robots.txt

Robots Exclusion Standard data for mixuai.com

Resource Scan

Scan Details

Site Domain mixuai.com
Base Domain mixuai.com
Scan Status Ok
Last Scan2025-11-24T00:45:18+00:00
Next Scan 2025-12-24T00:45:18+00:00

Last Scan

Scanned2025-11-24T00:45:18+00:00
URL https://www.mixuai.com/robots.txt
Domain IPs 104.21.69.187, 172.67.212.48, 2606:4700:3036::6815:45bb, 2606:4700:3036::ac43:d430
Response IP 172.67.212.48
Found Yes
Hash 9c573a4f02cb02fb87c86276ae593c0edfb33983daffa7b302a3f81164469d50
SimHash 705cd340e090

Groups

blexbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

meltwater

Rule Path
Disallow /

seekr

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

applebot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

*

Rule Path
Disallow /static/contact.html

*

Rule Path
Disallow

Comments

  • Block Open AI
  • Block Google (Gemini)
  • Block Claude
  • Block CommonCrawl
  • Block Diffbot
  • Block Meta (Facebook)
  • Block Webz.io
  • Block ImagesiftBot
  • Block Meltwater
  • Block Seekr