midjourneysref.com
robots.txt

Robots Exclusion Standard data for midjourneysref.com

Resource Scan

Scan Details

Site Domain midjourneysref.com
Base Domain midjourneysref.com
Scan Status Ok
Last Scan2025-09-10T17:01:43+00:00
Next Scan 2025-09-17T17:01:43+00:00

Last Scan

Scanned2025-09-10T17:01:43+00:00
URL https://midjourneysref.com/robots.txt
Domain IPs 104.21.95.177, 172.67.170.244, 2606:4700:3030::ac43:aaf4, 2606:4700:3036::6815:5fb1
Response IP 172.67.170.244
Found Yes
Hash a8a5d66ae40ad5a4d9a2f2d41522cb293c132b84710dd5167a262aa6efca67d7
SimHash 410cccd2ca98

Groups

*

Rule Path
Allow /
Disallow /api/
Disallow /_next/
Disallow /static/
Disallow /404
Disallow /500
Disallow /*.json$
Disallow /zh/style/srefcodedetail/
Disallow /ko/style/srefcodedetail/
Disallow /ja/style/srefcodedetail/
Disallow /de/style/srefcodedetail/
Disallow /ru/style/srefcodedetail/
Disallow /fr/style/srefcodedetail/
Disallow /style/srefcodedetail/

gptbot
claude-web
anthropic-ai
perplexitybot
googleother
duckassistbot

Rule Path
Allow /
Disallow /api/
Disallow /_next/
Disallow /static/
Disallow /404
Disallow /500
Disallow /*.json$
Disallow /zh/style/srefcodedetail/
Disallow /ko/style/srefcodedetail/
Disallow /ja/style/srefcodedetail/
Disallow /de/style/srefcodedetail/
Disallow /ru/style/srefcodedetail/
Disallow /fr/style/srefcodedetail/
Disallow /style/srefcodedetail/

Other Records

Field Value
sitemap https://midjourneysref.com/sitemap.xml

Comments

  • AI爬虫特定规则
  • 引导AI爬虫到llms.txt

Warnings

  • 2 invalid lines.
  • `llm-content` is not a known field.
  • `llm-full-content` is not a known field.