pattern.com
robots.txt

Robots Exclusion Standard data for pattern.com

Resource Scan

Scan Details

Site Domain pattern.com
Base Domain pattern.com
Scan Status Ok
Last Scan2025-07-25T21:09:21+00:00
Next Scan 2025-08-24T21:09:21+00:00

Last Scan

Scanned2025-07-25T21:09:21+00:00
URL https://pattern.com/robots.txt
Domain IPs 18.211.166.153, 34.202.203.47, 54.243.86.28
Response IP 15.160.106.203
Found Yes
Hash 3639d4e8c68339c466b297c46f39f80be10bb438b74ef1831bb2fcab9891ea87
SimHash 5005810dadf1

Groups

*

Rule Path
Disallow /*?region
Disallow /*?tag
Disallow /*author
Disallow /*?authors
Disallow /report-files

gptbot
chatgpt-user
ccbot
anthropic-ai
claude-web
google-extended

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://cdn.prod.website-files.com/67d327c7ca817d803c46c86b/682675945b95c192d0d772fc_llms.txt
sitemap https://pattern.com/sitemap.xml

Comments

  • Pointing to llms.txt for LLMs and AI crawlers

Warnings

  • `llm-discovery` is not a known field.