thepeachbox.com
robots.txt

Robots Exclusion Standard data for thepeachbox.com

Resource Scan

Scan Details

Site Domain thepeachbox.com
Base Domain thepeachbox.com
Scan Status Ok
Last Scan2025-10-12T08:40:53+00:00
Next Scan 2025-10-19T08:40:53+00:00

Last Scan

Scanned2025-10-12T08:40:53+00:00
URL https://thepeachbox.com/robots.txt
Domain IPs 67.227.186.83
Response IP 67.227.186.83
Found Yes
Hash b12c89d32de880da9afe2c659e2c54e88c1d5c0b269098c4b6c0ee77149ebe6b
SimHash 7a199523eff0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

googlebot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

siteauditbot

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

splitsignalbot

Rule Path
Disallow /

semrushbot-coub

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
sitemap https://thepeachbox.com/wp-sitemap.xml

Comments

  • This is the robots.txt file for thepeachbox.com
  • Thepeachbox content is made available under our terms and conditions of use.
  • Any other uses are not permitted, incl. but not limited to: for large language
  • models (LLMs), machine learning and/or artificial intelligence-related
  • purposes; with any of the aforementioned technologies; and/or for any
  • commercial purposes.
  • Added OF
  • semrush
  • majestic