mygrasslands.com
robots.txt

Robots Exclusion Standard data for mygrasslands.com

Resource Scan

Scan Details

Site Domain mygrasslands.com
Base Domain mygrasslands.com
Scan Status Ok
Last Scan 2025-10-22T23:14:40+00:00
Next Scan 2025-11-21T23:14:40+00:00

Last Scan

Scanned 2025-10-22T23:14:40+00:00
URL https://mygrasslands.com/robots.txt
Redirect https://www.mygrasslands.com/robots.txt
Redirect Domain www.mygrasslands.com
Redirect Base mygrasslands.com
Domain IPs 75.2.70.75, 99.83.190.102
Redirect IPs 52.68.134.190, 54.238.67.66, 57.180.160.60
Response IP 13.233.175.166
Found Yes
Hash f47534c3cddaa6c16bf2614a6c3ab50a743e0601f2af135202e7e58543dfa4d6
SimHash 24152b324d83

Groups

*

Rule Path Comment
Disallow /api/ Disallow Webflow API access
Disallow /cdn-cgi/ Disallow Webflow's CDN paths
Disallow /uploads/ Disallow access to uploaded files folder (optional)
Disallow /drafts/ Disallow access to draft pages or content (if applicable)
Disallow /*?* -
Allow / -
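
The rules in this group interact: the broad "Allow /" overlaps every Disallow line, so which one applies depends on how a crawler resolves conflicts. The sketch below is one way to reason about that, assuming the longest-match convention described in RFC 9309 (the most specific matching pattern wins, Allow wins ties, "*" matches any run of characters, a trailing "$" anchors the end). The names RULES, pattern_to_regex, and is_allowed are hypothetical helpers written for this illustration, not part of the scan tool or of any library, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules copied from the "*" group above, as (directive, path pattern) pairs.
RULES = [
    ("disallow", "/api/"),
    ("disallow", "/cdn-cgi/"),
    ("disallow", "/uploads/"),
    ("disallow", "/drafts/"),
    ("disallow", "/*?*"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing
    '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    The longest matching pattern wins; on equal length, Allow wins.
    A path that matches no rule is treated as allowed.
    """
    best_len = -1
    best_allow = True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            longer = len(pattern) > best_len
            tie_break = len(pattern) == best_len and allow and not best_allow
            if longer or tie_break:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for sample in ["/", "/about", "/api/v1", "/blog?page=2"]:
        print(sample, is_allowed(sample))
```

Under these assumptions, ordinary pages such as "/about" remain crawlable because only "Allow /" matches them, while any URL containing a query string is blocked because "/*?*" is a longer match than "/".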

Other Records

Field Value
sitemap https://mygrasslands.com/sitemap.xml
sitemap https://www.mygrasslands.com/sitemap.xml

Comments

  • robots.txt for mygrasslands.com
  • This file controls how search engine bots crawl your website.
  • Prevent bots from crawling URL parameters to avoid duplicate content
  • Allow everything else to be crawled
  • Sitemap location