mainstreet.com
robots.txt

Robots Exclusion Standard data for mainstreet.com

Resource Scan

Scan Details

Site Domain mainstreet.com
Base Domain mainstreet.com
Scan Status Ok
Last Scan 2026-01-17T05:43:30+00:00
Next Scan 2026-01-31T05:43:30+00:00

Last Scan

Scanned 2026-01-17T05:43:30+00:00
URL https://mainstreet.com/robots.txt
Redirect https://www.mainstreet.com/robots.txt
Redirect Domain www.mainstreet.com
Redirect Base mainstreet.com
Domain IPs 34.111.179.208
Redirect IPs 34.111.179.208
Response IP 34.111.179.208
Found Yes
Hash a7d3328bba93b74366042b600f83a1632fd75c3c4821b01fc7184a9bf43d1227
SimHash a288dac2a823
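
The Hash value is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Assuming SHA-256, a minimal sketch of detecting changes between scans could look like the following; `content_hash` and `has_changed` are hypothetical helper names, not part of any scanner API.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (algorithm is an assumption)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored from the last scan."""
    return content_hash(body) != previous_hash.lower()
```

An exact hash only answers whether the file is byte-for-byte identical; the shorter SimHash field presumably serves near-duplicate comparison, which this sketch does not cover.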

Groups

*

Rule Path
Allow /
Allow /services
Allow /pricing
Allow /about
Allow /careers
Allow /blog
Allow /stories
Allow /news
Disallow /admin
Disallow /api
Disallow /uploads
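
Because the group contains both a catch-all Allow / and more specific Disallow paths, the result for a given URL depends on rule precedence. The sketch below applies the longest-match rule described in RFC 9309 to the paths in the table above. It assumes plain prefix matching (no `*` or `$` wildcards, which this group does not use), and `is_allowed` is a hypothetical helper name.

```python
# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/"),
    ("allow", "/services"),
    ("allow", "/pricing"),
    ("allow", "/about"),
    ("allow", "/careers"),
    ("allow", "/blog"),
    ("allow", "/stories"),
    ("allow", "/news"),
    ("disallow", "/admin"),
    ("disallow", "/api"),
    ("disallow", "/uploads"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Longest matching prefix wins; on a tie, allow wins; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = kind == "allow"
    return allowed
```

Under this rule `is_allowed("/blog/post")` is true and `is_allowed("/admin/login")` is false, because /admin is a longer match than /. Matching is by prefix, so a path such as /administrator is also caught by Disallow /admin. Parsers that use first-match ordering instead may give different answers when Allow / is listed first.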

Other Records

Field Value
crawl-delay 1
sitemap https://mainstreet.com/sitemap.xml
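
As a rough illustration of how a crawler might read these two records, the sketch below feeds an equivalent fragment to Python's standard `urllib.robotparser`. The fragment is reconstructed from the fields above, so its line order and layout are assumptions and not the site's actual file; `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumed layout), holding only the records listed above.
ROBOTS_TXT = """\
User-agent: *
Crawl-delay: 1

Sitemap: https://mainstreet.com/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Seconds to wait between requests for the "*" group, or None if unset.
delay = parser.crawl_delay("*")

# List of sitemap URLs declared in the file, or None if there are none.
sitemaps = parser.site_maps()
```

A crawler honoring the record would sleep for `delay` seconds between requests. Crawl-delay is a non-standard extension that some crawlers ignore.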

Comments

  • Important pages for crawling
  • Disallow admin/private areas
  • Sitemap location
  • Crawl delay for respectful crawling