everymomspage.com
robots.txt

Robots Exclusion Standard data for everymomspage.com

Resource Scan

Scan Details

Site Domain everymomspage.com
Base Domain everymomspage.com
Scan Status Ok
Last Scan6/6/2025, 4:38:12 AM
Next Scan 6/13/2025, 4:38:12 AM

Last Scan

Scanned6/6/2025, 4:38:12 AM
URL https://everymomspage.com/robots.txt
Domain IPs 104.21.69.139, 172.67.208.250, 2606:4700:3034::6815:458b, 2606:4700:3035::ac43:d0fa
Response IP 104.21.69.139
Found Yes
Hash 5a1c671888a02431a9cd0aac3ed4a0416a668e893f809f3e801d150e54304976
SimHash a9044b1167a1

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

googlebot

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

badbot

Rule Path
Disallow /
Disallow /*.pdf$
Disallow /*.zip$
Disallow /*.tar$

googlebot-image

Rule Path
Allow /images/

Other Records

Field Value
sitemap https://www.everymomspage.com/sitemap.xml

Comments

  • robots.txt for best Google crawling
  • Allow all search engines to crawl all content
  • Allow Googlebot to crawl everything except restricted areas
  • Block a specific bot from crawling the site
  • Sitemap location (helps crawlers find your sitemap easily)
  • Block bots from indexing certain file types (like PDFs or temporary files)
  • Enable Googlebot to crawl your images