alwaysdeluxe.com
robots.txt

Robots Exclusion Standard data for alwaysdeluxe.com

Resource Scan

Scan Details

Site Domain alwaysdeluxe.com
Base Domain alwaysdeluxe.com
Scan Status Ok
Last Scan2025-04-15T00:17:43+00:00
Next Scan 2025-05-15T00:17:43+00:00

Last Scan

Scanned2025-04-15T00:17:43+00:00
URL https://alwaysdeluxe.com/robots.txt
Domain IPs 104.21.71.179, 172.67.147.246, 2606:4700:3035::6815:47b3, 2606:4700:3037::ac43:93f6
Response IP 104.21.71.179
Found Yes
Hash 7359e4ac820b4e48e29bceef391c88beeb072c1fb314a19765833ce1126fb705
SimHash a8044f1167a5

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

googlebot

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

badbot

Rule Path
Disallow /
Disallow /*.pdf$
Disallow /*.zip$
Disallow /*.tar$

googlebot-image

Rule Path
Allow /images/

Other Records

Field Value
sitemap https://alwaysdeluxe.com/sitemap.xml

Comments

  • robots.txt for best Google crawling
  • Allow all search engines to crawl all content
  • Allow Googlebot to crawl everything except restricted areas
  • Block a specific bot from crawling the site
  • Sitemap location (helps crawlers find your sitemap easily)
  • Block bots from indexing certain file types (like PDFs or temporary files)
  • Enable Googlebot to crawl your images