benderacoy.online
robots.txt

Robots Exclusion Standard data for benderacoy.online

Resource Scan

Scan Details

Site Domain benderacoy.online
Base Domain benderacoy.online
Scan Status Ok
Last Scan2025-04-06T00:57:53+00:00
Next Scan 2025-05-06T00:57:53+00:00

Last Scan

Scanned2025-04-06T00:57:53+00:00
URL https://benderacoy.online/robots.txt
Domain IPs 104.21.59.112, 172.67.175.158, 2606:4700:3034::ac43:af9e, 2606:4700:3036::6815:3b70
Response IP 104.21.59.112
Found Yes
Hash 40c133d8018261c1b5928ee47b692650da8ceb7f60ba05988644feb9cb421164
SimHash a9044f1167a1

Groups

*

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

googlebot

Rule Path
Disallow /admin/
Disallow /login/
Disallow /checkout/
Disallow /cart/
Disallow /private/
Disallow /user/
Disallow /register/

badbot

Rule Path
Disallow /
Disallow /*.pdf$
Disallow /*.zip$
Disallow /*.tar$

googlebot-image

Rule Path
Allow /images/

Other Records

Field Value
sitemap https://benderacoy.online/sitemap.xml

Comments

  • robots.txt for best Google crawling
  • Allow all search engines to crawl all content
  • Allow Googlebot to crawl everything except restricted areas
  • Block a specific bot from crawling the site
  • Sitemap location (helps crawlers find your sitemap easily)
  • Block bots from indexing certain file types (like PDFs or temporary files)
  • Enable Googlebot to crawl your images