casadomontanhista.com.br
robots.txt

Robots Exclusion Standard data for casadomontanhista.com.br

Resource Scan

Scan Details

Site Domain casadomontanhista.com.br
Base Domain casadomontanhista.com.br
Scan Status Ok
Last Scan2024-11-05T08:09:34+00:00
Next Scan 2024-12-05T08:09:34+00:00

Last Scan

Scanned2024-11-05T08:09:34+00:00
URL https://casadomontanhista.com.br/robots.txt
Domain IPs 104.21.47.186, 172.67.171.208, 2606:4700:3034::ac43:abd0, 2606:4700:3036::6815:2fba
Response IP 172.67.171.208
Found Yes
Hash 953ee2b575eacc81d1842ab4ac41f608ac6b52ab4f45534b283958995e4354dd
SimHash 2d707b1346d9

Groups

*

Rule Path
Disallow /CVS
Disallow /*.svn$
Disallow /*.idea$
Disallow /*.sql$
Disallow /*.tgz$

Other Records

Field Value
crawl-delay 30

baiduspider

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-imagem

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-anĂșncios

Rule Path
Disallow /

baidu

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

majestic12

Rule Path
Disallow /

yandeximages

Rule Path
Disallow /

yandex

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

aranhabot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

coccocbot-image

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

daumoa

Rule Path
Disallow /

dazoobot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /

duckduckgo-favicons-bot

Rule Path
Disallow /

euripbot

Rule Path
Disallow /

exploratodo

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

bdcbot/1.0

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandexbot/3.0

Rule Path
Disallow /
Disallow /admin/
Disallow /app/
Disallow /downloader/
Disallow /errors/
Disallow /includes/
Disallow /lib/
Disallow /pkginfo/
Disallow /shell/
Disallow /var/
Disallow /api.php
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /get.php
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /README.txt
Disallow /RELEASE_NOTES.txt
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /*?SID=
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /cgi-bin/
Disallow /cleanup.php
Disallow /apc.php
Disallow /memcache.php
Disallow /phpinfo.php

msnbot-media

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

bingbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://casadomontanhista.com.br/pub/media/google_sitemap_1.xml

Comments

  • robots.txt for Magento Community and Enterprise
  • GENERAL SETTINGS
  • Enable robots.txt rules for all crawlers
  • Crawl-delay parameter: number of seconds to wait between successive requests to the same server.
  • Set a custom crawl rate if you're experiencing traffic problems with your server.
  • Magento sitemap: uncomment and replace the URL to your Magento sitemap file
  • DEVELOPMENT RELATED SETTINGS
  • Do not crawl development files and folders: CVS, svn directories and dump files
  • Baiduspider
  • GENERAL MAGENTO SETTINGS
  • Do not crawl Magento admin page
  • Do not crawl common Magento technical folders
  • Do not crawl common Magento files
  • MAGENTO SEO IMPROVEMENTS
  • Do not crawl sub category pages that are sorted or filtered.
  • Do not crawl 2-nd home page copy (example.com/index.php/). Uncomment it only if you activated Magento SEO URLs.
  • Disallow: /index.php/
  • Do not crawl links with session IDs
  • Do not crawl checkout and user account pages
  • Do not crawl seach pages and not-SEO optimized catalog links
  • SERVER SETTINGS
  • Do not crawl common server technical folders and files
  • IMAGE CRAWLERS SETTINGS
  • Extra: Uncomment if you do not wish Google and Bing to index your images
  • User-agent: Googlebot-Image
  • Disallow: /

Warnings

  • 6 invalid lines.