indiacircus.com
robots.txt

Robots Exclusion Standard data for indiacircus.com

Resource Scan

Scan Details

Site Domain indiacircus.com
Base Domain indiacircus.com
Scan Status Ok
Last Scan2024-09-19T17:56:23+00:00
Next Scan 2024-10-19T17:56:23+00:00

Last Scan

Scanned2024-09-19T17:56:23+00:00
URL https://indiacircus.com/robots.txt
Domain IPs 13.33.30.17, 13.33.30.22, 13.33.30.45, 13.33.30.78
Response IP 13.33.30.78
Found Yes
Hash 462575e59b8b72de68c7bb1ef6363ae947e929b6f72916576ac292b930bb86a3
SimHash 67f07351cc62

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

ahrefsbot
ahrefssiteaudit
*

Rule Path
Disallow /*.git$
Disallow /*.github$
Disallow /*.sql$
Disallow /*.tgz$
Disallow /admin_ufs69f/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /icsearch/
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pub/errors/
Disallow /pub/opt/
Disallow /pub/static/
Disallow /tag/
Disallow /review/
Disallow /*?SID=
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /info.php

googlebot-image

Rule Path
Disallow

baidou

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /
Disallow

mj12bot

Rule Path
Disallow /

vagabondo

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

bspider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

semrush

Rule Path
Disallow /

Other Records

Field Value
sitemap https://indiacircus.com/sitemap.xml

Comments

  • robots.txt for Magento By Bee Online
  • GENERAL SETTINGS
  • Enable robots.txt rules for all crawlers
  • Crawl-delay parameter: number of seconds to wait between successive requests to the same server.
  • Set a custom crawl rate if you're experiencing traffic problems with your server.
  • Crawl-delay: 60
  • Magento sitemap: uncomment and replace the URL to your Magento sitemap file
  • Sitemap: http://www.example.com/sitemap/sitemap.xml
  • DEVELOPMENT RELATED SETTINGS
  • Do not crawl development files and folders: CVS, svn directories and dump files
  • GENERAL MAGENTO SETTINGS
  • Do not crawl Magento admin page
  • Default Instructions
  • Restrict User Account & Checkout Pages
  • Disallow Catalog Search Pages
  • Disallow URL Filter Searches
  • Restrict CMS Directories
  • Disallow Duplicate Content
  • Do not crawl 2-nd home page copy (example.com/index.php/). Uncomment it only if you activated Magento SEO URLs.
  • Disallow: /index.php/
  • Do not crawl links with session IDs
  • Do not crawl checkout and user account pages
  • Do not crawl seach pages and not-SEO optimized catalog links
  • SERVER SETTINGS
  • Do not crawl common server technical folders and files
  • IMAGE CRAWLERS SETTINGS
  • Extra: Uncomment if you do not wish Google and Bing to index your images
  • User-agent: msnbot-media
  • Disallow: /
  • Baiduspider
  • User-agent: Rogerbot
  • Crawl-limit:5