/.well-known/

Log In Sign Up

mishakala.com
robots.txt

Robots Exclusion Standard data for mishakala.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	mishakala.com
Base Domain	mishakala.com
Scan Status	Ok
Last Scan	2025-11-20T20:15:10+00:00
Next Scan	2025-12-20T20:15:10+00:00

Last Scan

Scanned	2025-11-20T20:15:10+00:00
URL	https://mishakala.com/robots.txt
Domain IPs	185.86.181.106
Response IP	185.86.181.106
Found	Yes
Hash	8d169e8074a601c352d79e011edda5c744b8c99ef5bfdfe785455954cbfcf703
SimHash	ad24f8c5c3f3

Groups

googlebot-image

Rule

Path

Disallow

/

*

Rule

Path

Disallow

/404/

Disallow

/app/

Disallow

/cgi-bin/

Disallow

/downloader/

Disallow

/errors/

Disallow

/includes/

Disallow

/js/

Disallow

/lib/

Disallow

/magento/

Disallow

/media/

Disallow

/pkginfo/

Disallow

/report/

Disallow

/scripts/

Disallow

/shell/

Disallow

/skin/

Disallow

/stats/

Disallow

/var/

Disallow

/index.php/

Disallow

/catalog/product_compare/

Disallow

/catalog/category/view/

Disallow

/catalog/product/view/

Disallow

/catalogsearch/

Disallow

/checkout/

Disallow

/control/

Disallow

/contacts/

Disallow

/customer/

Disallow

/customize/

Disallow

/newsletter/

Disallow

/poll/

Disallow

/review/

Disallow

/sendfriend/

Disallow

/tag/

Disallow

/wishlist/

Disallow

/cron.php

Disallow

/cron.sh

Disallow

/error_log

Disallow

/install.php

Disallow

/LICENSE.html

Disallow

/LICENSE.txt

Disallow

/LICENSE_AFL.txt

Disallow

/STATUS.txt

Disallow

/*.js$

Disallow

/*.css$

Disallow

/*.php$

Disallow

/*?SID=

Back to top

Other Records

Field

Value

sitemap

http://didbazar.ir/sitemap/sitemap.xml

Back to top

Comments

Google Image Crawler Setup
Crawlers Setup
Directories
Paths (clean URLs)
Files
Paths (no clean URLs)

Back to top