toptenelectronics.in
robots.txt

Robots Exclusion Standard data for toptenelectronics.in

Resource Scan

Scan Details

Site Domain toptenelectronics.in
Base Domain toptenelectronics.in
Scan Status Ok
Last Scan 2025-12-14T04:29:06+00:00
Next Scan 2026-01-13T04:29:06+00:00

Last Scan

Scanned 2025-12-14T04:29:06+00:00
URL https://toptenelectronics.in/robots.txt
Redirect https://www.toptenelectronics.in/robots.txt
Redirect Domain www.toptenelectronics.in
Redirect Base toptenelectronics.in
Domain IPs 104.21.91.49, 172.67.166.188, 2606:4700:3035::6815:5b31, 2606:4700:3036::ac43:a6bc
Redirect IPs 104.21.91.49, 172.67.166.188, 2606:4700:3035::6815:5b31, 2606:4700:3036::ac43:a6bc
Response IP 104.21.91.49
Found Yes
Hash a949b53ee00545a063e0e47c21a9edeeafdef621e9225d20fb0828a45107ce4d
SimHash ac4645b2eff2
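
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. As a minimal sketch under that assumption, a locally saved copy of the file could be compared against the reported value like this. The helper names sha256_hex and matches_report are hypothetical and not part of any scanner.

```python
import hashlib

# Value copied from the Hash field of the scan report.
REPORTED_HASH = "a949b53ee00545a063e0e47c21a9edeeafdef621e9225d20fb0828a45107ce4d"


def sha256_hex(body: bytes) -> str:
    """Return the lowercase hex SHA-256 digest of a raw response body."""
    return hashlib.sha256(body).hexdigest()


def matches_report(body: bytes) -> bool:
    """True if the body hashes to the reported value.

    Assumption: the report hashes the raw, unmodified response bytes
    with SHA-256. If the scanner normalises the text first, a mismatch
    here does not prove the file changed.
    """
    return sha256_hex(body) == REPORTED_HASH
```

A mismatch would then mean either that the file changed since the scan or that the assumption about the algorithm or input bytes is wrong.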

Groups

*

Rule Path
Disallow /shop?category=

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

baiduspider-news

Rule Path
Disallow /

baiduspider-favo

Rule Path
Disallow /

baiduspider-cpro

Rule Path
Disallow /

baiduspider-ads

Rule Path
Disallow /

baidu

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

krzana bot

Rule Path
Disallow /

krzana

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /
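
To illustrate how the groups above are applied, here is a small sketch using Python's standard urllib.robotparser. The robots.txt text in it is a hand-written subset reconstructed from three of the listed groups, not the site's actual file, and "examplebot" is a made-up user agent used only to show the fallback to the * group.

```python
from urllib.robotparser import RobotFileParser

# Hand-written subset of the groups listed in the report.
# This is an illustration, not the real robots.txt from the site.
ROBOTS_SUBSET = """\
User-agent: *
Disallow: /shop?category=

User-agent: baiduspider
Disallow: /

User-agent: mj12bot
Disallow: /
"""

BASE = "https://www.toptenelectronics.in"

parser = RobotFileParser()
parser.parse(ROBOTS_SUBSET.splitlines())


def is_allowed(user_agent: str, path: str) -> bool:
    """Check whether user_agent may fetch BASE + path under the subset above."""
    return parser.can_fetch(user_agent, BASE + path)


# A crawler with its own group is blocked everywhere.
print(is_allowed("mj12bot", "/"))
# A crawler without a group falls back to the * group.
print(is_allowed("examplebot", "/"))
print(is_allowed("examplebot", "/shop?category=tv"))
```

Note that urllib.robotparser matches user agents by substring, so its results can differ from crawlers that implement the matching rules of RFC 9309 more strictly.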

Other Records

Field Value
sitemap https://www.toptenelectronics.in/sitemap.xml

Comments

  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot

Warnings

  • 2 invalid lines.