taneira.com
robots.txt

Robots Exclusion Standard data for taneira.com

Resource Scan

Scan Details

Site Domain taneira.com
Base Domain taneira.com
Scan Status Ok
Last Scan 2026-03-09T20:23:27+00:00
Next Scan 2026-04-08T20:23:27+00:00

Last Scan

Scanned 2026-03-09T20:23:27+00:00
URL https://taneira.com/robots.txt
Redirect https://www.taneira.com/robots.txt
Redirect Domain www.taneira.com
Redirect Base taneira.com
Domain IPs 104.18.30.34, 104.18.31.34, 2606:4700::6812:1e22, 2606:4700::6812:1f22
Redirect IPs 104.18.30.34, 104.18.31.34, 2606:4700::6812:1e22, 2606:4700::6812:1f22
Response IP 104.18.31.34
Found Yes
Hash e90f83c3ffa20ea40c8248e080180374c1ef87f9b5fc595cfae238c296eb82ff
SimHash 59174e504fb2
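
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, the following shows how such a digest could be computed over the raw bytes of a robots.txt response. The function name `robots_digest` and the sample body are illustrative and are not taken from the scanner.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report only shows a 64-character hex string, so this is a guess.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real taneira.com file.
sample = b"User-agent: *\nDisallow: /cart\n"
print(robots_digest(sample))
```

Comparing the digest from two scans is a cheap way to tell whether the file changed at all. The SimHash field presumably serves the complementary purpose of measuring how much it changed.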

Groups

*

Rule Path
Allow /
Disallow /en
Disallow /en/*
Disallow /cart
Disallow /checkout/*
Disallow /myaccount/*
Disallow /pgcallback/*
Disallow /search-results/*
Disallow /wps/portal/*
Disallow /error
Disallow /search
Disallow */wps/
Disallow */demandware.store/
Disallow *_p.html
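
The rules in the `*` group mix a blanket `Allow /` with narrower `Disallow` patterns, some of which use the `*` wildcard. The sketch below illustrates how a crawler following the longest-match convention of RFC 9309 might evaluate them: every matching rule is collected, the longest pattern wins, and `Allow` wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and real crawlers may differ in details such as percent-encoding.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("allow", "/"),
    ("disallow", "/en"),
    ("disallow", "/en/*"),
    ("disallow", "/cart"),
    ("disallow", "/checkout/*"),
    ("disallow", "/myaccount/*"),
    ("disallow", "/pgcallback/*"),
    ("disallow", "/search-results/*"),
    ("disallow", "/wps/portal/*"),
    ("disallow", "/error"),
    ("disallow", "/search"),
    ("disallow", "*/wps/"),
    ("disallow", "*/demandware.store/"),
    ("disallow", "*_p.html"),
]


def pattern_to_regex(path: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a prefix-matching regex."""
    anchored = path.endswith("$")  # a trailing "$" pins the end of the URL path
    if anchored:
        path = path[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path: str, rules=RULES) -> bool:
    """Longest matching pattern wins; Allow wins when lengths are equal."""
    best = None  # (pattern length, is_allow)
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


for path in ["/sarees/silk", "/cart", "/checkout", "/checkout/payment", "/item_p.html"]:
    print(path, is_allowed(path))
```

Because patterns are matched as prefixes, `Disallow /en` in this sketch also blocks any path that merely starts with `/en`, such as `/english-guide`, while `/checkout` without a trailing slash remains allowed by `Allow /`.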

copyrightcheck

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /
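
Each named group above carries a single `Disallow /`, so any crawler that selects one of those groups is excluded from the whole site, while all other crawlers fall back to the `*` group. The following sketch shows one simplified way a crawler's User-Agent string could be mapped to a group. It assumes a case-insensitive substring match and prefers the longest group name; RFC 9309 describes matching on the product token instead, so treat this as an illustration only. The function name `select_group` is hypothetical.

```python
# Group names copied from the report above; each one has "Disallow /".
BLOCKED_AGENTS = [
    "copyrightcheck",
    "offline explorer",
    "queryn metasearch",
    "true_robot",
    "cazoodlebot",
    "mj12bot",
    "dotbot/1.0",
    "gigabot",
    "mauibot",
    "etaospider",
    "dotbot",
    "nutch",
]


def select_group(user_agent: str) -> str:
    """Return the group name that applies to a User-Agent string.

    Assumption: case-insensitive substring matching, longest name first.
    Anything that matches no named group uses the "*" group.
    """
    ua = user_agent.lower()
    matches = [name for name in BLOCKED_AGENTS if name in ua]
    return max(matches, key=len) if matches else "*"


print(select_group("Mozilla/5.0 (compatible; MJ12bot/v1.4.8)"))
print(select_group("Mozilla/5.0 (compatible; ExampleBot/2.1)"))
```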

Other Records

Field Value
sitemap https://www.taneira.com/sitemap_index.xml

Comments

  • [ASCII-art banner reading "Taneira"]
  • www.taneira.com - robots.txt - crawl like no other