truvaxxl.com
robots.txt

Robots Exclusion Standard data for truvaxxl.com

Resource Scan

Scan Details

Site Domain truvaxxl.com
Base Domain truvaxxl.com
Scan Status Ok
Last Scan2025-11-03T10:58:48+00:00
Next Scan 2025-12-03T10:58:48+00:00

Last Scan

Scanned2025-11-03T10:58:48+00:00
URL https://truvaxxl.com/robots.txt
Domain IPs 104.19.156.83, 104.19.157.83
Response IP 104.19.156.83
Found Yes
Hash c3324a3d6227861f3f1421a5e34785aa9304601fba20659ec7372e410e18275f
SimHash 0146709616d3

Groups

mj12bot

Rule Path
Disallow /

baiduspiders

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /uye-girisi
Disallow /uye-girisi?next=order2
Disallow /uye-ol
Disallow /uye-ol?next=order2
Disallow /sifremi-unuttum
Disallow /sepet
Disallow /hesabim
Disallow /hesabim/*
Disallow /dosya/
Disallow /admin/
Disallow /brand/listByCategory/categoryId/
Disallow /brand/listByCategory/categoryId/*
Disallow /cart/
Disallow /cart/*
Disallow /*index.php?do=catalog%2Fprint&
Disallow /panel/*
Disallow /order/*
Disallow /odeme
Disallow /iletisim-formu
Disallow /havale-bildirim
Disallow /iade-ve-iptal-formu
Disallow /kargo-takibi
Disallow /siparis-sorgula
Disallow /siparislerim

Other Records

Field Value
crawl-delay 30

Other Records

Field Value
sitemap https://www.truvaxxl.com/sitemap.xml

Comments

  • IdeaSoft Akilli E-ticaret | robots.txt

Warnings

  • 2 invalid lines.