ph.puma.com
robots.txt

Robots Exclusion Standard data for ph.puma.com

Resource Scan

Scan Details

Site Domain ph.puma.com
Base Domain puma.com
Scan Status Ok
Last Scan2025-09-29T16:18:58+00:00
Next Scan 2025-10-29T16:18:58+00:00

Last Scan

Scanned2025-09-29T16:18:58+00:00
URL https://ph.puma.com/robots.txt
Domain IPs 104.18.39.147, 172.64.148.109, 2606:4700:4405::6812:2793, 2a06:98c1:3100::ac40:946d
Response IP 104.18.39.147
Found Yes
Hash 9e6e7e203730e16c3aedade3032a039b8978e84870e98cc2058b072f368ad066
SimHash 235bc804effb

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /fr_FR/
Disallow /*pmin*
Disallow /*pmax*
Disallow /*prefn1*
Disallow /*prefn2*
Disallow /*prefn3*
Disallow /*prefn4*
Disallow /*prefv1*
Disallow /*prefv2*
Disallow /*prefv3*
Disallow /*prefv4*
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

msnbot

Rule Path
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

Other Records

Field Value
crawl-delay 30

bingbot

Rule Path
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

Other Records

Field Value
crawl-delay 30

mj12bot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ph.puma.com/sitemap_index.xml

Warnings

  • 2 invalid lines.