sg.puma.com
robots.txt

Robots Exclusion Standard data for sg.puma.com

Resource Scan

Scan Details

Site Domain sg.puma.com
Base Domain puma.com
Scan Status Ok
Last Scan2024-11-14T02:22:46+00:00
Next Scan 2024-12-14T02:22:46+00:00

Last Scan

Scanned2024-11-14T02:22:46+00:00
URL https://sg.puma.com/robots.txt
Domain IPs 104.18.33.130, 172.64.154.126, 2606:4700:4400::6812:2182, 2606:4700:4400::ac40:9a7e
Response IP 172.64.154.126
Found Yes
Hash ab0dde637c87d0455e79c92ea4a793839183a76712497de4441e458ab22edd74
SimHash 235bc804effb

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /fr_FR/
Disallow /*pmin*
Disallow /*pmax*
Disallow /*prefn1*
Disallow /*prefn2*
Disallow /*prefn3*
Disallow /*prefn4*
Disallow /*prefv1*
Disallow /*prefv2*
Disallow /*prefv3*
Disallow /*prefv4*
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

msnbot

Rule Path
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

Other Records

Field Value
crawl-delay 30

bingbot

Rule Path
Disallow *Wishlist-*
Disallow *Cart-Show*
Disallow *Order-History*

Other Records

Field Value
crawl-delay 30

mj12bot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://sg.puma.com/sitemap_index.xml

Warnings

  • 2 invalid lines.