gnc.com
robots.txt

Robots Exclusion Standard data for gnc.com

Resource Scan

Scan Details

Site Domain gnc.com
Base Domain gnc.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-19T08:42:39+00:00
Next Scan 2024-11-18T08:42:39+00:00

Last Successful Scan

Scanned2024-06-29T08:41:18+00:00
URL https://gnc.com/robots.txt
Redirect https://www.gnc.com/robots.txt
Redirect Domain www.gnc.com
Redirect Base gnc.com
Domain IPs 204.2.133.124
Redirect IPs 116.51.25.143, 116.51.25.144, 116.51.25.145, 116.51.25.146
Response IP 116.51.25.143
Found Yes
Hash e1faf12b9ddd87b46164b4570b2b3965562e338783b58de66fb9008e151d798e
SimHash 4e20d0c27755

Groups

*

Rule Path
Disallow /*?q*
Disallow /*dm3*
Disallow /*dm2*
Disallow /*dm1*
Disallow /*cgid*
Disallow /*prefn*
Disallow /*pmin*
Disallow /cart*
Disallow /account*
Disallow /s/*
Disallow /checkout*
Disallow /review-order*
Disallow /order-confirmation*
Disallow /search*
Disallow /orders*
Disallow /your-order*
Disallow /*srule*
Disallow /*productId*
Disallow /profile*
Disallow /register*
Disallow /*item_group_id*
Disallow /*title*
Disallow /*start%3D*
Disallow /*sz*
Disallow /*?categoryId=*
Disallow /*Product-Variation*
Disallow /*Stores-FindStoreInventory
Disallow /*format%3Dajax
Disallow /*fdid%3D*
Disallow /*gnc/600762.html
Disallow /*gnc-rx.html
Allow /dw/image*

mediapartners-google

Rule Path
Allow /dw/image*
Allow /*utm_medium*

googlebot-image

Rule Path
Allow /dw/image*
Allow /*utm_medium*

adsbot-google

Rule Path
Allow /dw/image*
Allow /*utm_medium*

Other Records

Field Value
sitemap https://www.gnc.com/sitemap_index.xml