justjeans.jgl.co.nz
robots.txt

Robots Exclusion Standard data for justjeans.jgl.co.nz

Resource Scan

Scan Details

Site Domain justjeans.jgl.co.nz
Base Domain jgl.co.nz
Scan Status Ok
Last Scan2024-09-21T20:28:55+00:00
Next Scan 2024-10-21T20:28:55+00:00

Last Scan

Scanned2024-09-21T20:28:55+00:00
URL https://justjeans.jgl.co.nz/robots.txt
Domain IPs 23.215.7.16, 23.215.7.21
Response IP 96.17.180.48
Found Yes
Hash 9e38379eb6f31abe632f9dd6b8933c85b0d7e9ec405d05d182a3012dd505557e
SimHash 5501ca02edb0

Groups

*

Rule Path
Disallow */AjaxOrderItemDisplayView*
Disallow */CategoryDisplay*
Disallow */GiftCardsDisplayURL*
Disallow */InterestItemDisplay*
Disallow */LogonForm*
Disallow */OrderCalculate*
Disallow */OrderShippingBillingView*
Disallow */ProductDisplay*
Disallow */ProductQuickDisplayView*
Disallow */ReLogonFormView*
Disallow */ResetPasswordGuestErrorView*
Disallow */SafetyAndRecallDisplayURL*
Disallow */SearchDisplay*
Disallow */shop/OverlayDisplayView*
Disallow */ShoppingInfoView*
Disallow */UserRegistrationForm*
Disallow */webapp/*
Disallow *pageView*
Disallow *promo_name*

Other Records

Field Value
sitemap https://justjeans.jgl.co.nz/sitemap.xml

Comments

  • Allow all search engines
  • Reduce raw url indexing and miscellaneous URLs
  • Block campaign tracking from indexing