unice.com
robots.txt

Robots Exclusion Standard data for unice.com

Resource Scan

Scan Details

Site Domain unice.com
Base Domain unice.com
Scan Status Ok
Last Scan2024-06-17T20:13:47+00:00
Next Scan 2024-07-01T20:13:47+00:00

Last Scan

Scanned2024-06-17T20:13:47+00:00
URL https://unice.com/robots.txt
Redirect https://www.unice.com/robots.txt
Redirect Domain www.unice.com
Redirect Base unice.com
Domain IPs 18.224.152.234, 3.139.167.39
Redirect IPs 23.52.40.34, 23.52.40.35, 2600:1417:3f::b81c:eb63, 2600:1417:3f::b81c:eb69
Response IP 184.50.85.164
Found Yes
Hash a41890da2251d1f8f9eca9ed90199d6abe730a104950658aebec13f19ec4f18e
SimHash d4699fe8ce50

Groups

*

Rule Path
Disallow /403
Disallow /404
Disallow /cart
Disallow /order
Disallow /search?*
Disallow /*?limit=*
Disallow /*?page=*
Disallow /customer
Disallow /login
Disallow /register
Disallow /forgotpassword
Disallow /test
Disallow /directory
Disallow /news
Disallow /catalog
Disallow /topic
Disallow /blog/*?id=

Other Records

Field Value
sitemap https://www.unice.com/sitemap/sitemap.xml
sitemap https://m.unice.com/sitemap/sitemap.xml

Comments

  • Crawlers Setup
  • Website Sitemap
  • Paths