smartcloudprint.com
robots.txt

Robots Exclusion Standard data for smartcloudprint.com

Resource Scan

Scan Details

Site Domain smartcloudprint.com
Base Domain smartcloudprint.com
Scan Status Ok
Last Scan2025-12-19T07:01:25+00:00
Next Scan 2025-12-26T07:01:25+00:00

Last Scan

Scanned2025-12-19T07:01:25+00:00
URL http://smartcloudprint.com/robots.txt
Domain IPs 15.73.145.56
Response IP 15.73.145.56
Found Yes
Hash 1e814d48d651d275d877081eb7422909bffbd6186b68c2624627724adf0fb0ae
SimHash a850995f2ef0

Groups

*

Rule Path
Allow /
Disallow */search-results
Disallow */find.do
Disallow */video-gallery/
Disallow /media/
Disallow */filter.do
Disallow */search.do
Disallow */index.do
Disallow */details.do
Disallow */assets/*
Disallow */mpc/*
Disallow */upp/*

Other Records

Field Value
sitemap https://www8.hp.com/sitemap.xml
sitemap https://www8.hp.com/sitemap-product-catalog.xml
sitemap https://www8.hp.com/sitemap-hreflang-global-10k-1.xml
sitemap https://www8.hp.com/sitemap-hreflang-global-10k-2.xml
sitemap https://www8.hp.com/sitemap-hreflang-global-10k-3.xml
sitemap https://www8.hp.com/sitemap-hreflang-global-10k-4.xml

Comments

  • robots.txt v 6.19.1 June 2019
  • Comments & revision requests should be sent to HP SEO Forum hp-seo-forum [at] hp.com
  • robots.txt file for www8.hp.com & www.hp.com
  • Format is:
  • User-agent: <name of bot>
  • Disallow: <nothing> | <path>
  • ------------------------------------------------------------------------------
  • Sitemaps