global.ricohsoftware.com
robots.txt

Robots Exclusion Standard data for global.ricohsoftware.com

Resource Scan

Scan Details

Site Domain global.ricohsoftware.com
Base Domain ricohsoftware.com
Scan Status Ok
Last Scan2025-10-02T16:14:00+00:00
Next Scan 2025-11-01T16:14:00+00:00

Last Scan

Scanned2025-10-02T16:14:00+00:00
URL https://global.ricohsoftware.com/robots.txt
Domain IPs 76.76.21.21
Response IP 76.76.21.21
Found Yes
Hash 67dc08eb077d084fdbeca48a716aa86bedd02a9281b7824ba1e713cb0dbbed20
SimHash 6f0614cfeb97

Groups

*

Rule Path
Allow /
Disallow */App_Config*
Disallow /itchannel/
Disallow /en/about-us/terms-of-use
Disallow /en/about-us/privacy-policy
Disallow */downloads/*
Disallow */en/search/*
Disallow /en/About-Us/Safe-Harbor-Privacy-Statement
Disallow */about/awards/*
Disallow /about/docs/pdf/NECS/
Disallow /technology/
Disallow /cloud-hosting-managed-it/
Disallow /en/products/supplies/search/
Disallow */test/*
Disallow */Test/*

googlebot-image

Rule Path
Allow /_next/image?*
Allow /_next/legacy/image?*

gsa-crawler

Rule Path
Allow /

ninjabot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.ricoh-usa.com/sitemap.xml

Comments

  • Ricoh Americas corporation
  • Disallow wellbehaved webcrawlers from indexing
  • Note to auditors: If your webscanning tool reports this robots.txt
  • file as a potential vulnerability, and suggests removing it, please
  • ignore it, and log a bug against the webscanning tool.