cpi-syndication.com
robots.txt

Robots Exclusion Standard data for cpi-syndication.com

Resource Scan

Scan Details

Site Domain cpi-syndication.com
Base Domain cpi-syndication.com
Scan Status Ok
Last Scan2026-01-15T04:42:39+00:00
Next Scan 2026-02-14T04:42:39+00:00

Last Scan

Scanned2026-01-15T04:42:39+00:00
URL https://cpi-syndication.com/robots.txt
Domain IPs 34.231.4.84
Response IP 34.231.4.84
Found Yes
Hash a75b90ae8f66ca4dd54872e199a592bd15f66ed8f486d1dfb6b3c7a8d5fc245f
SimHash ca1a495f0bd0

Groups

*

Rule Path
Disallow /filestore

Other Records

Field Value
crawl-delay 10

Comments

  • Sample robots.txt file - ensures that a Google Appliance can still access the spider page (if configured)
  • and assumes an installation in the site root. For sites in a subfolder you must move the robots.txt file
  • to the site root and alter the paths accordingly.