integriworks.net
robots.txt

Robots Exclusion Standard data for integriworks.net

Resource Scan

Scan Details

Site Domain integriworks.net
Base Domain integriworks.net
Scan Status Ok
Last Scan2026-01-22T17:13:29+00:00
Next Scan 2026-02-21T17:13:29+00:00

Last Scan

Scanned2026-01-22T17:13:29+00:00
URL http://integriworks.net/robots.txt
Domain IPs 69.144.123.100
Response IP 69.144.123.100
Found Yes
Hash bcd95a59b378837464090b8b675dbf5561ab5750af4ee887325be750c474f30d
SimHash a964da039962

Groups

*

Rule Path
Disallow /ads/
Disallow /banner/
Disallow /reports/
Disallow /test/
Disallow /OLD/

susedig

Rule Path
Disallow /ads/
Disallow /banner/
Disallow /reports/
Disallow /test/
Disallow /OLD/

stress-agent

Rule Path
Disallow /

Comments

  • exclude help system from robots
  • but allow htdig to index our doc-tree
  • disallow stress test