helpful1001.com
robots.txt

Robots Exclusion Standard data for helpful1001.com

Resource Scan

Scan Details

Site Domain helpful1001.com
Base Domain helpful1001.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a server error.
Last Scan 2025-11-15T16:58:06+00:00
Next Scan 2026-02-13T16:58:06+00:00

Last Successful Scan

Scanned 2024-07-01T15:23:50+00:00
URL https://helpful1001.com/robots.txt
Domain IPs 104.21.59.123, 172.67.177.78, 2606:4700:3034::ac43:b14e, 2606:4700:3036::6815:3b7b
Response IP 172.67.177.78
Found Yes
Hash dd9e0ff32eff52f95935df63329169cda8d227e8c43749378755f85ae986fb10
SimHash 0814c4c2c373

Groups

*

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Comments

  • Disallow Web Bots
  • Disallow Archive Bots
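
The groups and comments above can be checked against a standard parser. The following is a minimal sketch, assuming the original file looked roughly like the ROBOTS_TXT string below: the line order, spacing, and comment placement are reconstructed from this report, not copied from the site. The helper name can_fetch and the user agent "ExampleBot" are hypothetical and used only for illustration. It relies on Python's standard urllib.robotparser module.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups and Comments sections of this report.
# The exact layout of the original robots.txt is an assumption.
ROBOTS_TXT = """\
# Disallow Web Bots
User-agent: *
Disallow: /

# Disallow Archive Bots
User-agent: ia_archiver
Disallow: /
"""


def can_fetch(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules allow user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # "ExampleBot" is a hypothetical crawler that falls under the "*" group.
    for agent in ("ia_archiver", "ExampleBot"):
        allowed = can_fetch(agent, "https://helpful1001.com/")
        print(f"{agent}: {'allowed' if allowed else 'disallowed'}")
```

Under these assumptions both agents are refused for every path, since each group consists of a single "Disallow /" rule. The explicit ia_archiver group adds no restriction beyond the "*" group; it only names that crawler directly.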