the-acap.org
robots.txt

Robots Exclusion Standard data for the-acap.org

Resource Scan

Scan Details

Site Domain the-acap.org
Base Domain the-acap.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonRequest timed out.
Last Scan2024-09-12T18:01:07+00:00
Next Scan 2024-12-11T18:01:07+00:00

Last Successful Scan

Scanned2021-11-13T18:43:56+00:00
URL http://the-acap.org/robots.txt
Found Yes
Hash baec49420a7fb0582f893948e15cc7aab349fc9a6a291e38710fe04813364197
SimHash 0154cb334b51

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /files/

Comments

  • ACAP version=1.1
  •    User-agent: *
  •    Allow: /
  •    User-agent: *
  •    Disallow:  /files/

Warnings

  • `acap-allow-crawl` is not a known field.
  • `acap-crawler` is not a known field.
  • `acap-disallow-crawl` is not a known field.