insecresources.org.np
robots.txt

Robots Exclusion Standard data for insecresources.org.np

Resource Scan

Scan Details

Site Domain insecresources.org.np
Base Domain insecresources.org.np
Scan Status Ok
Last Scan2026-01-27T18:50:21+00:00
Next Scan 2026-02-10T18:50:21+00:00

Last Scan

Scanned2026-01-27T18:50:21+00:00
URL https://insecresources.org.np/robots.txt
Domain IPs 82.180.144.109
Response IP 82.180.144.109
Found Yes
Hash a75b90ae8f66ca4dd54872e199a592bd15f66ed8f486d1dfb6b3c7a8d5fc245f
SimHash ca1a495f0bd0

Groups

*

Rule Path
Disallow /filestore

Other Records

Field Value
crawl-delay 10

Comments

  • Sample robots.txt file - ensures that a Google Appliance can still access the spider page (if configured)
  • and assumes an installation in the site root. For sites in a subfolder you must move the robots.txt file
  • to the site root and alter the paths accordingly.