arsenalinc.com
robots.txt

Robots Exclusion Standard data for arsenalinc.com

Resource Scan

Scan Details

Site Domain arsenalinc.com
Base Domain arsenalinc.com
Scan Status Ok
Last Scan2025-08-21T16:05:27+00:00
Next Scan 2025-09-20T16:05:27+00:00

Last Scan

Scanned2025-08-21T16:05:27+00:00
URL https://arsenalinc.com/robots.txt
Domain IPs 104.26.12.60, 104.26.13.60, 172.67.74.186, 2606:4700:20::681a:c3c, 2606:4700:20::681a:d3c, 2606:4700:20::ac43:4aba
Response IP 104.26.12.60
Found Yes
Hash 1201bd6dac8d88ba492d584afbb7d007588eb07cbd5ff8e86198294725223e5f
SimHash 604cdff0a693

Groups

*

Rule Path
Disallow /usa/Includes/
Disallow /usa/classes/
Disallow /usa/etc/
Disallow /usa/files/
Disallow /usa/lib/
Disallow /usa/sql/
Disallow /usa/upgrade/
Disallow /usa/var/export/
Disallow /usa/var/html/
Disallow /usa/var/import/
Disallow /usa/var/locale/
Disallow /usa/var/log/
Disallow /usa/var/run/
Disallow /usa/var/theme/
Disallow /usa/var/tmp/
Disallow /usa/admin.php
Disallow /usa/console.php
Disallow /usa/error_handler.php
Disallow /usa/https_check.php
Disallow /usa/install.php
Disallow /usa/LICENSE.txt
Disallow /usa/probe.php
Disallow /usa/public/error.css
Disallow /usa/public/error.html
Disallow /usa/public/error_image.png
Disallow /usa/top.inc.PHP53.php
Disallow /usa/top.inc.php
Disallow /usa/register.php

Other Records

Field Value
crawl-delay 10

Comments

  • robots.txt
  • Sitemap example
  • Sitemap: http://example.com/sitemap.xml
  • Directories
  • Files