almanarasoft.com
robots.txt

Robots Exclusion Standard data for almanarasoft.com

Resource Scan

Scan Details

Site Domain almanarasoft.com
Base Domain almanarasoft.com
Scan Status Ok
Last Scan2026-02-10T01:19:23+00:00
Next Scan 2026-03-12T01:19:23+00:00

Last Scan

Scanned2026-02-10T01:19:23+00:00
URL https://almanarasoft.com/robots.txt
Domain IPs 50.87.253.152
Response IP 50.87.253.152
Found Yes
Hash 33353bf2a6fe9376fc3f8102470af0ac34eb5481892731021f9bc363b8a718ec
SimHash 67881e316431

Groups

*

Rule Path
Allow /
Allow /products/
Allow /mobile-apps/
Allow /hardware
Allow /about
Allow /contact
Allow /pricing
Allow /self-hosted-pricing
Allow /sectors
Allow /partners
Allow /reseller-program
Allow /news
Allow /careers
Allow /downloads
Allow /integrations
Allow /service-programs
Allow /knowledge-base
Allow /developers
Allow /academy
Allow /distributors/find
Allow /white-friday
Allow /auth
Allow /replication
Disallow /admin
Disallow /admin/
Disallow /admin-login
Disallow /create-admin
Disallow /reset-password
Disallow /api/
Disallow /support-tickets
Disallow /ticket_system
Disallow /distributors/*
Disallow /news/*
Disallow /version/*
Disallow /assets/
Disallow /*.js$
Disallow /*.css$

googlebot

Rule Path
Allow /
Disallow /admin
Disallow /admin/

googlebot-image

Rule Path
Allow /images/
Allow /og-image.jpg
Disallow /admin

bingbot

Rule Path
Allow /
Disallow /admin
Disallow /admin/

Other Records

Field Value
crawl-delay 1

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://almanarasoft.com/sitemap-index.xml
sitemap https://almanarasoft.com/sitemap.xml
sitemap https://almanarasoft.com/sitemap-images.xml
sitemap https://almanarasoft.com/sitemap-ar.xml

Comments

  • Robots.txt for AlmanaraSoft
  • https://almanarasoft.com
  • Last updated: 2025-12-17
  • ================================
  • Default rules for all crawlers
  • ================================
  • Allow crawling of main content
  • Block admin and sensitive areas
  • Block dynamic/parameterized pages
  • Block asset directories from indexing
  • ================================
  • Google-specific rules
  • ================================
  • ================================
  • Bing-specific rules
  • ================================
  • ================================
  • Block AI training bots
  • ================================
  • ================================
  • Block aggressive/spam bots
  • ================================
  • ================================
  • Sitemaps
  • ================================
  • ================================
  • Host directive
  • ================================

Warnings

  • `host` is not a known field.