al-marsd.com
robots.txt

Robots Exclusion Standard data for al-marsd.com

Resource Scan

Scan Details

Site Domain al-marsd.com
Base Domain al-marsd.com
Scan Status Ok
Last Scan2025-11-15T13:40:00+00:00
Next Scan 2025-11-22T13:40:00+00:00

Last Scan

Scanned2025-11-15T13:40:00+00:00
URL https://al-marsd.com/robots.txt
Domain IPs 104.26.8.123, 104.26.9.123, 172.67.75.33, 2606:4700:20::681a:87b, 2606:4700:20::681a:97b, 2606:4700:20::ac43:4b21
Response IP 104.26.9.123
Found Yes
Hash 4aeb63926eac068d008aacc728743cee35a21658e9ee7da78d1b28e149e5dc16
SimHash 4b92df137431

Groups

*
mediapartners-google

Rule Path
Disallow /admin/
Disallow /adminPanel/
Disallow /api/
Disallow /telescope/
Disallow /storage/
Disallow /vendor/
Disallow /bootstrap/
Disallow /config/
Disallow /database/
Disallow /.env
Disallow /artisan
Allow /css/
Allow /js/
Allow /images/
Allow /web/assets/
Allow /favicon.ico
Allow /robots.txt
Allow /sitemap.xml
Allow /article/
Allow /category/
Allow /section/
Allow /tag/
Allow /author/
Allow /search/

Other Records

Field Value
crawl-delay 1

googlebot
googlebot-mobile
adsbot-google
googlebot-image
googlebot-news
googlebot-video
mediapartners
mediapartners-google

Rule Path
Allow /

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 2

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

yandex
yandexbot
yandexmedia
yandeximages
yandexcatalog
yandexdirect
yandexblogs
yandexnews
yandexpagechecker
dotbot
rogerbot
mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

chatgpt-user
google-extended
ccbot
anthropic-ai

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://al-marsd.com/sitemap.xml
sitemap https://sport.al-marsd.com/sitemap.xml

Comments

  • Global rules for all crawlers
  • Sitemaps
  • Google family (explicit allow + polite delay)
  • Microsoft
  • Yahoo
  • Yandex family (inherits global)
  • Moz tools (inherits global)
  • Aggressive crawlers—completely blocked
  • AI-focused crawlers (inherits global)