Robots Exclusion Standard data for yallah-lives.com

Resource Scan

Scan Details

Site Domain yallah-lives.com
Base Domain yallah-lives.com
Scan Status Ok
Last Scan 2026-03-03T04:54:32+00:00
Next Scan 2026-04-02T04:54:32+00:00

Last Scan

Scanned 2026-03-03T04:54:32+00:00
URL https://yallah-lives.com/robots.txt
Domain IPs 91.218.50.23
Response IP 91.218.50.23
Found Yes
Hash 335c74e00fb3148635a3e93d361bbe5a6b8a37440692d644070138d82357d373
SimHash e545a3f3c590

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /application/
Disallow /system/
Disallow /assets/cache/
Disallow /scripts/
Disallow /clear
Disallow /news/page/
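
The rule table above lists a blanket Allow / alongside several narrower Disallow paths. Under the longest-match precedence described in RFC 9309, the most specific matching rule decides, so the Disallow entries still take effect. The sketch below is a minimal illustration of that precedence, not the scanner's own logic: the name `is_allowed` is hypothetical, and it assumes plain prefix matching, which is enough here because none of the listed paths use `*` or `$` wildcards. It is hand-rolled because Python's `urllib.robotparser` applies rules in file order, so a leading Allow / would match every path first.

```python
# Rules from the "*" group in the table above, as (directive, path prefix) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/admin/"),
    ("disallow", "/application/"),
    ("disallow", "/system/"),
    ("disallow", "/assets/cache/"),
    ("disallow", "/scripts/"),
    ("disallow", "/clear"),
    ("disallow", "/news/page/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumptions: plain prefix matching only (no wildcard or percent-encoding
    handling); on equal prefix length "allow" wins; no match means allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_for_allow = len(prefix) == best_len and directive == "allow"
        if longer or tie_for_allow:
            best_len = len(prefix)
            allowed = directive == "allow"
    return allowed


if __name__ == "__main__":
    for candidate in ["/", "/news/some-article", "/news/page/2", "/admin/login"]:
        print(candidate, is_allowed(candidate))
```

Because matching is by prefix, Disallow /clear also covers any path that merely begins with those characters, such as /clearance, while /admin without the trailing slash is not covered by Disallow /admin/.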

Other Records

Field Value
crawl-delay 1
sitemap https://yallah-lives.com/sitemap.xml
sitemap https://yallah-lives.com/sitemap-news.xml
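
The crawl-delay and sitemap records can be read programmatically with Python's standard `urllib.robotparser`. The sketch below is illustrative only: `ROBOTS_TXT` is a shortened reconstruction assembled from the values in this report, not the file's verbatim contents, and it assumes the Crawl-delay line sits inside the `*` group with no blank line before it, since the standard parser ignores a crawl-delay that appears outside a user-agent group. `site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Assumed, shortened reconstruction of the file from the records in this report.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /admin/
Crawl-delay: 1
Sitemap: https://yallah-lives.com/sitemap.xml
Sitemap: https://yallah-lives.com/sitemap-news.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made here.
parser.parse(ROBOTS_TXT.splitlines())

delay = parser.crawl_delay("*")   # seconds to wait between requests, or None
sitemaps = parser.site_maps()     # list of sitemap URLs, or None if absent

if __name__ == "__main__":
    print("crawl-delay:", delay)
    print("sitemaps:", sitemaps)
```

A crawler honoring this record would pause for `delay` seconds between requests to the site, for example with `time.sleep(delay)`, before fetching the URLs listed in the sitemaps.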

Comments

  • Robots.txt for yallah-lives.com
  • Generated: 2026-03-03 04:54:33
  • Disallow admin and system directories
  • Disallow dynamic scripts
  • Disallow AJAX endpoints
  • Crawl-delay (optional, adjust as needed)
  • Sitemaps