janat.me
robots.txt

Robots Exclusion Standard data for janat.me

Resource Scan

Scan Details

Site Domain janat.me
Base Domain janat.me
Scan Status Ok
Last Scan2026-03-04T13:10:44+00:00
Next Scan 2026-04-03T13:10:44+00:00

Last Scan

Scanned2026-03-04T13:10:44+00:00
URL https://janat.me/robots.txt
Domain IPs 104.21.80.96, 172.67.177.39, 2606:4700:3033::ac43:b127, 2606:4700:3035::6815:5060
Response IP 104.21.80.96
Found Yes
Hash 220498f985d42777d8a671ae7316c849b671040f97a5463f644a61fe9483f487
SimHash 6981de53c423

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /_next/
Disallow /static/
Allow /ar-IQ/
Allow /en/
Allow /news/
Allow /activities/
Allow /seo-refresh.json

Other Records

Field Value
crawl-delay 1

bingbot

Rule Path
Allow /indexnow-key.txt

Other Records

Field Value
sitemap https://janat.me/sitemap.xml

Comments

  • Block admin and API routes
  • Allow crawling of all content
  • Crawl-delay (optional, helps prevent overwhelming server)
  • Sitemaps
  • IndexNow support