jainparichay.in
robots.txt

Robots Exclusion Standard data for jainparichay.in

Resource Scan

Scan Details

Site Domain jainparichay.in
Base Domain jainparichay.in
Scan Status Ok
Last Scan2026-01-04T06:21:18+00:00
Next Scan 2026-01-11T06:21:18+00:00

Last Scan

Scanned2026-01-04T06:21:18+00:00
URL https://jainparichay.in/robots.txt
Domain IPs 104.21.61.243, 172.67.217.38, 2606:4700:3030::6815:3df3, 2606:4700:3030::ac43:d926
Response IP 172.67.217.38
Found Yes
Hash 2cbae69e31d48aabc000330bd72de8bec98bed49fa1169b15a748613b2b1fe59
SimHash 490edad2c4e5

Groups

*

Rule Path
Allow /
Disallow /admin/
Disallow /api/
Disallow /src/
Disallow /scripts/
Disallow /docs/
Disallow /*.env
Disallow /*.log
Disallow /*.tmp
Disallow /*.bak
Disallow /*.swp
Disallow /node_modules/
Disallow /bun.lock
Disallow /.git/

Other Records

Field Value
crawl-delay 1

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

Other Records

Field Value
sitemap https://jainparichay.in/sitemap.xml

Comments

  • Sitemap location
  • Block access to admin areas
  • Block access to sensitive files
  • Block access to development files
  • Crawl delay for respectful crawling
  • Allow specific bots