raisingbharat.com
robots.txt

Robots Exclusion Standard data for raisingbharat.com

Resource Scan

Scan Details

Site Domain raisingbharat.com
Base Domain raisingbharat.com
Scan Status Ok
Last Scan2026-01-07T01:12:24+00:00
Next Scan 2026-02-06T01:12:24+00:00

Last Scan

Scanned2026-01-07T01:12:24+00:00
URL https://raisingbharat.com/robots.txt
Redirect https://www.raisingbharat.com/robots.txt
Redirect Domain www.raisingbharat.com
Redirect Base raisingbharat.com
Domain IPs 148.72.90.87
Redirect IPs 148.72.90.87
Response IP 148.72.90.87
Found Yes
Hash 0644bfed1364a859cdf0c2cb27edc2ebb610552449ba9a96fbd0d66927db2df4
SimHash 6a085410e7d3

Groups

*

Rule Path
Allow /

facebookexternalhit

Rule Path
Allow /

twitterbot

Rule Path
Allow /

linkedinbot

Rule Path
Allow /

whatsapp

Rule Path
Allow /

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

googlebot

Rule Path
Allow /
Allow /about
Allow /contact
Allow /buy-pixel
Allow /category/

*

Rule Path
Disallow /admin/
Disallow /api/
Disallow /internal/
Disallow /*.json$

*

Rule Path
Allow /og-image.jpg
Allow /twitter-card.jpg
Allow /favicon.ico
Allow /apple-touch-icon.png

Other Records

Field Value
sitemap https://raisingbharat.com/sitemap.xml

Comments

  • Sitemap
  • Social Media Bots
  • Search Engine Crawl-delay
  • Performance - Block resource-heavy crawling during peak hours
  • Allow important pages to be crawled frequently
  • Disallow admin/internal paths (future-proofing)
  • Rich snippets and social sharing