banaka.in
robots.txt

Robots Exclusion Standard data for banaka.in

Resource Scan

Scan Details

Site Domain banaka.in
Base Domain banaka.in
Scan Status Ok
Last Scan2024-11-16T05:33:04+00:00
Next Scan 2024-11-23T05:33:04+00:00

Last Scan

Scanned2024-11-16T05:33:04+00:00
URL https://banaka.in/robots.txt
Domain IPs 35.219.200.3
Response IP 35.219.200.3
Found Yes
Hash d7e5375caf5148937d113cdc67061337000819c315142434e9eb2fa9ca265d20
SimHash 84b55e907c41

Groups

*

Rule Path
Allow /
Allow /bhartiya-nyay-sanhita-english/
Allow /privacy-policy
Allow /terms-of-service
Disallow /search
Disallow /user/
Disallow /account/
Disallow /admin/
Disallow /wp-admin/
Disallow /api/

Other Records

Field Value
sitemap https://www.banaka.in/sitemap.xml

Comments

  • robots.txt for https://www.banaka.in
  • Allow crawling of all content
  • Disallow crawling of search results, if you have a search function
  • Disallow crawling of user-specific content, if applicable
  • Prevent crawling of any potential admin areas
  • Prevent crawling of any API endpoints
  • Sitemap location
  • Crawl-delay directive (optional, use if you want to limit crawl rate)
  • Crawl-delay: 10