sahari.in
robots.txt

Robots Exclusion Standard data for sahari.in

Resource Scan

Scan Details

Site Domain sahari.in
Base Domain sahari.in
Scan Status Ok
Last Scan2025-03-13T04:59:32+00:00
Next Scan 2025-03-20T04:59:32+00:00

Last Scan

Scanned2025-03-13T04:59:32+00:00
URL https://sahari.in/robots.txt
Domain IPs 191.101.228.143, 2a02:4780:38:dbe2:7a38:410e:613e:1ddd
Response IP 77.37.115.223
Found Yes
Hash d536ef2ef73bd6b50f474f24139cfc1a761cdeb2fe6b2e16081f4347c1fbe567
SimHash 21b40d412755

Groups

*

Rule Path
Allow /weekly/
Allow /weekly/*/guest.php
Allow /monthly/
Allow /monthly/*/guest.php
Allow /serials/
Allow /stories/
Allow /freeBooks/
Disallow /weekly/*/files/
Disallow /weekly/*/files/mobile/
Disallow /weekly/*/files/thumb/
Disallow /monthly/*/files/
Disallow /monthly/*/files/mobile/
Disallow /monthly/*/files/thumb/
Disallow /admin/
Disallow /includes/
Disallow /system/
Disallow /app/
Disallow /membership/
Allow /ads.txt
Allow /gtag/js
Allow /pagead/
Allow /analytics.js
Allow /js/

Other Records

Field Value
sitemap https://sahari.in/sitemap.xml

Comments

  • Allow all user agents to crawl the site
  • âœ
  • ❌ Disallow crawling of flipbook images and thumbnails
  • ❌ Disallow any sensitive or backend folders (optional additions)
  • âœ
  • âœ

Warnings

  • 3 invalid lines.