ijnrd.org
robots.txt

Robots Exclusion Standard data for ijnrd.org

Resource Scan

Scan Details

Site Domain ijnrd.org
Base Domain ijnrd.org
Scan Status Ok
Last Scan 2025-12-08T16:50:28+00:00
Next Scan 2026-01-07T16:50:28+00:00
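
The gap between the two timestamps above can be checked with a few lines of Python. This is only a sketch of the arithmetic; the report does not state how the next scan time is chosen, so the idea of a fixed rescan interval is an assumption.

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2025-12-08T16:50:28+00:00")
next_scan = datetime.fromisoformat("2026-01-07T16:50:28+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 30
```

The two values are exactly 30 days apart, which is consistent with (but does not prove) a 30-day rescan schedule.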

Last Scan

Scanned 2025-12-08T16:50:28+00:00
URL https://ijnrd.org/robots.txt
Domain IPs 147.93.16.245
Response IP 147.93.16.245
Found Yes
Hash 629ea813a900b623bc0b9717f093d4107b6b59de50f904007cdde2512e99fa9d
SimHash 0115ded165c7
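
The report does not name the algorithm behind the Hash field. Its 64 hexadecimal characters have the length of a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body; both the algorithm and the helper names (`sha256_hex`, `matches_recorded`) are assumptions made for illustration. The SimHash field is not covered here.

```python
import hashlib

# Hash value copied from the Last Scan fields above.
RECORDED_HASH = "629ea813a900b623bc0b9717f093d4107b6b59de50f904007cdde2512e99fa9d"

def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """Compare a freshly fetched robots.txt body against the recorded hash."""
    return sha256_hex(body) == recorded.lower()
```

If the assumption holds, passing the bytes fetched from the URL above to `matches_recorded` would indicate whether the file has changed since the scan. A mismatch could also mean the scanner hashes a normalized form of the file.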

Groups

googlebot
adsbot-google
googlebot-news
googlebot-image
bingbot
slurp
duckduckbot
baiduspider
yandexbot
ia_archiver
facebot
citeseerxbot
*

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /manager/
Disallow /ambe001manager9898157864parinjanvi/
Disallow /editordocs/
Disallow /PD.php
Disallow /HD.php
Disallow /AD.php
Disallow /PDS.php
Disallow /HDS.php
Disallow /invoicemanager.php
Disallow /invoice.php
Disallow /certificatemanager.php
Disallow /bestpapergeneratecerti.php
Disallow /confirmationlettermanager.php
Disallow /confgeneratecerti.php
Disallow /doiandhardcopy.php
Disallow /papers/*.doc$
Disallow /papers/*.docx$
Disallow /papers/*.png$
Disallow /papers/*.jpg$
Disallow /papers/*.jpeg$
Disallow /papers/*.gif$
Disallow /paper_not_published/
Allow /papers/*.pdf$
Allow /*.pdf$
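
How the Allow and Disallow rows above interact can be sketched with a small matcher. It follows the longest-match rule described in RFC 9309, where `*` matches any run of characters, a trailing `$` anchors the end of the path, the longest matching pattern wins, and Allow wins a tie. Individual crawlers may behave differently, only a subset of the rules is copied for brevity, and the file names in the usage example are hypothetical.

```python
import re

# A subset of the rules from the table above, as (kind, pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/manager/"),
    ("disallow", "/invoice.php"),
    ("disallow", "/papers/*.docx$"),
    ("disallow", "/paper_not_published/"),
    ("allow", "/papers/*.pdf$"),
    ("allow", "/*.pdf$"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # '*' becomes '.*'; everything else is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; paths matching no rule are allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow

# Hypothetical paths, for illustration only.
print(is_allowed("/index.html"))                 # True
print(is_allowed("/manager/login.php"))          # False
print(is_allowed("/papers/IJNRD123.pdf"))        # True
print(is_allowed("/papers/IJNRD123.docx"))       # False
print(is_allowed("/paper_not_published/x.pdf"))  # False
```

The last case shows a consequence of longest-match precedence: a PDF under /paper_not_published/ stays blocked, because the Disallow pattern for that directory is longer than the /*.pdf$ Allow pattern.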

Other Records

Field Value
sitemap https://ijnrd.org/sitemap.xml

Comments

  • ------------------------------
  • Robots.txt for ijrnd.org
  • Refined for Google, Bing, and other search engines
  • ------------------------------
  • Allow all major bots to crawl everything except restricted paths
  • Default rule for all other bots
  • ------------------------------
  • Disallow sensitive or non-public directories/files
  • ------------------------------
  • ------------------------------
  • Disallow non-PDF paper formats and unpublished papers
  • ------------------------------
  • Explicitly allow PDFs
  • ------------------------------
  • Sitemap for better indexing
  • ------------------------------