pangighatidanikapatrika.in
robots.txt

Robots Exclusion Standard data for pangighatidanikapatrika.in

Resource Scan

Scan Details

Site Domain pangighatidanikapatrika.in
Base Domain pangighatidanikapatrika.in
Scan Status Ok
Last Scan2024-06-22T01:07:24+00:00
Next Scan 2024-06-29T01:07:24+00:00

Last Scan

Scanned2024-06-22T01:07:24+00:00
URL https://pangighatidanikapatrika.in/robots.txt
Domain IPs 104.21.4.114, 172.67.132.7, 2606:4700:3035::6815:472, 2606:4700:3036::ac43:8407
Response IP 172.67.132.7
Found Yes
Hash 0c57826cb926e4377aa3007348eb640b147ee016ecef225503b3148e7cc10842
SimHash 8c421c667eb0

Groups

*

Rule Path
Allow /news-sitemap.xml
Disallow /admin/
Disallow /home/archive.php
Disallow /errorfound/error.php
Disallow /blocked.html
Disallow /jmobile/*
Disallow /mobile/*
Disallow /rashi/*
Disallow /home/homepage.php
Disallow /home
Disallow /noad*
Disallow /home1
Disallow /adtype1*
Disallow /edu/pdf*
Disallow /tags/*/false
Disallow /tags/*/false/
Disallow /news-*
Disallow /*%7B%7Burl%7D%7D*
Disallow /*%7B%7Bimage%7D%7D*
Disallow /tags/asifa*
Disallow /tags/*/tags/
Disallow /edu/*
Disallow /event/selfie/*
Disallow /news-brief/*
Disallow /podcasts
Disallow /tag/8301/rss

googlebot-news

Rule Path
Disallow /brand-post/news*
Disallow /tags/*/tags/
Disallow /impact-feature/*
Disallow /event/selfie/*
Disallow /news-brief/*
Disallow /podcasts

Other Records

Field Value
sitemap https://pangighatidanikapatrika.in/sitemap.xml
sitemap https://pangighatidanikapatrika.in/sitemaps/post-category_1.xml
sitemap https://pangighatidanikapatrika.in/news-sitemap.xml