prabhatkhabar.com
robots.txt

Robots Exclusion Standard data for prabhatkhabar.com

Resource Scan

Scan Details

Site Domain prabhatkhabar.com
Base Domain prabhatkhabar.com
Scan Status Ok
Last Scan 2024-10-31T07:52:14+00:00
Next Scan 2024-11-07T07:52:14+00:00

Last Scan

Scanned 2024-10-31T07:52:14+00:00
URL https://prabhatkhabar.com/robots.txt
Redirect https://www.prabhatkhabar.com/robots.txt
Redirect Domain www.prabhatkhabar.com
Redirect Base prabhatkhabar.com
Domain IPs 104.22.38.78, 104.22.39.78, 172.67.36.25, 2606:4700:10::6816:264e, 2606:4700:10::6816:274e, 2606:4700:10::ac43:2419
Redirect IPs 104.22.38.78, 104.22.39.78, 172.67.36.25, 2606:4700:10::6816:264e, 2606:4700:10::6816:274e, 2606:4700:10::ac43:2419
Response IP 172.67.36.25
Found Yes
Hash 78ded164f5b87464c5c24c06f57ae42e7df67950d1820ede212bbc69a342a1b7
SimHash 45008472f4f1
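
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a minimal sketch of how a scanner might detect that robots.txt changed between two scans; the function names `body_fingerprint` and `has_changed` are hypothetical and not taken from this report:

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a fetched robots.txt body.

    Assumption: the report's Hash field is SHA-256 over the raw bytes.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """True when the new body no longer matches the stored digest."""
    return body_fingerprint(body) != previous_hash.lower()
```

A scanner could store the digest from one scan and call `has_changed` with the body fetched at the next scan. An exact hash flags any byte-level difference, including whitespace-only edits.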

Groups

*

Rule Path
Disallow /wp-admin
Allow /wp-admin/admin-ajax.php
Disallow /*.html/feed$
Disallow /*-story.html$
Disallow /*.html$
Disallow /tap.html
Disallow /tap.html?*
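
Several of these rules rely on the `*` and `$` wildcards, which Python's standard `urllib.robotparser` treats as literal characters. The following is a minimal sketch of how a crawler might evaluate the group, assuming the precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and the sketch ignores percent-encoding normalization:

```python
import re

# The "*" group from the table above, in file order.
RULES = [
    ("disallow", "/wp-admin"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/*.html/feed$"),
    ("disallow", "/*-story.html$"),
    ("disallow", "/*.html$"),
    ("disallow", "/tap.html"),
    ("disallow", "/tap.html?*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    "*" matches any run of characters; a trailing "$" anchors the
    pattern to the end of the path. Everything else is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Apply longest-match precedence; paths with no match are allowed."""
    best_len, best_kind = -1, "allow"
    for kind, pattern in rules:
        # re.match anchors at the start, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"
```

Under these assumptions, `is_allowed("/wp-admin/admin-ajax.php")` is true because the Allow pattern is longer than `/wp-admin`, while `is_allowed("/state/news.html")` is false because the path ends in `.html`. The rule `/*.html$` already covers every path matched by `/*-story.html$`.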

Other Records

Field Value
sitemap https://www.prabhatkhabar.com/sitemap_index.xml
sitemap https://www.prabhatkhabar.com/news-sitemap.xml
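
Sitemap records are independent of any user-agent group. A short sketch of reading them with the standard library's `urllib.robotparser` (its `site_maps()` method needs Python 3.8 or later). The `ROBOTS_TXT` text is reconstructed from the fields in this report for illustration and is not a verbatim copy of the file; `list_sitemaps` is a hypothetical helper name:

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from this report for illustration; not the verbatim file.
ROBOTS_TXT = """\
User-agent: *
Disallow: /wp-admin
Allow: /wp-admin/admin-ajax.php

Sitemap: https://www.prabhatkhabar.com/sitemap_index.xml
Sitemap: https://www.prabhatkhabar.com/news-sitemap.xml
"""


def list_sitemaps(text):
    """Return the sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    # site_maps() returns None when no sitemap lines are present.
    return parser.site_maps() or []
```

Calling `list_sitemaps(ROBOTS_TXT)` returns the two URLs in file order. The same parser should not be used to evaluate the wildcard rules in this file, since it only does literal prefix matching.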

Comments

  • Block URLs ending with .html/feed
  • Block URLs ending with -story.html
  • Block URLs ending with .html
  • Sitemaps