prabhatkhabar.com
robots.txt
Robots Exclusion Standard data for prabhatkhabar.com
Resource Scan
Scan Details
Site Domain | prabhatkhabar.com |
Base Domain | prabhatkhabar.com |
Scan Status | Ok |
Last Scan | 2024-10-31T07:52:14+00:00 |
Next Scan | 2024-11-07T07:52:14+00:00 |
Last Scan
Scanned | 2024-10-31T07:52:14+00:00 |
URL | https://prabhatkhabar.com/robots.txt |
Redirect | https://www.prabhatkhabar.com/robots.txt |
Redirect Domain | www.prabhatkhabar.com |
Redirect Base | prabhatkhabar.com |
Domain IPs | 104.22.38.78, 104.22.39.78, 172.67.36.25, 2606:4700:10::6816:264e, 2606:4700:10::6816:274e, 2606:4700:10::ac43:2419 |
Redirect IPs | 104.22.38.78, 104.22.39.78, 172.67.36.25, 2606:4700:10::6816:264e, 2606:4700:10::6816:274e, 2606:4700:10::ac43:2419 |
Response IP | 172.67.36.25 |
Found | Yes |
Hash | 78ded164f5b87464c5c24c06f57ae42e7df67950d1820ede212bbc69a342a1b7 |
SimHash | 45008472f4f1 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin |
Allow | /wp-admin/admin-ajax.php |
Disallow | /*.html/feed$ |
Disallow | /*-story.html$ |
Disallow | /*.html$ |
Disallow | /tap.html |
Disallow | /tap.html?* |
Other Records
Field | Value |
---|---|
sitemap | https://www.prabhatkhabar.com/sitemap_index.xml |
sitemap | https://www.prabhatkhabar.com/news-sitemap.xml |
Comments