manoramanews.com
robots.txt
Robots Exclusion Standard data for manoramanews.com
Resource Scan
Scan Details
Site Domain | manoramanews.com |
Base Domain | manoramanews.com |
Scan Status | Ok |
Last Scan | 2024-04-26T13:11:14+00:00 |
Next Scan | 2024-05-03T13:11:14+00:00 |
Last Scan
Scanned | 2024-04-26T13:11:14+00:00 |
URL | https://manoramanews.com/robots.txt |
Redirect | https://www.manoramanews.com/robots.txt |
Redirect Domain | www.manoramanews.com |
Redirect Base | manoramanews.com |
Domain IPs | 104.71.49.14 |
Redirect IPs | 23.52.112.217, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9 |
Response IP | 23.54.56.229 |
Found | Yes |
Hash | afd5ce1b3027189166867a9ee263171e73249cca6c7c249ef60a4cd5b60e07b1 |
SimHash | 02105b2471d3 |
Groups
*
Rule | Path |
---|---|
Disallow | /content/mm/mv/home.html |
Disallow | /analytics.html |
Disallow | /analytics.html/* |
Disallow | /ManoramanewsPortal/ |
Disallow | /content/mm/en/* |
Disallow | /ACP/ |
Disallow | /ACP/* |
Disallow | /in-depth/news-maker-2017.html |
Disallow | /search-results.html |
Disallow | /search-results.html* |