manoramanews.com
robots.txt

Robots Exclusion Standard data for manoramanews.com

Resource Scan

Scan Details

Site Domain manoramanews.com
Base Domain manoramanews.com
Scan Status Ok
Last Scan2024-04-26T13:11:14+00:00
Next Scan 2024-05-03T13:11:14+00:00

Last Scan

Scanned2024-04-26T13:11:14+00:00
URL https://manoramanews.com/robots.txt
Redirect https://www.manoramanews.com/robots.txt
Redirect Domain www.manoramanews.com
Redirect Base manoramanews.com
Domain IPs 104.71.49.14
Redirect IPs 23.52.112.217, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9
Response IP 23.54.56.229
Found Yes
Hash afd5ce1b3027189166867a9ee263171e73249cca6c7c249ef60a4cd5b60e07b1
SimHash 02105b2471d3

Groups

*

Rule Path
Disallow /content/mm/mv/home.html
Disallow /analytics.html
Disallow /analytics.html/*
Disallow /ManoramanewsPortal/
Disallow /content/mm/en/*
Disallow /ACP/
Disallow /ACP/*
Disallow /in-depth/news-maker-2017.html
Disallow /search-results.html
Disallow /search-results.html*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /