manoramaonline.com
robots.txt

Robots Exclusion Standard data for manoramaonline.com

Resource Scan

Scan Details

Site Domain manoramaonline.com
Base Domain manoramaonline.com
Scan Status Ok
Last Scan 2024-04-28T04:01:20+00:00
Next Scan 2024-05-05T04:01:20+00:00

Last Scan

Scanned 2024-04-28T04:01:20+00:00
URL https://manoramaonline.com/robots.txt
Redirect https://www.manoramaonline.com/robots.txt
Redirect Domain www.manoramaonline.com
Redirect Base manoramaonline.com
Domain IPs 2.22.33.13
Redirect IPs 23.52.112.217, 2600:1413:b000:382::4a9, 2600:1413:b000:389::4a9
Response IP 23.54.56.229
Found Yes
Hash 2309326fb93c2b9e8b9a045e0d7db89be0822ab7ff2e6b2d199c2a13e6df1c34
SimHash 4c7530a84257
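
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the scan does not say which algorithm it uses. Assuming SHA-256 over the raw response body, a minimal sketch of how such a fingerprint could be computed to detect changes between scans follows. The function name `fingerprint` is hypothetical, and the sketch does not attempt to reproduce the SimHash value.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes of the response.
    The report itself does not confirm the algorithm or the input.
    """
    return hashlib.sha256(body).hexdigest()


# Two scans with identical bodies produce identical digests,
# so a changed digest signals a changed robots.txt.
previous = fingerprint(b"User-agent: gptbot\nDisallow: /\n")
current = fingerprint(b"User-agent: gptbot\nDisallow: /\n")
changed = previous != current
```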

Groups

*

Rule Path
Disallow /.myModalPoll*
Disallow /content/mm/ml/tool/
Disallow /.currencyRating
Disallow /.youAndYourVehiclePopup*
Disallow /.natureAndYouPopup*
Disallow /.momAndYouPopup*
Disallow /.postYourCreativePopup*
Disallow /.commonMsgPopup
Disallow /.shareemailpopup
Disallow /.greyedMsgPopup
Disallow /cgi-bin/
Disallow /advt/
Disallow /rss/
Disallow /mmfont/
Disallow /.MM*
Disallow /servlet/
Disallow /ADVT/
Disallow /Cgi-bin/
Disallow /home.html/.myModalPoll*
Disallow /analytics.html
Disallow /analytics.html/*
Disallow /ACP/
Disallow /ACP/*
Disallow /content/mm/tv
Disallow /OnlinePortal/
Disallow /content/mm/ml/mm-push-engage.html
Disallow /mm-push-engage.html
Disallow /*feed.feed.xml$
Disallow /content/mm/ml/
Disallow /content/mm/mo/indian-super-league-2019-20.html
Disallow /indian-super-league-2019-20.html
Disallow /search-results.html
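
Several paths in this group use the `*` wildcard and the `$` end anchor (for example `/*feed.feed.xml$`). The following is a minimal sketch of how such patterns are commonly matched against a URL path, following the wildcard rules described in RFC 9309. The helper name `rule_matches` is hypothetical, and the sketch ignores percent-encoding normalization and longest-match precedence between rules.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches a URL path."""
    # A trailing "$" anchors the pattern to the end of the path.
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; every other character is literal.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start, so an unanchored pattern is a prefix match.
    return re.match(regex, path) is not None
```

With this sketch, `rule_matches("/*feed.feed.xml$", "/news/feed.feed.xml")` is true, while the same pattern does not match `/news/feed.feed.xml?page=2` because of the end anchor. Matching is case-sensitive, which may be why the listing carries both `/advt/` and `/ADVT/`.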

mediapartners-google

Rule Path
Disallow /404.htm
Disallow /health/sex.html
Disallow /health/sex/
Disallow /health/sexual-health.html
Disallow /health/sexual-health
Disallow /tag-results

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /
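
The groups above can be checked programmatically. The sketch below uses Python's standard `urllib.robotparser` on a small robots.txt reconstructed from a subset of the listed rules. The reconstructed text is an assumption, not the site's actual file, and the helper names `build_parser` and `is_allowed` and the agent name `ExampleBot` are hypothetical. Only wildcard-free rules are included because `urllib.robotparser` does plain prefix matching.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the groups listed above (assumption: the real
# file uses the usual "User-agent:" / "Disallow:" line syntax).
ROBOTS_LINES = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /rss/
Disallow: /search-results.html

User-agent: mediapartners-google
Disallow: /health/sex/
Disallow: /tag-results

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /
""".splitlines()


def build_parser() -> RobotFileParser:
    """Parse the reconstructed rules without any network access."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser


def is_allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Check whether the given user agent may fetch a path on the site."""
    return parser.can_fetch(agent, "https://www.manoramaonline.com" + path)


parser = build_parser()
gptbot_ok = is_allowed(parser, "GPTBot", "/news.html")
generic_ok = is_allowed(parser, "ExampleBot", "/news.html")
```

A crawler that matches a named group follows only that group, so in this sketch `Mediapartners-Google` is not bound by the `*` group's `/rss/` rule, while `GPTBot` and `ChatGPT-User` are blocked from every path.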