newmi.in
robots.txt

Robots Exclusion Standard data for newmi.in

Resource Scan

Scan Details

Site Domain newmi.in
Base Domain newmi.in
Scan Status Ok
Last Scan 2024-10-19T18:41:16+00:00
Next Scan 2024-11-02T18:41:16+00:00

Last Scan

Scanned 2024-10-19T18:41:16+00:00
URL https://newmi.in/robots.txt
Redirect https://www.newmi.in/robots.txt
Redirect Domain www.newmi.in
Redirect Base newmi.in
Domain IPs 35.190.95.30
Redirect IPs 104.18.30.208, 104.18.31.208, 2606:4700::6812:1ed0, 2606:4700::6812:1fd0
Response IP 104.18.31.208
Found Yes
Hash 2c1db101afc19a33c09b72fbe86a201107110cd272a82e1560a12d846cc787da
SimHash 8b105e22c133

Groups

*

Rule Path
Disallow /admin
Disallow /checkout
Disallow /order
Disallow /user
Disallow /account
Disallow /collections/*%2B*
Disallow /cart
Disallow /?cat=usd&order=desc%C3%82
Disallow *?cat=usd&order=desc%C3%82
Disallow /?s%C3%82
Disallow */?s%C3%82
Disallow *?s=
Allow /page/careplans
Allow /page/period-plans
Allow /page/conception-plans
Allow /page/pregnancy-plans
Allow /page/pcos-plans
Allow /page/menopause-plans
Allow /page/postpregnancy-plans
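
Several paths in the group above contain `*`, which robots.txt parsers that support wildcards commonly treat as "any sequence of characters", with a trailing `$` anchoring the end of the path. The following is a minimal sketch of that matching, not the scanner's own logic: the function name `rule_matches` and the sample paths are hypothetical, and percent-encoding normalisation is deliberately left out.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt rule path matches a URL path.

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and everything else is a literal prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += r"\Z"
    # re.match anchors at the start, which gives prefix matching.
    return re.match(regex, path) is not None


if __name__ == "__main__":
    # Sample paths are made up for illustration.
    print(rule_matches("/collections/*%2B*", "/collections/tops%2Bblue"))
    print(rule_matches("*?s=", "/search?s=foo"))
    print(rule_matches("*?s=", "/page/careplans"))
```

A rule without any wildcard, such as `/cart`, falls through to plain prefix matching, so it also covers `/cart/items`.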

adsbot-google

Rule Path
Disallow /admin
Disallow /checkout
Disallow /cart

nutch

Rule Path
Disallow /
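
To see how the three groups above apply to different crawlers, the rules can be fed to Python's standard-library `urllib.robotparser`. This is a sketch under assumptions: the robots.txt text below is reconstructed from a subset of the rules listed in this report (the original file, including its one invalid line, is not reproduced here), `ExampleBot` is a hypothetical crawler name, and only plain prefix rules are included because the standard-library parser matches paths as prefixes.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed subset of the rules listed in this report (assumption:
# the real file may differ in ordering, spacing, and extra lines).
ROBOTS_TXT = """\
User-agent: *
Disallow: /admin
Disallow: /checkout
Disallow: /user
Disallow: /cart
Allow: /page/careplans

User-agent: adsbot-google
Disallow: /admin
Disallow: /checkout
Disallow: /cart

User-agent: nutch
Disallow: /
"""

BASE = "https://www.newmi.in"


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text from a string instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    checks = [
        ("ExampleBot", "/user"),            # falls back to the '*' group
        ("ExampleBot", "/page/careplans"),  # explicitly allowed
        ("adsbot-google", "/user"),         # own group has no /user rule
        ("nutch", "/page/careplans"),       # blocked from the whole site
    ]
    for agent, path in checks:
        print(agent, path, parser.can_fetch(agent, BASE + path))
```

A crawler with its own group is evaluated against that group only, which is why `adsbot-google` is not bound by the `/user` rule from the `*` group in this sketch.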

Other Records

Field Value
sitemap https://newmi.in/sitemap.xml

Warnings

  • 1 invalid line.