manis.com.hk
robots.txt

Robots Exclusion Standard data for manis.com.hk

Resource Scan

Scan Details

Site Domain manis.com.hk
Base Domain manis.com.hk
Scan Status Ok
Last Scan2025-10-17T03:25:07+00:00
Next Scan 2025-10-31T03:25:07+00:00

Last Scan

Scanned2025-10-17T03:25:07+00:00
URL https://manis.com.hk/robots.txt
Domain IPs 104.21.14.207, 172.67.160.142, 2606:4700:3030::ac43:a08e, 2606:4700:3034::6815:ecf
Response IP 104.21.14.207
Found Yes
Hash a1ffcb188546d93f9ab2de6ffab431e8e3a078fdf09ef29eb4a049aa4bca59f1
SimHash 6d49d4151815

Groups

yandex
mauibot
mauibot (crawler.feedback+wc@gmail.com)
petalbot
mj12bot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /cart
Disallow /cart_checkout
Disallow /payment
Disallow /confirmation
Disallow /user_cms
Disallow /business

*

Rule Path
Disallow /cart
Disallow /cart_checkout
Disallow /payment
Disallow /confirmation
Disallow /user_cms
Disallow /business
Disallow /*?lang=*
Disallow /*%26lang%3D*

Other Records

Field Value
sitemap http://manis.com.hk/sitemap.xml