carousell.com.hk
robots.txt

Robots Exclusion Standard data for carousell.com.hk

Resource Scan

Scan Details

Site Domain carousell.com.hk
Base Domain carousell.com.hk
Scan Status Ok
Last Scan2024-06-29T00:41:25+00:00
Next Scan 2024-07-06T00:41:25+00:00

Last Scan

Scanned2024-06-29T00:41:25+00:00
URL https://carousell.com.hk/robots.txt
Redirect https://www.carousell.com.hk/robots.txt
Redirect Domain www.carousell.com.hk
Redirect Base carousell.com.hk
Domain IPs 104.18.39.102, 172.64.148.154, 2606:4700:4400::6812:2766, 2606:4700:4400::ac40:949a
Redirect IPs 104.18.39.102, 172.64.148.154, 2606:4700:4400::6812:2766, 2606:4700:4400::ac40:949a
Response IP 104.18.39.102
Found Yes
Hash 6d9afaf16bd5dac2e5f65162dede6c78eb9327d2b6f543e25b50d5e410d6773d
SimHash 6a1149c24e25

Groups

*

Rule Path
Disallow /activity/
Disallow /archive/
Disallow /inbox/
Disallow /join/
Disallow /myprofile/
Disallow /*hl%3D*$
Disallow /*/review/
Disallow /search/*?
Disallow /*?
Allow /api-service/*?
Allow /smart_render/*?

grapeshot

Rule Path
Disallow /activity/
Disallow /archive/
Disallow /inbox/
Disallow /likes/
Disallow /sell/
Disallow /followers/
Disallow /following/

petalbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.carousell.com.hk/sitemap.xml