sg.carousell.com
robots.txt
Robots Exclusion Standard data for sg.carousell.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | sg.carousell.com |
Base Domain | carousell.com |
Scan Status | Ok |
Last Scan | 2024-11-11T13:59:41+00:00 |
Next Scan | 2024-11-18T13:59:41+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-11-11T13:59:41+00:00 |
URL | https://sg.carousell.com/robots.txt |
Redirect | https://www.carousell.sg/robots.txt |
Redirect Domain | www.carousell.sg |
Redirect Base | carousell.sg |
Domain IPs | 104.16.208.133, 104.16.209.133, 2606:4700::6810:d085, 2606:4700::6810:d185 |
Redirect IPs | 104.18.40.137, 172.64.147.119, 2606:4700:4400::6812:2889, 2606:4700:4400::ac40:9377 |
Response IP | 172.64.147.119 |
Found | Yes |
Hash | 9e873ab9ba9960eaed15a0c68adcb38720a3f9c50642007f588a8786d2310765 |
SimHash | ca19c9e24515 |
Groups
*
Rule | Path |
---|---|
Disallow | /activity/ |
Disallow | /archive/ |
Disallow | /inbox/ |
Disallow | /join/ |
Disallow | /myprofile/ |
Disallow | /*hl%3D*$ |
Disallow | /*/review/ |
Disallow | /search/ |
Disallow | /*? |
Allow | /api-service/*? |
Allow | /smart_render/*? |
Allow | /pdp/*?listing_id=* |
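
The Allow rules in this group only make sense alongside the broad `Disallow | /*?` rule, so it may help to see how the rules could be evaluated together. The sketch below assumes the precedence described in RFC 9309: the matching rule with the longest pattern wins, and Allow wins a tie. Individual crawlers may behave differently. The names `RULES`, `pattern_to_regex`, and `is_allowed` are illustrative and not part of the scan, the sample paths are made up, and percent-encoding normalization is deliberately left out.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/activity/"),
    ("disallow", "/archive/"),
    ("disallow", "/inbox/"),
    ("disallow", "/join/"),
    ("disallow", "/myprofile/"),
    ("disallow", "/*hl%3D*$"),
    ("disallow", "/*/review/"),
    ("disallow", "/search/"),
    ("disallow", "/*?"),
    ("allow", "/api-service/*?"),
    ("allow", "/smart_render/*?"),
    ("allow", "/pdp/*?listing_id=*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumed precedence: longest matching pattern wins, Allow wins ties,
    and a path that matches no rule is allowed.
    """
    best_len = -1
    best_allow = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    # Hypothetical paths, chosen only to exercise the rules.
    for sample in [
        "/p/some-listing-123/",
        "/p/some-listing-123/?ref=x",
        "/search/?q=bike",
        "/api-service/foo?x=1",
        "/pdp/abc?listing_id=5",
        "/u/someone/review/",
    ]:
        print(sample, is_allowed(sample))
```

Under these assumptions, any URL with a query string is blocked by `/*?` unless one of the three longer Allow patterns also matches it, which is why those Allow rules can override the shorter Disallow.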
Other Records
Field | Value |
---|---|
sitemap | https://www.carousell.sg/sitemap.xml |