ricebowl.my
robots.txt

Robots Exclusion Standard data for ricebowl.my

Resource Scan

Scan Details

Site Domain ricebowl.my
Base Domain ricebowl.my
Scan Status Ok
Last Scan 2024-06-13T17:42:25+00:00
Next Scan 2024-06-20T17:42:25+00:00

Last Scan

Scanned 2024-06-13T17:42:25+00:00
URL https://ricebowl.my/robots.txt
Redirect https://www.ricebowl.my/robots.txt
Redirect Domain www.ricebowl.my
Redirect Base ricebowl.my
Domain IPs 104.21.3.226, 172.67.153.188, 2606:4700:3033::6815:3e2, 2606:4700:3034::ac43:99bc
Redirect IPs 104.21.3.226, 172.67.153.188, 2606:4700:3033::6815:3e2, 2606:4700:3034::ac43:99bc
Response IP 104.21.3.226
Found Yes
Hash 56899d49e1d4ec82c8f6f26f36a77e658af72411f0ab17534ea3400e585366dc
SimHash 7a15d8c2fe32

Groups

*

Rule Path
Disallow /jobseeker-login
Disallow /jobseeker
Disallow /job/apply-job?*
Disallow /api/*
Disallow /v3mkapi/*
Disallow /search/*
Disallow /v1/*
Disallow /mkrbapi/*
Disallow /adhjobclick/*
Disallow /*/apply.php?
Disallow /job-region/
Disallow /en/job-region/
Disallow /ms/job-region/
Disallow /zh/job-region/
Disallow /rnetjobclick/
Disallow /jobs/
Disallow /en/jobs/
Disallow /ms/jobs/
Disallow /zh/jobs/
Disallow /content/search*
Disallow /en/content/search*
Disallow /ms/content/search*
Disallow /zh/content/search*
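
Several paths in the * group use the * wildcard, so a plain prefix comparison is not enough to tell whether a given URL path is covered. The following is a minimal sketch of wildcard-aware matching, assuming the common convention that * matches any run of characters and a trailing $ anchors the end of the path. The names DISALLOW_ALL_AGENTS, pattern_to_regex and is_disallowed are hypothetical and exist only for illustration; the sketch is not the logic used by the scanner or by any particular crawler.

```python
import re

# Disallow paths copied from the "*" group listed above.
DISALLOW_ALL_AGENTS = [
    "/jobseeker-login",
    "/jobseeker",
    "/job/apply-job?*",
    "/api/*",
    "/v3mkapi/*",
    "/search/*",
    "/v1/*",
    "/mkrbapi/*",
    "/adhjobclick/*",
    "/*/apply.php?",
    "/job-region/",
    "/en/job-region/",
    "/ms/job-region/",
    "/zh/job-region/",
    "/rnetjobclick/",
    "/jobs/",
    "/en/jobs/",
    "/ms/jobs/",
    "/zh/jobs/",
    "/content/search*",
    "/en/content/search*",
    "/ms/content/search*",
    "/zh/content/search*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any run of characters, a trailing '$'
    anchors the end of the path, and everything else is a literal prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW_ALL_AGENTS) -> bool:
    """Return True if any Disallow pattern matches from the start of path.

    The group contains no Allow rules, so a single match is enough here.
    """
    return any(pattern_to_regex(p).match(path) for p in patterns)


if __name__ == "__main__":
    for sample in ["/jobs/123", "/en/jobs/", "/foo/apply.php?id=1", "/about"]:
        print(sample, is_disallowed(sample))
```

Because the rules are prefixes, /jobseeker also covers paths such as /jobseeker-profile, while /jobs (without the trailing slash) is not covered by /jobs/.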

yandex

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /
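
The five named groups above each carry a single Disallow / rule, which blocks the whole site for that crawler, while other crawlers fall back to the * group. A minimal sketch of checking this with Python's standard urllib.robotparser follows. The robots.txt text is reconstructed from the groups in this report rather than fetched from the site, and only one plain prefix rule from the * group is included, since the standard-library parser is used here only for prefix matching and not for * wildcards. The names ROBOTS_TXT and build_parser are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups in this report (assumption: the live file
# may differ in ordering, casing, comments, or additional directives).
ROBOTS_TXT = """\
User-agent: *
Disallow: /jobs/

User-agent: yandex
Disallow: /

User-agent: petalbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: dotbot
Disallow: /

User-agent: claudebot
Disallow: /
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    rp = build_parser()
    home = "https://www.ricebowl.my/"
    for agent in ["yandex", "petalbot", "ahrefsbot", "dotbot", "claudebot", "examplebot"]:
        print(agent, rp.can_fetch(agent, home))
```

Here examplebot is a made-up user agent standing in for any crawler without its own group; it is evaluated against the * group only.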