scaleocean.com
robots.txt

Robots Exclusion Standard data for scaleocean.com

Resource Scan

Scan Details

Site Domain scaleocean.com
Base Domain scaleocean.com
Scan Status Ok
Last Scan2025-09-22T09:12:59+00:00
Next Scan 2025-10-22T09:12:59+00:00

Last Scan

Scanned2025-09-22T09:12:59+00:00
URL https://scaleocean.com/robots.txt
Domain IPs 104.26.10.240, 104.26.11.240, 172.67.75.39, 2606:4700:20::681a:af0, 2606:4700:20::681a:bf0, 2606:4700:20::ac43:4b27
Response IP 172.67.75.39
Found Yes
Hash 7046b4a999c9555121e03fd36a4ef16a314bdb77a85a7f627efa7241c089e8b4
SimHash 581c3942e161

Groups

ahrefssiteaudit
telegrambot

Rule Path
Disallow /

gptbot

Rule Path
Allow /

google-extended

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /id/offer/*
Disallow /sg/offer/*
Disallow /id/thank-you
Disallow /sg/thank-you
Disallow /id/*?*
Disallow /id/blog/page/
Disallow /id/blog/feed/
Disallow /id/blog/*/feed
Disallow /id/blog/search/
Disallow /id/blog/tag/
Disallow /id/blog/wp-json/
Disallow /id/blog/industry/
Disallow /id/blog/search/
Disallow /id/blog/wp-content/uploads/*?amp=*
Disallow /id/blog/*/page/*
Disallow /sg/*?*
Disallow /sg//blog/page/
Disallow /sg//blog/feed/
Disallow /sg/blog/*/feed
Disallow /sg//blog/search/
Disallow /sg//blog/tag/
Disallow /sg//blog/wp-json/
Disallow /sg//blog/industry/
Disallow /sg//blog/search/
Disallow /sg//blog/*/page/*
Disallow /blog/

Other Records

Field Value
sitemap https://scaleocean.com/id/sitemap.xml
sitemap https://scaleocean.com/sg/sitemap.xml