highlyscalable.in
robots.txt

Robots Exclusion Standard data for highlyscalable.in

Resource Scan

Scan Details

Site Domain highlyscalable.in
Base Domain highlyscalable.in
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-12-14T14:39:06+00:00
Next Scan 2025-12-21T14:39:06+00:00
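
The timestamps in this report can be compared directly. The sketch below is only an arithmetic check on the three values shown here; the variable names are illustrative, and reading the gap between Last Scan and Next Scan as a fixed weekly schedule is an assumption, not something the report states.

```python
from datetime import datetime, timedelta

# Timestamps copied from the report (ISO 8601, UTC offset +00:00).
last_scan = datetime.fromisoformat("2025-12-14T14:39:06+00:00")
next_scan = datetime.fromisoformat("2025-12-21T14:39:06+00:00")
last_success = datetime.fromisoformat("2025-12-06T13:52:31+00:00")

# Gap between the failed scan and the next scheduled one.
rescan_interval = next_scan - last_scan

# Age of the last successful data at the time of the failed scan.
staleness = last_scan - last_success

# Assumption: a 7-day gap suggests a weekly rescan cadence.
is_weekly = rescan_interval == timedelta(days=7)
```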

Last Successful Scan

Scanned 2025-12-06T13:52:31+00:00
URL https://highlyscalable.in/robots.txt
Domain IPs 3.109.140.81
Response IP 3.109.140.81
Found Yes
Hash c4687d5bc0b0d96797755e7a8649f81a9feea79db922730e7cc3fdf31e678df7
SimHash 65b0c463cf37
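
The report does not say which algorithm produced the Hash value. Its 64 hexadecimal characters are consistent with SHA-256, so the sketch below assumes SHA-256 over the raw response body; that is a guess, and the helper names are hypothetical. It could be used to check whether a freshly fetched robots.txt is byte-identical to the scanned copy.

```python
import hashlib

# Hash value copied from the report.
REPORTED_HASH = "c4687d5bc0b0d96797755e7a8649f81a9feea79db922730e7cc3fdf31e678df7"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Assumption: the report hashes the raw response bytes with SHA-256."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would mean either that the file changed since the scan or that the scanner hashes something other than the raw bytes.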

Groups

*

Rule Path
Allow /blog
Allow /accounts
Allow /signup
Allow /reset
Allow /settings/*
Allow /wealth
Allow /search
Allow /?search*
Allow /pdf/
Disallow /*.pdf
Disallow /*.ppt
Disallow /*.doc
Disallow /*.xls
Disallow /admin
Disallow /login
Disallow /accounts
Disallow /logout
Disallow /signup
Disallow /reset
Disallow /settings/*
Disallow /share
Disallow /search
Disallow /?next*
Disallow /?search*
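
Several paths above appear under both Allow and Disallow (/accounts, /signup, /reset, /settings/*, /search, /?search*). How a crawler resolves that depends on its matching policy. The sketch below assumes the precedence described in RFC 9309: the longest matching pattern wins, and Allow wins a tie. The function names are illustrative, pattern length is measured in characters, and individual crawlers may behave differently.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("allow", "/blog"), ("allow", "/accounts"), ("allow", "/signup"),
    ("allow", "/reset"), ("allow", "/settings/*"), ("allow", "/wealth"),
    ("allow", "/search"), ("allow", "/?search*"), ("allow", "/pdf/"),
    ("disallow", "/*.pdf"), ("disallow", "/*.ppt"), ("disallow", "/*.doc"),
    ("disallow", "/*.xls"), ("disallow", "/admin"), ("disallow", "/login"),
    ("disallow", "/accounts"), ("disallow", "/logout"), ("disallow", "/signup"),
    ("disallow", "/reset"), ("disallow", "/settings/*"), ("disallow", "/share"),
    ("disallow", "/search"), ("disallow", "/?next*"), ("disallow", "/?search*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern: '*' is a wildcard, trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow
```

Under this policy the duplicated paths resolve to allowed, because each Allow ties with a Disallow of equal length. A path such as /pdf/report.pdf resolves to disallowed, because /*.pdf (6 characters) is longer than /pdf/ (5 characters).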

Other Records

Field Value
crawl-delay 1
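
crawl-delay is not part of RFC 9309, and crawlers that honor it interpret it in different ways. A common reading is a minimum number of seconds between successive requests to the same host. The sketch below assumes that reading; the class name and its injectable clock and sleep parameters are hypothetical, chosen so the behavior can be exercised without real waiting.

```python
import time


class CrawlDelayThrottle:
    """Enforce a minimum gap between requests (assumed meaning of crawl-delay)."""

    def __init__(self, delay_seconds=1.0, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay_seconds
        self.clock = clock
        self.sleep = sleep
        self.last = None  # time of the previous request, if any

    def wait(self):
        """Block until the delay has elapsed; return the seconds slept."""
        waited = 0.0
        if self.last is not None:
            remaining = self.delay - (self.clock() - self.last)
            if remaining > 0:
                self.sleep(remaining)
                waited = remaining
        self.last = self.clock()
        return waited
```

A crawler would call wait() immediately before each request to the host, with delay_seconds set to the value from the record above.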

Other Records

Field Value
sitemap https://highlyscalable.in/sitemap.xml

Warnings

  • `host` is not a known field.