Robots Exclusion Standard data for sarthaks.com

Resource Scan

Scan Details

Site Domain sarthaks.com
Base Domain sarthaks.com
Scan Status Ok
Last Scan 2024-10-07T00:41:06+00:00
Next Scan 2024-10-14T00:41:06+00:00

Last Scan

Scanned 2024-10-07T00:41:06+00:00
URL https://sarthaks.com/robots.txt
Redirect https://www.sarthaks.com/robots.txt
Redirect Domain www.sarthaks.com
Redirect Base sarthaks.com
Domain IPs 104.26.6.26, 104.26.7.26, 172.67.73.49, 2606:4700:20::681a:61a, 2606:4700:20::681a:71a, 2606:4700:20::ac43:4931
Redirect IPs 104.26.6.26, 104.26.7.26, 172.67.73.49, 2606:4700:20::681a:61a, 2606:4700:20::681a:71a, 2606:4700:20::ac43:4931
Response IP 172.67.73.49
Found Yes
Hash 8bd7f5917da170db240d5e058c57227328aa8c1f43cd9d0dce1672c5a481ac43
SimHash 5a1ed8c0ccb9
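
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest. The report does not state which algorithm is used or which bytes are hashed, so the following is only a minimal sketch, assuming SHA-256 over the raw response body. The function name `body_digest` is hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return a hex digest of a robots.txt response body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    confirm the algorithm, so this may not reproduce its Hash field.
    """
    return hashlib.sha256(body).hexdigest()


# Any change to the file, even a single byte, yields a different digest,
# which is what makes such a value useful for detecting changes between scans.
digest_a = body_digest(b"User-agent: *\nDisallow: /login\n")
digest_b = body_digest(b"User-agent: *\nDisallow: /admin\n")
```

A cryptographic digest only says whether two versions are identical. A similarity hash such as the SimHash field is intended for the separate question of how close two versions are.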

Groups

*

Rule Path
Disallow /login
Disallow /forgot
Disallow /search?
Disallow /admin

Other Records

Field Value
crawl-delay 4

mj12bot

Rule Path
Disallow /

httrack

Rule Path
Disallow /

winhttrack

Rule Path
Disallow /

mozilla/5.0 (compatible; ezooms/1.0; ezooms.bot@gmail.com)

Rule Path
Disallow /

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

sindicebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

ccbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 30

wget

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

yahoo pipes 1.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sarthaks.com/sitemap.xml
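
The groups listed above can be checked programmatically. The sketch below rebuilds an approximate robots.txt from the scan data and queries it with Python's standard `urllib.robotparser`. The rebuilt text is an assumption: the scan shows user-agent names in lowercase and does not preserve the original casing, ordering, or comments. `ExampleBot` is a hypothetical crawler name used to exercise the `*` group.

```python
from urllib.robotparser import RobotFileParser

# Approximate reconstruction from the scan data (not the original file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /login
Disallow: /forgot
Disallow: /search?
Disallow: /admin
Crawl-delay: 4

User-agent: mj12bot
Disallow: /

User-agent: httrack
Disallow: /

User-agent: winhttrack
Disallow: /

User-agent: mozilla/5.0 (compatible; ezooms/1.0; ezooms.bot@gmail.com)
Disallow: /

User-agent: yandex
Crawl-delay: 30

User-agent: sindicebot
Crawl-delay: 30

User-agent: ccbot
Crawl-delay: 30

User-agent: wget
Disallow: /

User-agent: webreaper
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: yahoo pipes 1.0
Disallow: /

Sitemap: https://www.sarthaks.com/sitemap.xml
"""

parser = RobotFileParser()
# parse() takes an iterable of lines, so no network request is made.
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://www.sarthaks.com"

# A crawler with no dedicated group falls back to the "*" group.
print(parser.can_fetch("ExampleBot", BASE + "/login"))  # blocked path
print(parser.can_fetch("ExampleBot", BASE + "/"))       # not listed
print(parser.crawl_delay("ExampleBot"))                 # delay from "*"

# A crawler with its own group uses only that group.
print(parser.can_fetch("MJ12bot", BASE + "/"))          # Disallow: /
print(parser.can_fetch("Yandex", BASE + "/login"))      # no rules in group
print(parser.crawl_delay("Yandex"))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```

Note that a crawler matching a named group does not inherit the `*` rules: under this reconstruction, `yandex`, `sindicebot`, and `ccbot` are subject to a 30-second crawl delay but not to the `/login`, `/forgot`, `/search?`, or `/admin` exclusions. Matching details vary between parsers, so results from `urllib.robotparser` may differ from how a given crawler interprets the same file.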