kruthai.net
robots.txt

Robots Exclusion Standard data for kruthai.net

Resource Scan

Scan Details

Site Domain kruthai.net
Base Domain kruthai.net
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2025-10-14T16:59:51+00:00
Next Scan 2026-01-12T16:59:51+00:00

Last Successful Scan

Scanned 2024-03-01T15:40:41+00:00
URL http://kruthai.net/robots.txt
Domain IPs 103.91.189.179
Response IP 103.91.189.179
Found Yes
Hash a4f6e9067a9d53276872dc3c489f340d772ca0fc0b37d520b25526ea1ee0ef90
SimHash 2850f7824783

Groups

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

googlebot-mobile

Rule Path
Disallow

msnbot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

teoma

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

robozilla

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yahoo-mmcrawler

Rule Path
Disallow /

psbot

Rule Path
Disallow /

yahoo-blogs/v3.9

Rule Path
Disallow /

*

Rule Path
Disallow /
Disallow /cgi-bin/
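
The groups above can be checked with Python's standard urllib.robotparser module. The following is a minimal sketch, not the site's actual file: ROBOTS_TXT is a hypothetical reconstruction of three of the listed groups (googlebot, msnbot, and *), and build_parser and is_allowed are illustrative helper names. It assumes the empty Disallow path shown for googlebot means "nothing is disallowed", which is how the parser treats an empty value.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of a subset of the groups listed above.
# The original file's exact text is not available; this is an assumption.
ROBOTS_TXT = """\
User-agent: googlebot
Disallow:

User-agent: msnbot
Disallow: /

User-agent: *
Disallow: /
Disallow: /cgi-bin/
"""


def build_parser(text):
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, agent, url="http://kruthai.net/"):
    """Return True if the given user agent may fetch the URL."""
    return parser.can_fetch(agent, url)


parser = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    for agent in ("googlebot", "msnbot", "SomeOtherBot"):
        print(agent, is_allowed(parser, agent))
```

Under these assumptions, googlebot is permitted everywhere, while msnbot and any agent without its own group fall under "Disallow /". Since "/" already covers every path, the additional "/cgi-bin/" rule in the * group does not change the outcome.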