iohost.in
robots.txt
Robots Exclusion Standard data for iohost.in
Resource Scan
Scan Details
Site Domain | iohost.in |
Base Domain | iohost.in |
Scan Status | Ok |
Last Scan | 2025-10-13T05:32:29+00:00 |
Next Scan | 2025-10-20T05:32:29+00:00 |
Last Scan
Scanned | 2025-10-13T05:32:29+00:00 |
URL | https://www.iohost.in/robots.txt |
Domain IPs | 104.21.29.85, 172.67.148.165, 2606:4700:3030::ac43:94a5, 2606:4700:3035::6815:1d55 |
Response IP | 104.21.29.85 |
Found | Yes |
Hash | 47270d22eeda206a8e9d4d9c45f892f07584c9844324c8a98a1645a9b2f4ed7f |
SimHash | b510f5076514 |
Groups
user-agent: httrack | disallow: / |
user-agent: netcaptor | disallow: / |
user-agent: offline explorer | disallow: / |
user-agent: spiderku/0.9 | disallow: / |
user-agent: steeler | disallow: / |
user-agent: webcopier v3.3 | disallow: / |
user-agent: webcopier v3.2a | disallow: / |
user-agent: webcopier | disallow: / |
user-agent: webcrawler | disallow: / |
user-agent: web downloader/4.9 | disallow: / |
user-agent: web downloader/5.8 | disallow: / |
user-agent: webgather 3.0 | disallow: / |
user-agent: webstripper/2.56 | disallow: / |
user-agent: webzip/3.65 | disallow: / |
user-agent: webzip | disallow: / |
user-agent: wget | disallow: / |
user-agent: zao | disallow: / |
user-agent: zeus 2.6 | disallow: / |
user-agent: * | disallow: /cgi-bin/ |
No rules defined. All paths allowed.
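The groups listed above can be checked against a given crawler name with Python's standard `urllib.robotparser`. The sketch below is illustrative only: it uses a small subset of the listed groups rather than the full file, and the agent name `ExampleBot` and the test paths are hypothetical, not taken from the scan.

```python
from urllib.robotparser import RobotFileParser

# Subset of the groups shown in this report (not the complete robots.txt).
ROBOTS_LINES = [
    "user-agent: wget",
    "disallow: /",
    "",
    "user-agent: webzip",
    "disallow: /",
    "",
    "user-agent: *",
    "disallow: /cgi-bin/",
]

BASE_URL = "https://www.iohost.in"


def build_parser(lines):
    """Return a RobotFileParser loaded from in-memory robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, agent, path):
    """Ask the parser whether `agent` may fetch `path` on the scanned host."""
    return parser.can_fetch(agent, BASE_URL + path)


parser = build_parser(ROBOTS_LINES)

# "ExampleBot" is a hypothetical agent that matches only the "*" group.
for agent, path in [
    ("wget", "/index.html"),
    ("ExampleBot", "/index.html"),
    ("ExampleBot", "/cgi-bin/test"),
]:
    print(agent, path, is_allowed(parser, agent, path))
```

With these rules, a named downloader such as `wget` is refused everywhere, while an agent that falls through to the `*` group is refused only under `/cgi-bin/`. Matching details (case handling, version suffixes, names containing spaces) vary between parsers, so results from another implementation may differ.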