cybex.in
robots.txt

Robots Exclusion Standard data for cybex.in

Resource Scan

Scan Details

Site Domain cybex.in
Base Domain cybex.in
Scan Status Ok
Last Scan2025-08-20T12:35:01+00:00
Next Scan 2025-09-19T12:35:01+00:00

Last Scan

Scanned2025-08-20T12:35:01+00:00
URL https://cybex.in/robots.txt
Redirect https://www.cybex.in/robots.txt
Redirect Domain www.cybex.in
Redirect Base cybex.in
Domain IPs 104.26.2.238, 104.26.3.238, 172.67.68.134, 2606:4700:20::681a:2ee, 2606:4700:20::681a:3ee, 2606:4700:20::ac43:4486
Redirect IPs 104.26.2.238, 104.26.3.238, 172.67.68.134, 2606:4700:20::681a:2ee, 2606:4700:20::681a:3ee, 2606:4700:20::ac43:4486
Response IP 104.26.3.238
Found Yes
Hash 8ced4e985f8f709d4293a2f5a420582d5a6e634de566f48444a248a72ff2853f
SimHash 00d0990983b6

Groups

httrack

Rule Path
Disallow /

wget

Rule Path
Disallow /

curl

Rule Path
Disallow /

python

Rule Path
Disallow /

php

Rule Path
Disallow /

nmap

Rule Path
Disallow /

libwww

Rule Path
Disallow /

sogou

Rule Path
Disallow /

zmeu

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.cybex.in/sitemap.xml

Comments

  • Block specific user agents (unauthorized crawlers and website copiers)
  • Allow all legitimate crawlers
  • Specify sitemap for legitimate crawlers