windcs.com
robots.txt

Robots Exclusion Standard data for windcs.com

Resource Scan

Scan Details

Site Domain windcs.com
Base Domain windcs.com
Scan Status Ok
Last Scan 2024-07-02T15:59:09+00:00
Next Scan 2024-07-09T15:59:09+00:00

Last Scan

Scanned 2024-07-02T15:59:09+00:00
URL https://www.windcs.com/robots.txt
Domain IPs 2600:9000:2014:1200:c:5e61:c040:93a1, 2600:9000:2014:3400:c:5e61:c040:93a1, 2600:9000:2014:5c00:c:5e61:c040:93a1, 2600:9000:2014:8e00:c:5e61:c040:93a1, 2600:9000:2014:9000:c:5e61:c040:93a1, 2600:9000:2014:a00:c:5e61:c040:93a1, 2600:9000:2014:a800:c:5e61:c040:93a1, 2600:9000:2014:e600:c:5e61:c040:93a1, 3.165.102.35, 3.165.102.48, 3.165.102.5, 3.165.102.63
Response IP 3.165.102.48
Found Yes
Hash 94458522095c0f54630c03d73c45c23a8c58c03332373270c4428c9c86df8917
SimHash 4815d880e392

Groups

*

Rule Path
Disallow /
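
To illustrate how a generic parser would read the group above, here is a minimal sketch using Python's standard urllib.robotparser. The lines in ROBOTS_LINES are reconstructed from this report (the `*` group, its single Disallow rule, and the comment listed below), so the exact file contents are an assumption. The helper is_allowed and the agent name ExampleBot are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the scan report, not the verbatim file.
ROBOTS_LINES = [
    "# Example 3: Block all crawlers except AdsBot "
    "(AdsBot crawlers must be named explicitly)",
    "User-agent: *",
    "Disallow: /",
]


def is_allowed(user_agent: str, url: str) -> bool:
    """Return True if the reconstructed rules permit user_agent to fetch url."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network request is made.
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # ExampleBot is a hypothetical crawler name.
    for path in ("/", "/any/page"):
        url = "https://www.windcs.com" + path
        print(path, is_allowed("ExampleBot", url))
```

With a lone `Disallow: /` under `*`, every path is blocked for every agent this parser evaluates. The standard-library parser applies the wildcard group to all user agents, so it does not model the exception described in the comment, where Google's AdsBot crawlers ignore `*` unless they are named explicitly.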

Comments

  • Example 3: Block all crawlers except AdsBot (AdsBot crawlers must be named explicitly)