windcs.com
robots.txt

Robots Exclusion Standard data for windcs.com

Resource Scan

Scan Details

Site Domain windcs.com
Base Domain windcs.com
Scan Status Ok
Last Scan2024-11-13T10:39:13+00:00
Next Scan 2024-11-20T10:39:13+00:00

Last Scan

Scanned2024-11-13T10:39:13+00:00
URL https://www.windcs.com/robots.txt
Domain IPs 2600:9000:2721:5a00:c:5e61:c040:93a1, 2600:9000:2721:7800:c:5e61:c040:93a1, 2600:9000:2721:7c00:c:5e61:c040:93a1, 2600:9000:2721:9200:c:5e61:c040:93a1, 2600:9000:2721:ac00:c:5e61:c040:93a1, 2600:9000:2721:b400:c:5e61:c040:93a1, 2600:9000:2721:c600:c:5e61:c040:93a1, 2600:9000:2721:cc00:c:5e61:c040:93a1, 3.165.102.35, 3.165.102.48, 3.165.102.5, 3.165.102.63
Response IP 3.165.102.35
Found Yes
Hash 94458522095c0f54630c03d73c45c23a8c58c03332373270c4428c9c86df8917
SimHash 4815d880e392

Groups

*

Rule Path
Disallow /

Comments

  • Example 3: Block all crawlers except AdsBot (AdsBot crawlers must be named explicitly)