Robots Exclusion Standard data for wiglaf.org

Resource Scan

Scan Details

Site Domain wiglaf.org
Base Domain wiglaf.org
Scan Status Ok
Last Scan 2024-09-14T14:31:05+00:00
Next Scan 2024-10-14T14:31:05+00:00

Last Scan

Scanned 2024-09-14T14:31:05+00:00
URL https://www.wiglaf.org/robots.txt
Domain IPs 2600:9000:21f8:0:12:8255:3600:93a1, 2600:9000:21f8:200:12:8255:3600:93a1, 2600:9000:21f8:4200:12:8255:3600:93a1, 2600:9000:21f8:4a00:12:8255:3600:93a1, 2600:9000:21f8:600:12:8255:3600:93a1, 2600:9000:21f8:b800:12:8255:3600:93a1, 2600:9000:21f8:e00:12:8255:3600:93a1, 2600:9000:21f8:ec00:12:8255:3600:93a1, 52.85.5.110, 52.85.5.111, 52.85.5.33, 52.85.5.63
Response IP 52.85.49.87
Found Yes
Hash e610459000d7744f3fb54aa56c984a1d98d83a4d3210a0eb046bc926b4d58fe6
SimHash 0a24d8322833

Groups

*

Rule Path
Disallow /cgi-bin/w108
Disallow /sysinfo/
Disallow /cgi-bin/faqomatic
Disallow /~aaronm/ReciPants-v1.2
Disallow /~aaronm/photography/private/

petalbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

friendlycrawler

Rule Path
Disallow /
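The groups above can be checked programmatically. The following is a minimal sketch, assuming the rules are written back out as a robots.txt file in the usual User-agent / Disallow form; the exact text of the real file is not shown in this report, so ROBOTS_TXT below is a reconstruction, and the helper is_allowed and the agent name "examplebot" are hypothetical names used only for illustration. It uses Python's standard urllib.robotparser module.

```python
from urllib.robotparser import RobotFileParser

# Assumption: robots.txt text reconstructed from the groups listed in this
# report. The real file may differ in ordering, comments, or whitespace.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi-bin/w108
Disallow: /sysinfo/
Disallow: /cgi-bin/faqomatic
Disallow: /~aaronm/ReciPants-v1.2
Disallow: /~aaronm/photography/private/

User-agent: petalbot
Disallow: /

User-agent: ccbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: gptbot
Disallow: /

User-agent: barkrowler
Disallow: /

User-agent: friendlycrawler
Disallow: /
"""

# Parse the reconstructed rules once and reuse the parser for every lookup.
_parser = RobotFileParser()
_parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, url: str) -> bool:
    """Return True if `agent` may fetch `url` under the rules above.

    Hypothetical helper: a thin wrapper around RobotFileParser.can_fetch.
    """
    return _parser.can_fetch(agent, url)


if __name__ == "__main__":
    base = "https://www.wiglaf.org"
    checks = [
        ("gptbot", base + "/"),               # named group, Disallow /
        ("examplebot", base + "/"),           # falls back to the * group
        ("examplebot", base + "/sysinfo/x"),  # path under a * Disallow rule
    ]
    for agent, url in checks:
        print(agent, url, is_allowed(agent, url))
```

Agents without a group of their own, such as the hypothetical "examplebot", fall back to the * group, so they are blocked only from the five listed path prefixes. The six named crawlers are blocked from the whole site.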