the-area51.com
robots.txt

Robots Exclusion Standard data for the-area51.com

Resource Scan

Scan Details

Site Domain the-area51.com
Base Domain the-area51.com
Scan Status Ok
Last Scan2025-09-05T16:36:36+00:00
Next Scan 2025-09-12T16:36:36+00:00

Last Scan

Scanned2025-09-05T16:36:36+00:00
URL https://the-area51.com/robots.txt
Domain IPs 104.21.112.1, 104.21.16.1, 104.21.32.1, 104.21.48.1, 104.21.64.1, 104.21.80.1, 104.21.96.1, 2606:4700:3030::6815:1001, 2606:4700:3030::6815:2001, 2606:4700:3030::6815:3001, 2606:4700:3030::6815:4001, 2606:4700:3030::6815:5001, 2606:4700:3030::6815:6001, 2606:4700:3030::6815:7001
Response IP 104.21.112.1
Found Yes
Hash ae0bc5e553c1181fec1e0e85c6297f30768f948d39a19cf541fd3ec96abce479
SimHash 8b1ecc73e716

Groups

*

Rule Path
Disallow /user_content/*
Disallow /user_content/
Disallow /an/*
Disallow /an/
Disallow /out/*
Disallow /out/
Disallow /ajax/
Disallow /ajax/*

ia_archiver

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

grub

Rule Path
Disallow /

looksmart

Rule Path
Disallow /

webzip

Rule Path
Disallow /