unz.org
robots.txt

Robots Exclusion Standard data for unz.org

Resource Scan

Scan Details

Site Domain unz.org
Base Domain unz.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-02-16T21:10:52+00:00
Next Scan 2026-03-18T21:10:52+00:00

Last Successful Scan

Scanned2026-01-18T10:54:46+00:00
URL https://unz.org/robots.txt
Redirect https://www.unz.com/robots.txt
Redirect Domain www.unz.com
Redirect Base unz.com
Domain IPs 104.26.2.144, 104.26.3.144, 172.67.70.245, 2606:4700:20::681a:290, 2606:4700:20::681a:390, 2606:4700:20::ac43:46f5
Redirect IPs 104.26.12.29, 104.26.13.29, 172.67.69.241, 2606:4700:20::681a:c1d, 2606:4700:20::681a:d1d, 2606:4700:20::ac43:45f1
Response IP 104.26.12.29
Found Yes
Hash 4446bd343194d1ec95d539dae865d51f902815cf8d41ac974e3508b9610e8624
SimHash cc38d832caeb

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /comments/
Disallow /xfeed/
Disallow /feed/
Disallow /*?
Disallow *?replytocom
Disallow *?ItemSorts
Disallow *?lang
Disallow *?comment_count
Disallow *?last_comment_gmt
Disallow /*/Contents/
Disallow /*/Tree/
Disallow /*/Commentary/
Disallow /*/Search/
Disallow /*/*/*/*/*/*/*/*/

amazonbot

Rule Path
Disallow /

claudebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

bubing

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 4

Other Records

Field Value
sitemap http://www.unz.com/wp-content/uploads/sitemap.xml