humaninterest.com
robots.txt

Robots Exclusion Standard data for humaninterest.com

Resource Scan

Scan Details

Site Domain humaninterest.com
Base Domain humaninterest.com
Scan Status Ok
Last Scan 2025-12-08T22:07:47+00:00
Next Scan 2026-01-07T22:07:47+00:00

Last Scan

Scanned 2025-12-08T22:07:47+00:00
URL https://humaninterest.com/robots.txt
Domain IPs 104.26.12.219, 104.26.13.219, 172.67.68.191, 2606:4700:20::681a:cdb, 2606:4700:20::681a:ddb, 2606:4700:20::ac43:44bf
Response IP 104.26.12.219
Found Yes
Hash 361e76d54aae629e8fb78b4851c9e7aa81184bcc5414be14e01ca6bb06b32f57
SimHash 01058c524391
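
The report does not say which algorithm produces the Hash field. Its 64 hexadecimal digits are consistent with a 256-bit digest such as SHA-256, so the sketch below assumes SHA-256 over the raw response body. The function name content_hash is hypothetical, and the scanner may normalize the file before hashing, so the result is not guaranteed to match the value listed above.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the unmodified bytes. The scan report
    does not confirm the algorithm or any normalization step.
    """
    return hashlib.sha256(body).hexdigest()


if __name__ == "__main__":
    # Illustrative body only, not the actual file served by the site.
    sample = b"User-agent: *\nDisallow: /lp/\n"
    print(content_hash(sample))
```

Comparing digests from two scans is a cheap way to detect that the file changed at all; the shorter SimHash field is presumably meant for spotting near-duplicate content, although the report does not describe how it is derived.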

Groups

smarshbot/1.0

Rule Path
Allow /

*

Rule Path
Disallow /lp/
Disallow */watch/*
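
The groups above can be read as a small rule table. The following is a minimal sketch of how a crawler might evaluate them, assuming the common convention that the longest matching pattern wins, that Allow wins a tie, that "*" matches any run of characters, and that a trailing "$" anchors the end of the path. The names GROUPS, pattern_to_regex, and is_allowed are hypothetical, and the user-agent lookup is simplified to an exact, case-insensitive match on the group name as it appears in the scan.

```python
import re

# Rules transcribed from the Groups section of the scan.
GROUPS = {
    "smarshbot/1.0": [("allow", "/")],
    "*": [("disallow", "/lp/"), ("disallow", "*/watch/*")],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" becomes ".*"; every other character is matched literally.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(user_agent, path):
    """Return True if the path may be fetched under the rules above."""
    # Simplification: exact match on the group name, else the "*" group.
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    best = None  # (pattern length, is_allow); the largest tuple wins
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for agent, path in [
        ("smarshbot/1.0", "/lp/offer"),
        ("examplebot", "/lp/offer"),
        ("examplebot", "/videos/watch/123"),
        ("examplebot", "/pricing"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Python's standard urllib.robotparser does not interpret "*" inside paths, which is why the sketch translates patterns to regular expressions instead. Under these assumptions, smarshbot/1.0 may fetch everything, while other crawlers are kept out of /lp/ and out of any path containing /watch/.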

Other Records

Field Value
sitemap https://humaninterest.com/sitemap.xml