almanack.com
robots.txt

Robots Exclusion Standard data for almanack.com

Resource Scan

Scan Details

Site Domain almanack.com
Base Domain almanack.com
Scan Status Ok
Last Scan 2025-12-30T13:37:52+00:00
Next Scan 2026-01-06T13:37:52+00:00

Last Scan

Scanned 2025-12-30T13:37:52+00:00
URL https://almanack.com/robots.txt
Redirect https://www.almanack.com/robots.txt
Redirect Domain www.almanack.com
Redirect Base almanack.com
Domain IPs 216.98.79.41
Redirect IPs 216.98.79.41
Response IP 216.98.79.41
Found Yes
Hash 836c86381a79bc651008f54813ebfc61bfade531f611dec91d6cfc91837950c1
SimHash 3805d902f3d1

Groups

ia_archiver

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow /images/
Disallow /scripts/
Disallow /js/
Disallow /style/
Disallow /css/
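
The groups above can be checked against sample requests with Python's standard urllib.robotparser. This is a minimal sketch, not the site's actual file: ROBOTS_TXT is a reconstruction from the listed groups (the original layout, comments, and capitalization are unknown), and build_parser, is_allowed, and the "ExampleBot" agent are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the groups in this report, not a copy of
# the file served at https://www.almanack.com/robots.txt.
ROBOTS_TXT = """\
User-agent: ia_archiver
Disallow: /

User-agent: googlebot-image
Disallow: /

User-agent: surveybot
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: *
Disallow: /images/
Disallow: /scripts/
Disallow: /js/
Disallow: /style/
Disallow: /css/
"""


def build_parser(text):
    """Parse robots.txt text without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser, agent, path):
    """Return True if `agent` may fetch `path` on the redirect domain."""
    return parser.can_fetch(agent, "https://www.almanack.com" + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for agent, path in [
        ("MJ12bot", "/"),
        ("ia_archiver", "/about"),
        ("ExampleBot", "/images/logo.png"),
        ("ExampleBot", "/about"),
    ]:
        verdict = "allowed" if is_allowed(parser, agent, path) else "blocked"
        print(f"{agent:12} {path:20} {verdict}")
```

Under this reconstruction, the four named crawlers are blocked from the whole site, while any other agent falls through to the * group and is blocked only from the five asset directories.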