crowdstorm.com
robots.txt

Robots Exclusion Standard data for crowdstorm.com

Resource Scan

Scan Details

Site Domain crowdstorm.com
Base Domain crowdstorm.com
Scan Status Ok
Last Scan5/28/2025, 4:30:25 PM
Next Scan 6/4/2025, 4:30:25 PM

Last Scan

Scanned5/28/2025, 4:30:25 PM
URL https://crowdstorm.com/robots.txt
Redirect https://www.crowdstorm.com/robots.txt
Redirect Domain www.crowdstorm.com
Redirect Base crowdstorm.com
Domain IPs 104.21.14.111, 172.67.158.170, 2606:4700:3031::ac43:9eaa, 2606:4700:3037::6815:e6f
Redirect IPs 104.21.14.111, 172.67.158.170, 2606:4700:3031::ac43:9eaa, 2606:4700:3037::6815:e6f
Response IP 104.21.14.111
Found Yes
Hash 1e5904a08a52cc3ed3385feb4e602f22761dacf11b0ecd28daa5f38d946650b1
SimHash 891ed9764a11

Groups

trovitbot

Rule Path
Disallow /

magpie-crawler/1.1

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /out
Disallow /out/
Disallow *?*
Allow *?v*

googlebot

Rule Path
Allow *?aj=true*