
Robots Exclusion Standard data for whaleboxstudio.com

Resource Scan

Scan Details

Site Domain whaleboxstudio.com
Base Domain whaleboxstudio.com
Scan Status Ok
Last Scan 2026-01-05T07:07:20+00:00
Next Scan 2026-01-12T07:07:20+00:00

Last Scan

Scanned 2026-01-05T07:07:20+00:00
URL https://whaleboxstudio.com/robots.txt
Redirect https://whalebox.studio/robots.txt
Redirect Domain whalebox.studio
Redirect Base whalebox.studio
Domain IPs 104.21.22.243, 172.67.207.222, 2606:4700:3033::6815:16f3, 2606:4700:3037::ac43:cfde
Redirect IPs 104.21.19.93, 172.67.185.183
Response IP 172.67.185.183
Found Yes
Hash a1ebc5cf14127dd8b4ab04b30e2247f73b792b91741cbccd5f1e7813b309c8ab
SimHash 24b4a1725e74

Groups

*

Rule Path
Disallow /cgi-bin
Disallow /?
Disallow /wp-
Disallow *?s=
Disallow *%26s%3D
Disallow /search
Disallow /author/
Disallow *?attachment_id=
Disallow */feed
Disallow */rss
Disallow */embed
Disallow */page/
Disallow /games/*
Allow */uploads
Allow /*/*.js
Allow /*/*.css
Allow /wp-*.png
Allow /wp-*.jpg
Allow /wp-*.jpeg
Allow /wp-*.gif
Allow /wp-*.svg
Allow /wp-*.pdf
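
The group above mixes broad Disallow rules with narrower Allow rules, so the outcome for a given path depends on which rule takes precedence. The following is a minimal sketch of how a crawler might evaluate these rules, assuming the longest-match precedence described in RFC 9309 (the most specific pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex, and is_allowed are hypothetical and are not part of the scan data. The sketch does not normalize percent-encoding, so it is an illustration rather than a complete matcher.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/cgi-bin"),
    ("disallow", "/?"),
    ("disallow", "/wp-"),
    ("disallow", "*?s="),
    ("disallow", "*%26s%3D"),
    ("disallow", "/search"),
    ("disallow", "/author/"),
    ("disallow", "*?attachment_id="),
    ("disallow", "*/feed"),
    ("disallow", "*/rss"),
    ("disallow", "*/embed"),
    ("disallow", "*/page/"),
    ("disallow", "/games/*"),
    ("allow", "*/uploads"),
    ("allow", "/*/*.js"),
    ("allow", "/*/*.css"),
    ("allow", "/wp-*.png"),
    ("allow", "/wp-*.jpg"),
    ("allow", "/wp-*.jpeg"),
    ("allow", "/wp-*.gif"),
    ("allow", "/wp-*.svg"),
    ("allow", "/wp-*.pdf"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path):
    """Return True if the path (including any query string) may be crawled."""
    best = None  # (pattern length, is_allow) of the winning rule so far
    for kind, pattern in RULES:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow (True) beats disallow.
            if best is None or candidate > best:
                best = candidate
    # A path matched by no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/about", "/wp-admin/", "/wp-content/uploads/logo.png", "/games/foo"]:
        print(p, "allowed" if is_allowed(p) else "disallowed")
```

Under these assumptions, /wp-admin/ is blocked by the /wp- rule, while /wp-content/uploads/logo.png is crawlable because the longer Allow patterns */uploads and /wp-*.png outrank /wp-. Real crawlers may differ in tie-breaking and encoding details, so results from this sketch should be treated as indicative only.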