gamblii.org
robots.txt

Robots Exclusion Standard data for gamblii.org

Resource Scan

Scan Details

Site Domain gamblii.org
Base Domain gamblii.org
Scan Status Ok
Last Scan2026-02-27T00:12:49+00:00
Next Scan 2026-03-29T00:12:49+00:00

Last Scan

Scanned2026-02-27T00:12:49+00:00
URL https://gamblii.org/robots.txt
Domain IPs 104.21.63.235, 172.67.173.24, 2606:4700:3036::6815:3feb, 2606:4700:3036::ac43:ad18
Response IP 172.67.173.24
Found Yes
Hash 07263aff1f93a3f76c89d1ecc4aed7e77fe8f7ebf007e24aca210b961525986e
SimHash 084c3152c703

Groups

peer39_crawler

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

*

Rule Path
Allow /*.js
Allow /*.css
Allow /*.jpg
Allow /*.png
Allow /*.svg
Allow /*.gif
Allow /*.jpeg
Allow /*.webp
Allow /build/frontend/*.js
Allow /build/frontend/*.css
Allow /build/frontend/*.jpg
Allow /build/frontend/*.png
Allow /build/frontend/*.svg
Allow /build/frontend/*.gif
Allow /build/frontend/*.jpeg
Allow /build/frontend/*.webp
Disallow /cdn-cgi/l/email-protection
Disallow /*?
Disallow /*%26

Other Records

Field Value
sitemap https://gamblii.org/sitemap.xml