geekydice.com
robots.txt

Robots Exclusion Standard data for geekydice.com

Resource Scan

Scan Details

Site Domain geekydice.com
Base Domain geekydice.com
Scan Status Ok
Last Scan 2025-08-27T06:47:35+00:00
Next Scan 2025-09-26T06:47:35+00:00

Last Scan

Scanned 2025-08-27T06:47:35+00:00
URL https://www.geekydice.com/robots.txt
Domain IPs 195.85.88.122
Response IP 195.85.88.122
Found Yes
Hash 2ddf0e376c6f8222b65c162f190431e3a904894d9d9864eed6964f80a7036b8b
SimHash e514de0275d4

Groups

*

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional
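
The Disallow paths in this group are matched as prefixes, so a rule such as /cart also covers any path that merely starts with /cart. A minimal Python sketch of that behaviour, using the standard-library urllib.robotparser, is shown below. The robots.txt lines are reconstructed from the table above rather than copied from the raw file, the agent name ExampleBot and the path /products/d20 are hypothetical, and the /*theme_preview_id* rule is left out because urllib.robotparser does plain prefix matching and does not expand "*" inside a path.

```python
from urllib.robotparser import RobotFileParser

# Assumption: lines rebuilt from the "*" group table in this report,
# not taken from the raw robots.txt. The wildcard rule is omitted.
STAR_GROUP_LINES = [
    "User-agent: *",
    "Disallow: /admin",
    "Disallow: /login",
    "Disallow: /cart",
    "Disallow: /checkouts",
    "Disallow: /orders",
    "Disallow: /my-account",
    "Disallow: /search",
    "Disallow: /policies",
    "Disallow: /checkout-additional",
]

BASE_URL = "https://www.geekydice.com"


def build_parser(lines):
    """Return a RobotFileParser loaded from an iterable of robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, agent, path):
    """Check whether `agent` may fetch `path` on the scanned site."""
    return parser.can_fetch(agent, BASE_URL + path)


if __name__ == "__main__":
    star_parser = build_parser(STAR_GROUP_LINES)
    # "ExampleBot" is a hypothetical crawler with no group of its own,
    # so it falls back to the "*" group.
    for path in ("/cart", "/cartoon", "/products/d20"):
        print(path, is_allowed(star_parser, "ExampleBot", path))
```

Note that /cartoon is blocked as well, because it starts with /cart. Crawlers that implement wildcard matching may additionally block URLs matching the theme_preview_id rule, which this sketch does not model.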

adsbot-google

Rule Path
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /*theme_preview_id*
Disallow /checkout-additional

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /login
Disallow /cart
Disallow /checkouts
Disallow /orders
Disallow /my-account
Disallow /search
Disallow /policies
Disallow /*theme_preview_id*
Disallow /checkout-additional

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1
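
The crawl-delay records above can be read back programmatically. The following Python sketch, again based on urllib.robotparser, is one possible way to do it. The lines are reconstructed from the mj12bot and pinterest groups in this report, ExampleBot is a hypothetical agent name, and crawl-delay is a non-standard field that not every crawler honours.

```python
from urllib.robotparser import RobotFileParser

# Assumption: lines rebuilt from the mj12bot and pinterest groups above.
DELAY_LINES = [
    "User-agent: mj12bot",
    "Crawl-delay: 10",
    "",
    "User-agent: pinterest",
    "Crawl-delay: 1",
]


def crawl_delay_for(lines, agent):
    """Return the crawl delay in seconds for `agent`, or None if unset."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser.crawl_delay(agent)


def may_fetch(lines, agent, url):
    """Return True if `agent` is allowed to fetch `url` under `lines`."""
    parser = RobotFileParser()
    parser.parse(lines)
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for agent in ("MJ12bot", "Pinterest", "ExampleBot"):
        print(agent, crawl_delay_for(DELAY_LINES, agent))
```

Because neither group defines a Disallow rule, both agents may still fetch every path. The delay only asks them to pause between requests.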

googlebot

Rule Path
Disallow (empty path)

googlebot-image

Rule Path
Disallow (empty path)
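
A Disallow rule with an empty path blocks nothing, so the googlebot and googlebot-image groups effectively allow every path, and a crawler that matches its own named group does not fall back to the "*" group. The Python sketch below illustrates this with urllib.robotparser. The lines are a reduced reconstruction from this report (only one "*" rule is kept for brevity), and ExampleBot is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reduced reconstruction of two groups from this report.
EMPTY_DISALLOW_LINES = [
    "User-agent: *",
    "Disallow: /cart",
    "",
    "User-agent: googlebot",
    "Disallow:",
]

CART_URL = "https://www.geekydice.com/cart"


def can_fetch_cart(agent):
    """Return True if `agent` may fetch the cart URL under the rules above."""
    parser = RobotFileParser()
    parser.parse(EMPTY_DISALLOW_LINES)
    return parser.can_fetch(agent, CART_URL)


if __name__ == "__main__":
    # Googlebot uses its own group, whose empty Disallow permits everything.
    print("Googlebot", can_fetch_cart("Googlebot"))
    # ExampleBot has no group of its own and is held to the "*" rules.
    print("ExampleBot", can_fetch_cart("ExampleBot"))
```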

Other Records

Field Value Comment
sitemap https://www.geekydice.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.
sitemap https://www.geekydice.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.
sitemap https://www.geekydice.com/sitemap.xml This will automatically remain your current primary domain to assure correct indexing.

Comments

  • Google adsbot ignores robots.txt unless specifically named!