diceglory.com
robots.txt

Robots Exclusion Standard data for diceglory.com

Resource Scan

Scan Details

Site Domain diceglory.com
Base Domain diceglory.com
Scan Status Ok
Last Scan2024-10-10T01:09:12+00:00
Next Scan 2024-10-17T01:09:12+00:00

Last Scan

Scanned2024-10-10T01:09:12+00:00
URL https://diceglory.com/robots.txt
Domain IPs 104.21.7.122, 172.67.130.77, 2606:4700:3034::ac43:824d, 2606:4700:3037::6815:77a
Response IP 104.21.7.122
Found Yes
Hash 3ef301c6b17c8f6de65baada549890b9fe6314f66c05e4222ce8c77b67b91a8c
SimHash 68595a40c580

Groups

*

Rule Path
Disallow /?s=
Disallow /search

*

Rule Path
Disallow /trackback
Disallow /*trackback
Disallow /*trackback*
Disallow /*/trackback

*

Rule Path
Allow /feed/$
Disallow /feed/
Disallow /comments/feed/
Disallow /*/feed/$
Disallow /*/feed/rss/$
Disallow /*/trackback/$
Disallow /*/*/feed/$
Disallow /*/*/feed/rss/$
Disallow /*/*/trackback/$
Disallow /*/*/*/feed/$
Disallow /*/*/*/feed/rss/$
Disallow /*/*/*/trackback/$

googlebot

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://diceglory.com/sitemap_index.xml

Warnings

  • 21 invalid lines.