discard.cc
robots.txt

Robots Exclusion Standard data for discard.cc

Resource Scan

Scan Details

Site Domain discard.cc
Base Domain discard.cc
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2/26/2025, 4:32:41 PM
Next Scan 5/27/2025, 4:32:41 PM

Last Successful Scan

Scanned10/30/2024, 4:10:15 PM
URL https://discard.cc/robots.txt
Domain IPs 104.21.64.108, 172.67.182.66, 2606:4700:3032::6815:406c, 2606:4700:3034::ac43:b642
Response IP 104.21.64.108
Found Yes
Hash c597ed70063022e521abf3182de83887801048c1c60587302137cfad2d2e9960
SimHash ee091b81af40

Groups

*

Rule Path
Disallow /guilds/*/join
Disallow /guilds/*/emojis
Disallow /cdn-cgi/*
Disallow /applications/*/icon
Disallow /applications/*/banner
Disallow /guilds/*/icon
Disallow /guilds/*/banner
Disallow /users/*/icon
Disallow /users/*/banner

Other Records

Field Value
sitemap https://discard.cc/sitemaps/sitemap-pages.xml
sitemap https://discard.cc/sitemaps/sitemap-guilds.xml

Comments

  • Discard robots.txt
  • Redirect Addresses do not need to be indexed