patreon.com
robots.txt

Robots Exclusion Standard data for patreon.com

Resource Scan

Scan Details

Site Domain patreon.com
Base Domain patreon.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-06-15T17:30:56+00:00
Next Scan 2024-08-14T17:30:56+00:00

Last Successful Scan

Scanned 2024-04-10T16:00:23+00:00
URL https://patreon.com/robots.txt
Redirect https://www.patreon.com:443/robots.txt
Redirect Domain www.patreon.com
Redirect Base patreon.com
Domain IPs 104.16.24.14, 104.16.25.14, 2606:4700::6810:180e, 2606:4700::6810:190e
Redirect IPs 104.16.24.14, 104.16.25.14, 2606:4700::6810:180e, 2606:4700::6810:190e
Response IP 104.16.25.14
Found Yes
Hash cf8841ad58dbd88b186feb2bd32f5e9ee548d5ef02f6906da51fa316c49b2dad
SimHash 6c07d8494033

Groups

*
mozilla/5.0 (compatible; google-podcast)

Rule Path
Disallow /settings
Disallow /logout
Disallow /bePatronDone
Disallow /productPurchaseDone
Disallow /mwebWindowDone
Disallow /file
Disallow /patronNext
Disallow /userNext
Disallow /bePatron
Disallow /REST/auth/CSRFTicket
Disallow /user.php
Disallow /_generated
Disallow /api/
Disallow /rss/
Disallow /_private/admin-tools/
Disallow /corgi$
Disallow /checkout/

petalbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /
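
The groups above can be read as follows: a crawler picks the group whose user-agent token matches its own name, falls back to `*` otherwise, and treats each Disallow path as a prefix pattern (with a trailing `$` anchoring the end of the path, as in `/corgi$`). The Python sketch below illustrates that reading on a subset of the listed rules. The names `GROUPS`, `pattern_to_regex`, `select_group`, and `is_allowed` are hypothetical, and the substring match on the user-agent string is a simplifying assumption, not the behavior of any particular crawler.

```python
import re

# Subset of the groups listed above; tokens are lowercased as in the report.
# Illustration only: the full "*" group contains more Disallow paths.
GROUPS = {
    "*": ["/settings", "/logout", "/api/", "/rss/", "/corgi$", "/checkout/"],
    "gptbot": ["/"],
    "ccbot": ["/"],
}


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end
    of the path. Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def select_group(user_agent):
    """Return the rules of the first named group whose token occurs in
    the user-agent string (assumed substring match), else the '*' group."""
    ua = user_agent.lower()
    for token, rules in GROUPS.items():
        if token != "*" and token in ua:
            return rules
    return GROUPS["*"]


def is_allowed(user_agent, path):
    """A path is allowed unless some Disallow pattern matches it from the start.

    Only Disallow rules appear in the report, so no Allow/Disallow
    precedence needs to be resolved here.
    """
    rules = select_group(user_agent)
    return not any(pattern_to_regex(rule).match(path) for rule in rules)


if __name__ == "__main__":
    for agent, path in [
        ("GPTBot/1.0", "/home"),
        ("ExampleBot", "/home"),
        ("ExampleBot", "/api/user"),
        ("ExampleBot", "/corgi"),
        ("ExampleBot", "/corgis"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, `gptbot` is blocked from every path, while an unlisted crawler is blocked from `/corgi` exactly but not from `/corgis`, because the `$` ends the match at the end of the path.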

Other Records

Field Value
sitemap http://www.patreon.com/sitemap.xml