Robots Exclusion Standard data for provenant.art

Resource Scan

Scan Details

Site Domain provenant.art
Base Domain provenant.art
Scan Status Ok
Last Scan 2026-02-04T09:39:24+00:00
Next Scan 2026-03-06T09:39:24+00:00
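
The gap between the two timestamps above can be checked with a few lines of Python. This is only a sketch of the arithmetic; that the scanner uses a fixed rescan interval is an inference from these two values, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2026-02-04T09:39:24+00:00")
next_scan = datetime.fromisoformat("2026-03-06T09:39:24+00:00")

# Difference between the scheduled next scan and the last scan.
interval = next_scan - last_scan

if __name__ == "__main__":
    print(f"Days between scans: {interval.days}")
```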

Last Scan

Scanned 2026-02-04T09:39:24+00:00
URL https://provenant.art/robots.txt
Redirect https://newsletter.provenant.art/robots.txt
Redirect Domain newsletter.provenant.art
Redirect Base provenant.art
Domain IPs 2001:8d8:100f:f000::200, 217.160.0.80
Redirect IPs 104.18.68.40, 104.18.69.40, 2606:4700::6812:4428, 2606:4700::6812:4528
Response IP 104.18.69.40
Found Yes
Hash 1fb8c8094373d6d4a9a99dd23a8e614510e0346961f87175f431e700777ab06d
SimHash 2f1d9c90bb71
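
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. A minimal sketch, assuming the digest is SHA-256 over the raw response body (the helper names `sha256_hex` and `matches_reported` are hypothetical), for comparing a freshly fetched copy against the reported value:

```python
import hashlib

# Hash value copied from the report above. Assumption: SHA-256 of the raw body.
REPORTED_HASH = "1fb8c8094373d6d4a9a99dd23a8e614510e0346961f87175f431e700777ab06d"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """True if body hashes to the reported value under the SHA-256 assumption."""
    return sha256_hex(body) == reported.lower()
```

A mismatch would not prove the file changed; it could equally mean the scanner hashes normalized text or uses a different algorithm.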

Groups

amazonbot

Rule Path
Disallow /

googlebot

Rule Path
Disallow /nogooglebot/

*

Rule Path
Disallow /login

adsbot-google

Rule Path
Disallow /login

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow /login

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://newsletter.provenant.art/sitemap.xml
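
The groups and records above can be exercised with Python's standard `urllib.robotparser`. The robots.txt text below is reconstructed from this report, so its ordering, casing, and whitespace are assumptions rather than the file's actual contents; `allowed` and `examplebot` are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report (assumed layout).
ROBOTS_TXT = """\
User-agent: amazonbot
Disallow: /

User-agent: googlebot
Disallow: /nogooglebot/

User-agent: *
Disallow: /login

User-agent: adsbot-google
Disallow: /login

User-agent: nutch
Disallow: /

User-agent: ahrefsbot
Disallow: /login
Crawl-delay: 10

User-agent: ahrefssiteaudit
Disallow: /login
Crawl-delay: 10

User-agent: mj12bot
Disallow: /login
Crawl-delay: 10

Sitemap: https://newsletter.provenant.art/sitemap.xml
"""

# Host that actually served the file after the redirect.
BASE = "https://newsletter.provenant.art"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())  # parse locally, no network access


def allowed(agent: str, path: str) -> bool:
    """Ask the parser whether agent may fetch path on BASE."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for agent, path in [("amazonbot", "/"), ("googlebot", "/login"),
                        ("examplebot", "/login"), ("examplebot", "/")]:
        print(agent, path, allowed(agent, path))
    print("mj12bot crawl-delay:", parser.crawl_delay("mj12bot"))
    print("sitemaps:", parser.site_maps())
```

Note that `urllib.robotparser` picks the first group whose name is a substring of the user agent, which can differ from the most-specific-match behavior of other crawlers, so results for a real bot may vary.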

Comments

  • beehiiv default robots.txt
  • This is automatically used when you leave custom content empty
  • Customize below or upload your own robots.txt file