discuss.streamlit.io
robots.txt

Robots Exclusion Standard data for discuss.streamlit.io

Resource Scan

Scan Details

Site Domain discuss.streamlit.io
Base Domain streamlit.io
Scan Status Ok
Last Scan2025-08-31T01:07:47+00:00
Next Scan 2025-09-30T01:07:47+00:00

Last Scan

Scanned2025-08-31T01:07:47+00:00
URL https://discuss.streamlit.io/robots.txt
Domain IPs 184.105.99.75, 2602:fd3f:3:ff02::4b
Response IP 184.105.99.75
Found Yes
Hash be77f4ebeb01e5da513a3f869c7796e5efa0343ef6d73c6bce973f06cb08a427
SimHash a89d1dc566d1

Groups

mauibot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seo spider

Rule Path
Disallow /

turnitin

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

*

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*
Disallow /badges
Disallow /my
Disallow /search
Disallow /tag/*/l
Disallow /g
Disallow /t/*/*.rss
Disallow /c/*.rss

googlebot

Rule Path
Disallow /admin/
Disallow /auth/
Disallow /assets/browser-update*.js
Disallow /email/
Disallow /session
Disallow /user-api-key
Disallow /*?api_key*
Disallow /*?*api_key*

Other Records

Field Value
sitemap https://discuss.streamlit.io/sitemap.xml

Comments

  • See https://datatracker.ietf.org/doc/rfc9309 for documentation on how to use the robots.txt file
  • Google uses the same format as the standard above. More info at https://developers.google.com/search/docs/crawling-indexing/robots/robots_txt