goosetheband.bandcamp.com
robots.txt

Robots Exclusion Standard data for goosetheband.bandcamp.com

Resource Scan

Scan Details

Site Domain goosetheband.bandcamp.com
Base Domain bandcamp.com
Scan Status Ok
Last Scan2024-04-17T13:46:13+00:00
Next Scan 2024-05-01T13:46:13+00:00

Last Scan

Scanned2024-04-17T13:46:13+00:00
URL https://goosetheband.bandcamp.com/robots.txt
Domain IPs 151.101.130.132, 151.101.194.132, 151.101.2.132, 151.101.66.132
Response IP 151.101.2.132
Found Yes
Hash 1d15737cb32f5ca2a1430f97dfb7db7ae0b5b9b7d3a967c39a29cf1e69703b56
SimHash 0238f85dd777

Groups

*

Rule Path
Disallow /tools
Disallow /checkout
Disallow /download_check
Disallow /cart/
Disallow /corpbanner/
Disallow /stream
Disallow /api/
Disallow /design_tokens
Allow /api/currency_data/
Allow /api/discover/1/discover_mobile_web
Disallow /*_cb$

nextgensearchbot

Rule Path
Disallow /

edisterbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

swebot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

Other Records

Field Value
sitemap https://goosetheband.bandcamp.com/sitemap.xml

Comments

  • the currency data endpoint is required to render pages
  • required to render /discover pages
  • pattern matching known to work only with Google and Yahoo
  • badly-behaving bots
  • unwanted bots