musicnotes.com
robots.txt

Robots Exclusion Standard data for musicnotes.com

Resource Scan

Scan Details

Site Domain musicnotes.com
Base Domain musicnotes.com
Scan Status Ok
Last Scan2024-11-07T16:42:03+00:00
Next Scan 2024-11-14T16:42:03+00:00

Last Scan

Scanned2024-11-07T16:42:03+00:00
URL https://musicnotes.com/robots.txt
Redirect https://www.musicnotes.com/robots.txt
Redirect Domain www.musicnotes.com
Redirect Base musicnotes.com
Domain IPs 104.18.14.231, 104.18.15.231, 2606:4700::6812:ee7, 2606:4700::6812:fe7
Redirect IPs 104.18.14.231, 104.18.15.231, 2606:4700::6812:ee7, 2606:4700::6812:fe7
Response IP 104.18.14.231
Found Yes
Hash 6f3efc27cf614848931129b03a0a28f6af38290d085e691df83743f20787425f
SimHash 7eccdc82c3b0

Groups

*

Rule Path
Disallow /commerce/
Disallow /commerce/signin.asp
Disallow /free/club/
Disallow /mysuggestions.asp
Disallow /search/
Disallow /landing/
Disallow /search/thumb.php
Disallow /ppc/
Disallow /ppcnav/
Disallow /basketshim/
Disallow /basket
Disallow /wishlist/
Disallow /wishlist
Disallow /_api4apps/
Disallow /error/
Disallow /get/html5.aspx

Other Records

Field Value
crawl-delay 1

psbot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

crazywebcrawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.musicnotes.com/sitemap.xml
sitemap https://www.musicnotes.com/sheet-music/sitemapindex.xml.gz