The sound is the key; audiences will accept visual discontinuity much more easily than they'll accept jumps in the sound. If the track makes sense, you can do almost anything visually.
The sound of the mandolin is a very curious sound because it's cheerful and melancholy at the same time, and I think it comes from that shadow string, the double strings.