Sfizz polyphony limit?

jlearman · February 22, 2026, 3:01am

I have two versions of a piano sfz, one with velocity crossfades and one without. Using crossfades, I quickly hit the polyphony limit and it’s unusable. Is there a way to increase it? Setting it using the “polyphony” keyword in the .sfz file has no effect.

I realize there’s a limit due to CPU power, but I’d like to find out what that limit actually is (currently on my PI4, which I plan to upgrade to a PI5 and use a higher limit.)

hannesmenzel · February 22, 2026, 6:26am

I think the limit it 128 for sfizz.

Here’s what I do to tame the effect you report: additionally to the xf* opcodes for crossfades I also use hivel and lovel. If you don’t, any region sharing the same note value are played, even if it has no volume because of the crossfade. This way you can limit the number of simultaneously triggered notes to 2. Here’s an example:

github.com/sfzinstruments/jlearman.jRhodes3d

jRhodes3d-st.sfz

4fbf9f92f


      
          <region> sample=A_059__B3_5-ST.flac lokey=Bb3 hikey=C4  pitch_keycenter=B3  ampeg_release=0.4
          <region> sample=A_062__D4_5-ST.flac lokey=Db4 hikey=Eb4 pitch_keycenter=D4  ampeg_release=0.3
          <region> sample=A_065__F4_5-ST.flac lokey=E4  hikey=Ab4 pitch_keycenter=F4  ampeg_release=0.3
          <region> sample=A_071__B4_5-ST.flac lokey=A4  hikey=Db5 pitch_keycenter=B4  ampeg_release=0.3
          <region> sample=A_076__E5_5-ST.flac lokey=D5  hikey=Gb5 pitch_keycenter=E5  ampeg_release=0.3
          <region> sample=A_081__A5_5-ST.flac lokey=G5  hikey=B5  pitch_keycenter=A5  ampeg_release=0.3
          <region> sample=A_086__D6_5-ST.flac lokey=C6  hikey=E6  pitch_keycenter=D6  ampeg_release=0.2
          <region> sample=A_091__G6_5-ST.flac lokey=F6  hikey=A6  pitch_keycenter=G6  ampeg_release=0.2
          <region> sample=A_096__C7_5-ST.flac lokey=Bb6 hikey=F7  pitch_keycenter=C7  ampeg_release=0.2
          
          <group> xfin_lovel=25 xfin_hivel=60 xfout_lovel=61 xfout_hivel=84 ampeg_release=0.300000
          <region> sample=A_029__F1_4-ST.flac lokey=C1  hikey=Ab1 pitch_keycenter=F1  ampeg_release=0.5
          <region> sample=A_035__B1_4-ST.flac lokey=A1  hikey=Db2 pitch_keycenter=B1  ampeg_release=0.5
          <region> sample=A_040__E2_4-ST.flac lokey=D2  hikey=Gb2 pitch_keycenter=E2  ampeg_release=0.5
          <region> sample=A_045__A2_4-ST.flac lokey=G2  hikey=B2  pitch_keycenter=A2  ampeg_release=0.4
          <region> sample=A_050__D3_4-ST.flac lokey=C3  hikey=E3  pitch_keycenter=D3  ampeg_release=0.4
          <region> sample=A_055__G3_4-ST.flac lokey=F3  hikey=A3  pitch_keycenter=G3  ampeg_release=0.4
          <region> sample=A_059__B3_4-ST.flac lokey=Bb3 hikey=C4  pitch_keycenter=B3  ampeg_release=0.4
          <region> sample=A_062__D4_4-ST.flac lokey=Db4 hikey=Eb4 pitch_keycenter=D4  ampeg_release=0.3
          <region> sample=A_065__F4_4-ST.flac lokey=E4  hikey=Ab4 pitch_keycenter=F4  ampeg_release=0.3
          <region> sample=A_071__B4_4-ST.flac lokey=A4  hikey=Db5 pitch_keycenter=B4  ampeg_release=0.3

So this group is a middle set of 5 velocity layers crossfading to each other of your very own rhodes library.

<group> xfin_lovel=25 xfin_hivel=60 xfout_lovel=61 xfout_hivel=84 lovel=25 hivel=84

By adding the hivel and lovel opcodes, the group is not triggered at all outside velocities between 25 and 84, so these triggers to not add to the polyphony count. In your original version, each of the five groups is triggered even if it is outside the crossfade range. With this changed for a triggered midi note-on event with velocity 50 this group and the group above crossfading into it will be triggerd, but not the others-

You can also consider the note_polyphony and note_selfmask option.

This is btw. also a reason to think about if you need multipe mic samples for pianos. Let’s say you got 3 mics, 5 velocity layers with velocity crossfades (but not limited) then you hit 15 regions with each key.

jlearman · February 22, 2026, 6:47pm

Wow, thanks! Looking forward to trying this. I’ll see if that’s documented at sfzformat.com , and whether sforzando and linuxsampler work the same way.

hannesmenzel · February 22, 2026, 7:54pm

I would love to know that as well. Since I rarely use sforzando on my desktop and found linuxsampler on zynthian behaving weird I just exclusively used sfizz and only know how this handles some stuff.

Otherwise, speaking of weirdo, the opcode set of sfizz is quite rich, but lacks sometimes in fields you didn’t expect.

hannesmenzel · February 22, 2026, 8:16pm

Maybe I was wrong here.

github.com/zynthian/zynthian-ui

zyngine%2Fzynthian_engine_sfizz.py

oram


      
              super().__init__(state_manager)
              self.name = "Sfizz"
              self.nickname = "SF"
              if jackname:
                  self.jackname = jackname
              else:
                  self.jackname = self.state_manager.chain_manager.get_next_jackname(
                      "sfizz")
          
              self.preload_size = 32768  # 8192, 16384, 32768, 65536
              self.num_voices = 40
              self.sfzpath = None
          
              self.command = f"sfizz_jack --client_name '{self.jackname}' --preload_size {self.preload_size} --num_voices {self.num_voices}"
              self.command_prompt = "> "
          
              self.reset()
              self.start()
          
          # ---------------------------------------------------------------------------
          # Subproccess Management & IPC

github.com/zynthian/zynthian-sys

scripts%2Frecipes%2Finstall_linuxsampler.sh

oram


      
          # Download, Build &  Install LinuxSampler
          rm -rf linuxsampler
          svn --non-interactive --trust-server-cert co https://svn.linuxsampler.org/svn/linuxsampler/trunk linuxsampler
          cd linuxsampler
          libtoolize --force
          aclocal
          autoheader
          automake --force-missing --add-missing
          autoconf
          #Configure with optimizations from Schpion
          ./configure --enable-max-voices=21 --enable-max-streams=64 --enable-stream-min-refill=4096 --enable-refill-streams=2 --enable-stream-max-refill=131072 --enable-stream-size=262144 --disable-asm --enable-subfragment-size=64 --enable-eg-min-release-time=0.001 --enable-eg-bottom=0.0025 --enable-max-pitch=2 --enable-preload-samples=65536
          cd src/scriptvm
          yacc -o parser parser.y
          cd ../..
          # Apply patch from Steveb
          git clone https://github.com/steveb/rpi_linuxsampler_patch.git
          patch -p1 < rpi_linuxsampler_patch/linuxsampler-arm.patch
          # Build LinuxSampler
          make -j 1
          make install
          make clean

Maybe this might be a feature request, at least for PI5?

jlearman · February 22, 2026, 9:11pm

Not just for pi5. When using sfizz standalone on pi4 I didn’t see this issue, IIRC. (I’ll verify this later.) Perhaps the 21 voice limit was for Pi3?

Personally, I think the hard limit should be significantly higher, and all built-in sfz’s could use a limit so that out of the box it works fine, but folks can override it and they can set whatever limit they want for sfz’s they import.

Maybe at some point I’ll bite the bullet and set up to build Zynthian, or at least, build sfizz for Zynthian and import it, and experiment.

Anyway, I’ll file an issue, and thanks big time for the insights!

hannesmenzel · February 22, 2026, 9:17pm

21 is for linuxsampler, I think sfizz os limited to 40. But maybe someone with more insight can confirm.

On windows I think you can setup the number of voices up 512. I wonder how the 40 voices limit was estimated.

jlearman · February 22, 2026, 9:35pm

Doh! Thanks. I need to read more carefully!

jlearman · February 23, 2026, 10:31pm

I tried adding the lovel and hivel opcodes and bingo, it works! Thanks big time for that hint!

I think it’s a bug in sfizz: a voice that’s too low to fade in shouldn’t be playing. Later I’ll try it with linuxsampler, and maybe sforzando (on Windows) too.

hannesmenzel · February 24, 2026, 7:39am

Great it worked for you. For velocity based crossfades it certainly should act like you propose. Maybe they just implemented all these functions together and alike, because for CC based crossfades it makes totally sense to play outside the crossfade.

jofemodo · February 27, 2026, 9:16am

Hi!

I’ve been making some tests and trying to optimize the parameters of sfizz, specially the “num_voices” and “preload_size” parameters.

I tested with Salamander V3 and a random pattern that generates lots of notes in all the range with random velocities and the sustain pedal pushed all the time. It was a really hard test and i got Xruns due to bottleneck in disk access (SD-card):

CPU peaking around 30% and memory usage going from 800 (0 voices in use) to 1500MB (all voices in use).

This is disk load:

Curiously, it’s difficult to make a difference when modifying the “preload_size” value.

It would be wise to repeat the test with a fast NVMe disk.

Regards,

jofemodo · February 27, 2026, 9:21am

I’ve adjusted the sifzz paramters like this (Vangelis):

        if pi_version >= 5:
            self.num_voices = 64
            self.preload_size = 8192  # 8192, 16384, 32768, 65536
        elif pi_version == 4:
            self.num_voices = 48
            self.preload_size = 16384  # 8192, 16384, 32768, 65536
        else:
            self.num_voices = 32
            self.preload_size = 32768  # 8192, 16384, 32768, 65536

It would increase polyphony in RPi5 while keeping current behavior for RPi4 and slighty reduce polyphony for Pi3. I didn’t test with Pi3, so it should be confirmed this reduction is enough to avoid XRuns or it needs still more reduction in polyphony limit.

Regards,

hannesmenzel · February 27, 2026, 9:26am

Somebody tested it with hint_ram_based=1 in the sfz script? Would be interesting, because it sounds like the sd speed is the bottleneck on either version of the PI then, right?

jofemodo · February 27, 2026, 10:23am

Easy
I repeated the test and i can confirm the diagnostic. With hint_ram_based=1 no Xruns at all:

CPU is almost the same (perhaps slighty less) than previously. Memory usage is now about 3500 MB, more than double. Load time, huge, of course!!!

Disk is not being used at all and as i said, no XRUNs at all. It plays super-smoothly. I could probably increase polyphony a lot more than 64 without getting XRUNs.

BTW, this is the pattern used, with velocity humanization set to 100% and sustain ON all the time:

Not a joke!

As i told in the post above, the same pattern played without the hint_ram_based=1 generated lot of XRUNs not matter the preload_size value i tried.

Regards,

hannesmenzel · February 27, 2026, 10:40am

That’s what I hoped for.

I’m not 100% sure what the preload_size parameter really is, but I guess it means it would load samples smaller than that into ram?

github.com/sfztools/sfizz

src/sfizz.h

f5c6e29f2


      
          *                      sfizz only handles stereo outputs.
          * @param num_frames    Number of frames to fill. This should be less than
          *                      or equal to the expected samples_per_block.
          *
          * @par Thread-safety constraints
          * - @b RT: the function must be invoked from the Real-time thread
          */
          SFIZZ_EXPORTED_API void sfizz_render_block(sfizz_synth_t* synth, float** channels, int num_channels, int num_frames);
          
          /**
          * @brief Get the size of the preloaded data.
          *
          * This returns the number of floats used in the preloading buffers.
          * @since 0.2.0
          *
          * @param synth  The synth.
          */
          SFIZZ_EXPORTED_API unsigned int sfizz_get_preload_size(sfizz_synth_t* synth);
          
          /**
          * @brief Set the size of the preloaded data in number of floats (not bytes).

github.com/sfztools/sfizz

clients/sfizz_jack.man.in

f5c6e29f2


      
          [\fIFILE\fR] [\fIOPTIONS\fR...]
          .SH DESCRIPTION
          sfizz_jack wraps the sfizz SFZ library in a JACK client that can be controlled by a text interface.
          .SH OPTIONS
          .IP \fIFILE\fR
          An optional SFZ file name to load on startup
          .IP "\fB--client_name\fR"
          Name for the JACK client
          .IP "\fB--num_voices\fR \fINUMBER\fR"
          Change the maximum polyphony
          .IP "\fB--preload_size\fR \fINUMBER\fR"
          Bytes to preload in cache for samples
          .IP "\fB--state\fR"
          Periodically output the state of the synth
          .IP "\fB--jack_autoconnect\fR"
          Autoconnect the JACK outputs
          .SH TEXT INTERFACE
          It is possible it interact with the JACK client through the standard input.
          The possible commands are
          .IP "load_instrument \fIFILE\fR"
          Load an instrument

Never saw the data size unit “number of floats” before, but when it is what I think, its 8192 * 4 Bytes = 32768 Bytes? Other comment says “bytes”, so 8192.

Do you know there is an option in sfizz to just load libraries to RAM in any case? This could be the default behavior for our use case (fast cpu, slow sd card), at least, if the soundfont doesn’t exceed the available ram.

jofemodo · February 27, 2026, 11:17am

As far as i know, there is no such option, but we could implement it quite easily by injecting the “hint_ram_based” op code on-the-fly.
The problem is it takes a live to load big soundfonts. It’s really annoying. We could work around preset preloading to use streamed-loading and then, when asserting the soundfont, load full to RAM, but as i told, it takes a lot of time for big soundfonts, and it block the engine process until this is done. No progress indication. Just a frozen process until loading is finished.

Perhaps we should use smaller soundfonts in the Factory Collection, and warn against big ones, explaining that they require a NVMe disk to perform without XRUNs.

Anyway, think the test i ran is really overkill and probably far from most real use-cases. It could be wise to test with some “Rachmaninoff” mid files instead.

Regards,

jofemodo · February 27, 2026, 11:56am

More tests. I just repeated the initial test with Salamander Grand Piano, normal stream-loading (no hint_ram_based option). I used another V5, with same SD-card (same model and branch) but 8GB RBPi instead of the 4GB RBPi i used for the first test.

I must say i’m surprised because it improves a lot. XRUNs are almost gone and system disk load is quite lower, what makes me think that bigger OS disk cache is making the difference here.

What now seems clear to me is that using the 8GB RBPi version makes a big difference when using big soundfonts, and this is against what i thought before these tests. I’m shocked, mates!! But it has sense, anyway, when you think twice about how the whole thing works.

All the best,

jawn · February 27, 2026, 12:20pm

On Pi 5, I wonder what difference a USB mass storage device makes to boot zynthianOS or just the soundfonts from.

jlearman · February 27, 2026, 1:17pm

That’s the sample amount kept in memory to be able to start playing the note immediately while loading the rest of the sample.

That value should be based on RAM available, not CPU type. If that’s a static build parameter, it shouldn’t vary based on CPU, but if it’s at runtime it should be based on RAM size or user-configurable.

hannesmenzel · February 27, 2026, 1:22pm

So, the “attack” of the sample. Given we’re talking about bytes, this shouldn’t make any problems in any configuration (8192 bytes of let’s say even 3000 sample files each is not a thing we should worry about in terms of ram).