More precise audio waveforms
My editing style is primarily centered around cutting "to the music". I also usually clean up interviews and narrations by cutting out redundant sentences and hesitations (um, ah, etc.). For that, I need very precise visual representations of when sound events occur, whether I'm zoomed out or zoomed in. Compared to other editing software (especially Vegas, which evolved from an audio editor), I still find Pitivi lacking in that regard, so I can't use it for efficient editing of projects that require precision, especially with dialogue.
It seems to me that Pitivi's waveform sampling and drawing resolution is not high enough. Maybe a pixel histogram representation, rather than a line graph representation, would provide more precision? In any case, the problem becomes quite apparent when you compare with Audacity: even when you take only the half-waveform (the ones in the "middle row" in the screenshots below) to match Pitivi's representation and compare apples to apples, Audacity's (and Vegas') waveform is still much more precise.
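To illustrate what I mean by a "pixel histogram", here is a rough sketch in Python. It is not Pitivi's actual code, and I don't know how Audacity or Vegas do it internally; the function names `peak_columns` and `decimated_points` and the toy signal are made up for the example. The idea is that each pixel column shows the loudest sample that falls inside it, instead of a line being drawn through a handful of sampled points, so a short sound event cannot fall between two points and disappear.

```python
def peak_columns(samples, width):
    """Return one peak (max absolute amplitude) per pixel column.

    Assumes `samples` is a non-empty sequence of floats and `width` > 0.
    """
    n = len(samples)
    columns = []
    for x in range(width):
        start = x * n // width
        # Make sure every column covers at least one sample.
        end = max((x + 1) * n // width, start + 1)
        columns.append(max(abs(s) for s in samples[start:end]))
    return columns


def decimated_points(samples, width):
    """Return one point per pixel column, keeping only the first sample of each bin.

    This stands in for a line graph drawn through too few points.
    """
    n = len(samples)
    return [abs(samples[x * n // width]) for x in range(width)]


# A mostly silent signal with a single short click at sample 37.
signal = [0.0] * 100
signal[37] = 0.9

print(peak_columns(signal, 10))      # the click shows up in column 3
print(decimated_points(signal, 10))  # every column is 0.0, the click is lost
```

In this toy case the per-column peak keeps the click visible at any zoom level, while the decimated version draws a flat line, which is the kind of lost detail I'm describing.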
Here are some comparison screenshots of the first 45 seconds of Daft Punk's "Giorgio by Moroder" track, which is pretty much a voice interview track in the beginning, so a good benchmark:
A) Zoomed out, Pitivi's waveform (3rd row, at the bottom) visibly lacks some definition, but it's not "completely shocking":
B) Pitivi timeline zoom at 25-30%:
C) Pitivi timeline zoom at 40%: