FSK and compander levels are adjusted.
Eliminate offsets between subsequent speech chunks. This is done by a
high-pass filter, which does not pass a constant (DC) offset.
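
As an illustration of how a high-pass filter removes such an offset, here is a
minimal sketch of a first-order DC blocker. It is not taken from the actual
code: the names dc_filter_t, dc_filter_init and dc_filter_process, the use of
double samples, and the pole value are all assumptions made for this example.
The filter state is kept between calls, so a step in offset from one chunk to
the next decays instead of persisting.

```c
#include <assert.h>
#include <stddef.h>

/* State of a first-order DC-blocking high-pass filter (hypothetical names). */
typedef struct {
	double r;       /* pole radius, below but close to 1.0 */
	double x_prev;  /* previous input sample */
	double y_prev;  /* previous output sample */
} dc_filter_t;

static void dc_filter_init(dc_filter_t *f, double r)
{
	assert(r > 0.0 && r < 1.0);
	f->r = r;
	f->x_prev = 0.0;
	f->y_prev = 0.0;
}

/* Filter samples in place: y[n] = x[n] - x[n-1] + r * y[n-1].
 * The state persists across calls, so consecutive chunks share one filter. */
static void dc_filter_process(dc_filter_t *f, double *samples, size_t count)
{
	for (size_t i = 0; i < count; i++) {
		double x = samples[i];
		double y = x - f->x_prev + f->r * f->y_prev;
		f->x_prev = x;
		f->y_prev = y;
		samples[i] = y;
	}
}
```

A pole closer to 1.0 lowers the cutoff frequency and leaves more of the speech
band untouched, at the cost of a slower decay of the offset.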
Use the correct audio processing chain:
time compress -> compressor -> scrambler / pre-emphasis -> TX
RX -> de-scrambler / de-emphasis -> expander -> time expand
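
The ordering above can be written down as data, which makes it easy to check
that the RX side undoes the TX stages in reverse order. This is only a sketch:
the enum, the tables and chain_is_mirrored are hypothetical names for this
example and do not come from the code itself.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Stage identifiers (illustrative names, one per TX/RX stage pair). */
enum stage {
	STAGE_TIME,      /* time compress (TX) / time expand (RX) */
	STAGE_COMPANDER, /* compressor (TX) / expander (RX) */
	STAGE_EMPHASIS,  /* scrambler / pre-emphasis (TX),
	                  * de-scrambler / de-emphasis (RX) */
};

/* TX: time compress -> compressor -> scrambler / pre-emphasis */
static const enum stage tx_chain[] = {
	STAGE_TIME, STAGE_COMPANDER, STAGE_EMPHASIS
};

/* RX: de-scrambler / de-emphasis -> expander -> time expand */
static const enum stage rx_chain[] = {
	STAGE_EMPHASIS, STAGE_COMPANDER, STAGE_TIME
};

#define CHAIN_LEN (sizeof(tx_chain) / sizeof(tx_chain[0]))

/* Return true if rx lists the stages of tx in reverse order. */
static bool chain_is_mirrored(const enum stage *tx, const enum stage *rx,
			      size_t len)
{
	assert(tx != NULL && rx != NULL);
	for (size_t i = 0; i < len; i++) {
		if (tx[i] != rx[len - 1 - i])
			return false;
	}
	return true;
}
```

Calling chain_is_mirrored(tx_chain, rx_chain, CHAIN_LEN) returns true for the
tables above; an RX table that is not the reverse of the TX table returns false.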