Radio Autoencoder V2

RADE (Radio AutoEncoder) is a neural codec for transmitting speech over HF radio channels. A neural encoder compresses speech into a latent vector which is modulated onto an OFDM waveform and transmitted. At the receiver a neural decoder reconstructs the speech features, which are synthesised into audio by the FARGAN vocoder. The system is trained end-to-end, jointly optimising the encoder, channel layer, and decoder for minimum speech distortion across a range of channel conditions.

RADE V2 builds on V1 with several algorithmic improvements:

	V1	V2
Carriers	30, includes pilot symbols	14, data only (no pilots)
Equalisation	Classical DSP, pilot-aided	ML-based, no pilots required
99% Occupied Bandwidth	~2100 Hz (SSB filter limited)	~860 Hz
Frame duration	~180 ms	~40 ms
PAPR	4.2 dB	3.5 dB
Frame sync	DSP	Neural network
End-of-over detection	Pilot pend sequence	Channel sparsity metric
Threshold SNR (AWGN)	-2 dB	~-4.5 dB
Threshold SNR (MPP)	0 dB	~-3 dB

The elimination of pilot symbols in V2 recovers the bandwidth and power they consumed, enabling a narrower, cleaner waveform and improved high and low SNR performance. Combined with the PAPR improvement, RADE V2 is approximately 3 dB more sensitive than V1 at low SNRs.

Threshold SNR values are approximate, based on informal listening tests and objective loss metric.

Scope

This repo is the reference Python implementation for RADE V1 and V2. The current focus is on RADE V2 development, however this repo also contains RADE V1 (including many ctests).

This repo is intended to support experimental work, with just enough information for the advanced experimenter to reproduce aspects of the work. The focus is on waveform development, not software configuration. It is not intended to be packaged for general use or to work across multiple Linux distros and operating systems. Unless otherwise stated, the code in this repo is intended to run only on Ubuntu Linux 22-24 on a non-virtual machine.

For deployment and distribution of RADE V1 please use the C port. RADE V2 is still under development but we hope to make an initial release soon.

Quickstart

Installation section below.

RADE V2 Tx and Rx example:

./inference.sh 250725/checkpoints/checkpoint_epoch_200.pth wav/brian_g8sez.wav /dev/null --rate_Fs --latent-dim 56 \
 --peak --cp 0.004 --time_offset -16 --correct_time_offset -8 --auxdata --w1_dec 128 --write_rx 250725_rx.f32
./rx2.sh 250725/checkpoints/checkpoint_epoch_200.pth 250725a_ml_sync 250725_rx.f32 test.wav
play test.wav

RADE V1 Tx and Rx example:

./inference.sh model19_check3/checkpoints/checkpoint_epoch_100.pth wav/brian_g8sez.wav /dev/null \
 --rate_Fs --pilots --pilot_eq --eq_ls --cp 0.004 --bottleneck 3 --auxdata --write_rx v1_rx.f32
cat v1_rx.f32 | python3 radae_rxe.py --model model19_check3/checkpoints/checkpoint_epoch_100.pth > features_out.f32
./build/src/lpcnet_demo -fargan-synthesis features_out.f32 - | aplay -f S16_LE -r 16000

test/v2_spot.sh is a good starting point for RADE V2 experimentation.

Reference and License

D. Rowe, J.-M. Valin, RADE: A Neural Codec for Transmitting Speech over HF Radio Channels, arXiv:2505.06671, 2025. This paper describes RADE V1; a V2 paper is planned as future work. The companion branch of this repo (with a RADE V1 focus) is waspaa_2025.

The RADE source code is released under the two-clause BSD license.

Files

File	Description
`inference.py` / `inference.sh`	RADE V2 transmitter: encodes speech and modulates to a complex IQ sample file
`rx2.py` / `rx2.sh`	RADE V2 receiver: stateful, streaming decoder
`radae_txe.py` / `radae_rxe.py`	RADE V1 transmitter and receiver
`radae/radae.py`	Core RADE model definition (encoder, channel layer, decoder)
`train.py`	Training script for the RADE encoder/decoder
`ml_sync.py` / `models_sync.py`	ML frame sync: trains and runs the neural frame synchroniser
`train_ft_sync.sh`	Automation script for training the ML sync model
`loss.py`	Measures ML loss (speech distortion) between encoder and decoder feature vectors
`compare_models_inf.sh`	Generates loss versus SNR curves across models and channel types
`ota_test.sh`	Over-the-air/over-the-cable test: generates tx signal, decodes rx, measures loss
`est_CNo.py`	C/No estimation from a received chirp signal
`chirp.py`	Generates a chirp reference signal used for timing and level calibration in OTA tests
`int16tof32.py` / `f32toint16.py`	Sample format converters between int16 and float32
`test/v2_spot.sh`	RADE V2 spot test: encodes, applies channel impairments, decodes, checks loss
`test/v2_acq.sh`	Acquisition tests: false acquisition rate on noise or noise plus sine wave
`test/ota_test_cal.sh`	Calibrated OTA test using the `ch` channel simulator, checks V1 and V2 loss
`test/snr_est_test.sh`	Steps through SNR range comparing measured vs estimated SNR3k
`test/eoo_detect_prob.sh`	Measures probability of correct EOO detection over a range of channel conditions
`test/eoo_false_prob.sh`	Measures EOO false detection rate on noise

Installation

Packages

sox, python3, python3-matplotlib and python3-tqdm, octave, octave-signal, cmake. Pytorch should be installed using the instructions from the pytorch web site.

RADE

Builds the FARGAN vocoder and ctest framework, most of RADAE is in Python.

cd ~
git clone https://github.com/drowe67/radae.git
cd radae
mkdir build
cd build
cmake ..
make

Automated Tests

The cmake/ctest framework is being used as a build and test framework. The command lines in CmakeLists.txt are a good source of examples, if you are interested in running the code in this repo. The ctests are a work in progress and may not pass on all systems (see Scope above).

To run the tests:

cd radae/build
ctest

To list tests ctest -N, to run just one test ctest -R inference_model5, to run in verbose mode ctest -V -R inference_model5.

Listening to modulated RADE

A lot of the tests generate a float IQ sample file. You can listen to this file with:

cat rx.f32 | python3 f32toint16.py --real --scale 8192 | play -t .s16 -r 8000 -c 1 - bandpass 300 2000

The scaling --scale is required as the low SNRs mean the noise peak amplitude can clip 16 bit samples if not carefully scaled.

Optional: RADE V1 C Port Tests (radae_nopy)

The radae_nopy repo contains a C port of the RADE V1 receiver. Its ctests are optional and only enabled when RADAE_NOPY_BUILD_DIR is passed to cmake:

cd ~
git clone https://github.com/peterbmarks/radae_nopy.git
cd radae_nopy && mkdir build && cd build && cmake .. && make
cd ~/radae/build
cmake -DRADAE_NOPY_BUILD_DIR=~/radae_nopy/build ..
ctest -R radae_nopy

Over the Air/Over the Cable (OTA/OTC)

The ota_test.sh script supports stored-file over-the-air and over-the-cable testing. It assembles a transmit file containing a chirp reference, compressed SSB, RADE V1, and RADE V2 signals in sequence, which can be sent over a real HF channel or processed through a channel simulator. The script performs a controlled test of RADE V2 over real world channels.

Generate a transmit file from an input speech wav (16 kHz mono):

./ota_test.sh wav/brian_g8sez.wav -x

This produces tx.wav, which is suitable for transmission OTA using your SSB transmitter. We then use a remote HF receiver to sample the received signal to a wave file, e.g. rx.wav.

To simulate a real HF channel pass it through the ch channel simulator to add noise and fading:

./build/src/ch tx.wav - --No -20 | sox -t .s16 -r 8000 -c 1 - rx.wav

Decode rx.wav and measure ML loss against the original speech:

./ota_test.sh -r rx.wav -l wav/brian_g8sez.wav

The decoded audio files rx_ssb.wav, rx_rade1.wav, and rx_rade2.wav are written to the same directory as rx.wav. A report file and spectrogram is also produced, including objective loss measurements (if -l option used).

See ota_test.sh for more information.

Training

This section is optional - pre-trained models that run on a standard laptop CPU are available for experimenting with RADAE. If you wish to perform training, a serious NVIDIA GPU is required - the author used a RTX4090.

Generate a training features file using your speech training database training_input.pcm, we used 200 hours of speech from open source databases:
```
./lpcnet_demo -features training_input.pcm training_features_file.f32
```

Generate the MPP channel simulation file:

echo "Rs=50; Nc=14; multipath_samples('mpp', Rs, Rs, Nc, 250*60*60, 'h_nc14_mpp_train_test.c64','',1); quit" | octave-cli -qf

Train the RADE V2 encoder/decoder (the 250725 model was trained with these settings):

python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 \
  --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 \
   training_features_file.f32 250725 \
  --latent-dim 56 --cp 0.004 --auxdata --w1_dec 128 --peak \
  --h_file h_nc14_mpp_train.c64 --h_complex --range_EbNo --range_EbNo_start 3 \
  --timing_rand --freq_rand --ssb_bpf --plot_loss

Generate latent vectors from the trained model for ML sync training. This runs one pass through the training data without updating weights. Note the addition of +/- 2 ms of timing jitter, to maintain frame sync across the delay spread of multipath channels:

python3 train.py --cuda-visible-devices 0 --sequence-length 400 --batch-size 512 \
  --epochs 200 --lr 0.003 --lr-decay-factor 0.0001 \
  training_features_file.f32 tmp \
  --latent-dim 56 --cp 0.004 --auxdata --w1_dec 128 --peak \
  --h_file h_nc14_mpp_train.c64 --h_complex --range_EbNo --range_EbNo_start 3 \
  --timing_rand --timing_jitter 0.002 --freq_rand --ssb_bpf \
  --plot_EqNo 250725 --initial-checkpoint 250725/checkpoints/checkpoint_epoch_200.pth \
  --write_latent 250725a_z_train.f32

Train the ML frame sync model:

python3 ml_sync.py 250725a_z_train.f32 --count 100000 --save_model 250725a_ml_sync --latent_dim 56

ASR Tests

Automatic Speech Recognition (ASR) is used as an objective speech quality metric to compare RADE V1 against SSB and FreeDV 700D. The Whisper ASR model scores Word Error Rate (WER) on LibriSpeech samples passed through the modems under test.

Install dependencies:
```
pip3 install jiwer openai-whisper
```
The LibriSpeech test-clean dataset (~400 MB) is downloaded automatically to ~/.cache/LibriSpeech/ on first run via torchaudio.

Run controls (clean speech, FARGAN vocoder only, 4 kHz bandwidth):

./asr_test.sh clean && ./asr_test.sh fargan && ./asr_test.sh 4kHz

Run a sweep across AWGN channel conditions for each mode (100 samples):

./asr_test_top.sh ssb -n 100
./asr_test_top.sh rade -n 100
./asr_test_top.sh 700D -n 100

For MPP channel, first generate fading samples (if not already present), then re-run with --g_file:
```
./test/make_g.sh
./asr_test_top.sh rade -n 100 --g_file g_mpp.f32
```

Plot WER curves in Octave:

octave:1> radae_plots; plot_wer("241221","241221_asr_test.png")

C Port of Core Encoder and Decoder

The following describes the V1 core encoder/decoder C port.

A RADE V2 pure-C RX port now exists in this repository (see C_RX_MIGRATION.md, src/radae_rx_v2.c, and rade_rx_v2_pure_c_* in src/rade_api.c). Future work refers to the remaining V2 TX C port.

The model weights can be compiled in or loaded at init-time from a binary blob. The actual model is hard coded in rade_enc.c and rade_dec.c, and can't be easily changed.

To compile-in the weights:

Export weights:

cd radae
python3 export_rade_weights.py model19_check3/checkpoints/checkpoint_epoch_100.pth src

We need to make some manual changes to the weight files to support changing input dimension at run time. In rade_enc_dat.c, the first call to linear_init() should look like:
```
int init_radeenc(RADEEnc *model, const WeightArray *arrays, int input_dim) {
  if (linear_init(&model->enc_dense1, arrays, "enc_dense1_bias", NULL, NULL,"enc_dense1_weights_float", NULL, NULL, NULL, input_dim, 64)) return 1;
```
e.g. the fixed input dimension (84 for model19_check3, 80 for earlier models without auxdata) should be changed to the input_dim variable. This allows us to enable/disable auxdata at init time, without changing the C code for the model.
Also make manual changes to support output_dim in rade_dec_dat.c, init_radedec().
Build C code.
Run ctests.

To export the compiled in weights to a binary blob:

cd radae/build
./src/write_rade_weights ../bin/model05.bin

These can then be loaded at init-time, see examples in src/test_rand_enc.c and src/test_rand_dec.c.

Name		Name	Last commit message	Last commit date
Latest commit History 875 Commits
.github/workflows		.github/workflows
250725/checkpoints		250725/checkpoints
bin		bin
cmake		cmake
cross-compile		cross-compile
doc		doc
docs		docs
model05/checkpoints		model05/checkpoints
model17/checkpoints		model17/checkpoints
model18/checkpoints		model18/checkpoints
model19/checkpoints		model19/checkpoints
model19_check3/checkpoints		model19_check3/checkpoints
model_bbfm_01/checkpoints		model_bbfm_01/checkpoints
public_html		public_html
radae		radae
src		src
test		test
wav		wav
weight-exchange		weight-exchange
250622_ft		250622_ft
250625_ml_sync		250625_ml_sync
250725_ft		250725_ft
250725_ml_sync		250725_ml_sync
250725a_ml_sync		250725a_ml_sync
251002_mpp_16k_ft		251002_mpp_16k_ft
BBFM.md		BBFM.md
CMakeLists.txt		CMakeLists.txt
C_RX_MIGRATION.md		C_RX_MIGRATION.md
LICENSE		LICENSE
README.md		README.md
adapp.m		adapp.m
adasmooth.m		adasmooth.m
analog_bbfm.sh		analog_bbfm.sh
asr_test.sh		asr_test.sh
asr_test_top.sh		asr_test_top.sh
asr_wer.py		asr_wer.py
autocorr.py		autocorr.py
autocorr_simple.py		autocorr_simple.py
bbfm_bpf.m		bbfm_bpf.m
bbfm_inference.py		bbfm_inference.py
bbfm_inference.sh		bbfm_inference.sh
bbfm_rx.py		bbfm_rx.py
bbfm_rx.sh		bbfm_rx.sh
check_snr_est.py		check_snr_est.py
chirp.py		chirp.py
compare_models.sh		compare_models.sh
compare_models_inf.sh		compare_models_inf.sh
diff_f32.py		diff_f32.py
doppler_spread.m		doppler_spread.m
dt_test.sh		dt_test.sh
eoo_ber.py		eoo_ber.py
est_CNo.py		est_CNo.py
est_snr.py		est_snr.py
est_snr_curves.sh		est_snr_curves.sh
evaluate.sh		evaluate.sh
evaluate_loop.sh		evaluate_loop.sh
export_frame_sync_weights.py		export_frame_sync_weights.py
export_rade_weights.py		export_rade_weights.py
export_rade_weights_250725.py		export_rade_weights_250725.py
export_rx2_model_data.py		export_rx2_model_data.py
export_tx2_model_data.py		export_tx2_model_data.py
f32toint16.py		f32toint16.py
firtorch.py		firtorch.py
fm.m		fm.m
generate_complex_bpf_vectors.py		generate_complex_bpf_vectors.py
generate_frame_sync_vectors.py		generate_frame_sync_vectors.py
generate_rade_rx_v2_vectors.py		generate_rade_rx_v2_vectors.py
generate_rx2_coarse_sync_vectors.py		generate_rx2_coarse_sync_vectors.py
generate_rx2_demod_vectors.py		generate_rx2_demod_vectors.py
generate_rx2_eoo_vectors.py		generate_rx2_eoo_vectors.py
generate_rx2_frame_sync_vectors.py		generate_rx2_frame_sync_vectors.py
generate_rx2_frontend_vectors.py		generate_rx2_frontend_vectors.py
generate_rx2_receiver_vectors.py		generate_rx2_receiver_vectors.py
generate_tx2_encode_vectors.py		generate_tx2_encode_vectors.py
generate_tx2_encode_vectors_via_v2.py		generate_tx2_encode_vectors_via_v2.py
inference.py		inference.py
inference.sh		inference.sh
inject_bad_pitch.py		inject_bad_pitch.py
int16tof32.py		int16tof32.py
jmv_ft.sh		jmv_ft.sh
jmv_ft_tool.sh		jmv_ft_tool.sh
load_c64.m		load_c64.m
load_f32.m		load_f32.m
load_raw.m		load_raw.m
loss.py		loss.py
ml_eq.py		ml_eq.py
ml_pilots.py		ml_pilots.py
ml_sync.py		ml_sync.py
mleq_curves.sh		mleq_curves.sh
models_ft.py		models_ft.py
models_sync.py		models_sync.py
multipath_samples.m		multipath_samples.m
ofdm_sync.sh		ofdm_sync.sh
ota_test.sh		ota_test.sh
plot_specgram.m		plot_specgram.m
ptt_test.conf		ptt_test.conf
ptt_test.sh		ptt_test.sh
radae_plots.m		radae_plots.m
radae_rx.sh		radae_rx.sh
radae_rxe.py		radae_rxe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Radio Autoencoder V2

Scope

Quickstart

Reference and License

Files

Installation

Packages

RADE

Automated Tests

Listening to modulated RADE

Optional: RADE V1 C Port Tests (radae_nopy)

Over the Air/Over the Cable (OTA/OTC)

Training

ASR Tests

C Port of Core Encoder and Decoder

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Radio Autoencoder V2

Scope

Quickstart

Reference and License

Files

Installation

Packages

RADE

Automated Tests

Listening to modulated RADE

Optional: RADE V1 C Port Tests (radae_nopy)

Over the Air/Over the Cable (OTA/OTC)

Training

ASR Tests

C Port of Core Encoder and Decoder

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages