Mason Wang

DAC

Residual blocks with dilations 1, 3, 9 slash kernel size 7, and depthwise 1x1 These are chained together.

Downsampling blocks that double channel

Upsampling blocks that halve channel

Snake activations

feature matching loss, multiscale STFT discriminator, mel-reconstruction loss

Last Reviewed: 1/17/25