Mason Wang

‘1/f noise’ in music and speech

The power spectrum is often $f^{-\alpha}, 0.5 \leq \alpha \leq 1.5.

Loudness, pitch, melody exhibit this behavior.

Autocorrelation Functions

If $\langle v(t), v(t+\tau) \rangle$ is correlated (non-zero expectation) for $|\tau| < T$, it is “white” for frequencies $ \frac{1}{2\pi \tau_c}$ and is decreasing rapidly $f^{-2}$ for frequencies $ \geq \frac{1}{2\pi \tau_c}$. $\frac{1}{f}$ means some correlation over all time scales for which $\frac{1}{f}$ holds.

Note that $\tau=3$ implies a period of $2\pi * 3$, or that the angular frequency is $\frac{1}{3}$.

Negative slope for $S(f)$ implies correlation over scales of $\frac{1}{2\pi f}$.

Examples

For radio stations, spectrum flattens at lowest frequencies for some statistics

Power spectrums of waveforms produce peaks, take PSD of wvaeform energy, after bandpassing from 100 Hz to 10 kHz.

Music Generation