lcamtuf’s thing • 6938 implied HN points • 10 Jan 26
- Images and audio are both sampled data so you can apply similar transforms to both, but ears and eyes perceive artifacts very differently so the same operation can look fine and sound awful.
- Pixelating or reducing bit depth in audio creates stair-step or high-frequency errors that produce metallic squeals or hiss, and those artifacts are typically removed with lowpass/rolling-average filtering or proper DAC anti-aliasing.
- Frequency-domain editing works well if you process short, overlapping windows with a Hann (sin^2) weighting and 50% overlap so the attenuations cancel out, avoiding clicks and enabling effects like pitch shifting and vocoding.