ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers is published in EMNLP 2024.