Encoding audio data more efficiently?
I have the architecture below, it is pretty simple, and gives me an average error of around 0.052 when calculated by MAE. It is not that bad, but some high frequency information gets lost, and I have to eliminate that issue for a lot of reasons.