How to Improve Language Model Quantization with Outlier Channels
As language models grow in size and complexity, efficient quantization becomes more pressing. One challenge is the presence of outlier channels: feature dimensions whose values are orders of magnitude larger than those of other dimensions. While important for model performance, these outlier channels can pose significant