[Review] Convolutional Neural Networks using Logarithmic Data Representation, arxiv, 2016

July 17, 2024 1 분 소요

Convolutional Neural Networks using Logarithmic Data Representation

Logarithmic-scale quantization을 이용한 최초(?)의 non-uniform quantization 논문으로 보여져 공부해 보았습니다

Convolution layer와 Fully-connected layer 모두 matrix dot product 연산을 진행하게 되는데
하드웨어에서의 곱 연산을 bit shift 연산으로 변경해줄수만 있다면 하드웨어적 측면에서의 상당한 이득을 볼 수 있다는 점에
해당 저자들은 다음과 같은 방식을 제안하게 됩니다

Method 1.

\[\begin{align} w^Tx &\cong \sum_{i=1}^n w_i\times2^{\tilde{x}_i} \newline &= \sum_i^nBitshift ( w_i, \tilde{x}_i ) \end{align}\]

$\tilde{x}_i=Quantize(log_2(x_i))$ 라고 정의할 때
기존 Conv. & FC layer에서의 dot product를 $\tilde{x}_i$만큼 shift한 $w_i$의 합 으로 표현이 가능하다는 것을
위 식을 통해 보여주고 있습니다 $\left(Bitshift(a,b)=a»b\right)$

그렇게 되면 비싼 곱셈기 대신 간단한 bit shifter로 dot product가 구현이 가능하니
우선 하드웨어적으로 상당한 이득을 보게됩니다

그리고 Quantization에 대해서는 두 가지 옵션이 존재합니다

Floor : $\lfloor log_2(w) \rfloor$ $\rightarrow$ MSB로부터 첫 1 bit가 포착되는 지점 반환
Round : integer part 계산 후 fractional part 이용하여 가까운 값 반환

Method 2.

그렇다면 동일하게 weight에도 quantization을 적용하게 될 경우
지수간 곱셈이 더하기 연산으로 바뀌게 되고, 결국 1이라는 숫자에 대한 bit shift로 변환이 됩니다

\[\begin{align} w^Tx &\cong \sum_{i=1}^n 2^{Quantize(log_2(w_i))+Quantize(log_2(x_i))} \newline &= \sum_ {i=1}^n Bitshift ( 1, \tilde{w}_i+\tilde{x}_i ) \end{align}\]

Accumulation in log domain

Conv. & FC layer에서의 각 원소에 대한 dot product가 bit shift로 변환이 될 수 있음을 보였으니
모든 원소들에 대한 합은 어떻게 변환을 할 수 있는지에 대해 저자들은 다음과 같이 일반화 하였습니다

\[\tilde{s}_n \cong max(\tilde{s} _{n-1},\tilde{p}_n)+Bitshift(1,-|\lfloor \tilde{s} _{n-1}\rfloor-\tilde{p}_n|)\]

Experiment

실험에서 재미있던 점 몇 가지를 보자면

위 그래프에서 가로축은 quantization에 의한 error로 볼 수 있고 세로축은 해당 error에서의 count를 뜻합니다
log quantization은 linear quantization에 비해 상대적으로 작은 quantization error를 띔을 볼 수 있습니다

그리고 base를 2가 아닌 $\sqrt2$를 이용한 quantization을 적용해야 CNN에서의 accuracy drop이 덜 한 것을 볼 수 있는데
convolution layer의 weight에 대한 quantization error가 누적됨에 따라 base-2에서는 accuracy drop이 더 컸음을 알 수 있습니다

마지막으로, quantization을 학습에도 적용해 보았을 때
log-quantization의 경우 backpropagated gradient를 quantization해도 학습이 잘 되는 것을 볼 수 있으나
linear-quanitzation의 경우 qunatize하지 않은 floating point gradient를 그대로 사용해야만 학습이 가능했다고 합니다

Twitter Facebook LinkedIn

HK Park

[Review] Convolutional Neural Networks using Logarithmic Data Representation, arxiv, 2016

Method 1.

Method 2.

Accumulation in log domain

Experiment

공유하기

댓글남기기

참고

[Review] Data-Free Quantization via Pseudo-label Filtering, CVPR, 2024

[Review] GENIE: Show Me the Data for Quantization, CVPR, 2023

[Review] Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time, ICML, 2022

[Review] Similarity of Neural Architectures using Adversarial Attack Transferability, ECCV, 2024