'Image Recognition/Fundamental' 카테고리의 글 목록

FFT를 이용한 inverse FFT

Image Recognition/Fundamental 2024. 7. 31. 12:11

Forward fft:

$f f t (x + i y) = X + i Y = \sum (x + i y) (W r + i W i) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mrow data-mjx-texclass="ORD"><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">t</mi></mrow><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi>i</mi><mi>y</mi><mo stretchy="false">)</mo><mo>=</mo><mi>X</mi><mo>+</mo><mi>i</mi><mi>Y</mi><mo>=</mo><mo data-mjx-texclass="OP">\sum</mo><mo stretchy="false">(</mo><mi>x</mi><mo>+</mo><mi>i</mi><mi>y</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><msub><mi>W</mi><mi>r</mi></msub><mo>+</mo><mi>i</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo></math>$

$= \sum (x W r - y W i) + i (x W i + y W r) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>=</mo><mo data-mjx-texclass="OP">\sum</mo><mo stretchy="false">(</mo><mi>x</mi><msub><mi>W</mi><mi>r</mi></msub><mo>-</mo><mi>y</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo>+</mo><mi>i</mi><mo stretchy="false">(</mo><mi>x</mi><msub><mi>W</mi><mi>i</mi></msub><mo>+</mo><mi>y</mi><msub><mi>W</mi><mi>r</mi></msub><mo stretchy="false">)</mo></math>$

Inverse fft:

$ifft(X+iY)=1N∑(X+iY)(Wr−iWi)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mrow data-mjx-texclass="ORD"><mi mathvariant="monospace">i</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">t</mi></mrow><mo stretchy="false">(</mo><mi>X</mi><mo>+</mo><mi>i</mi><mi>Y</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><mo data-mjx-texclass="OP">∑</mo><mo stretchy="false">(</mo><mi>X</mi><mo>+</mo><mi>i</mi><mi>Y</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><msub><mi>W</mi><mi>r</mi></msub><mo>−</mo><mi>i</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo></math>$

$=1N∑(XWr+YWi)+i(−XWi+YWr)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><mo data-mjx-texclass="OP">∑</mo><mo stretchy="false">(</mo><mi>X</mi><msub><mi>W</mi><mi>r</mi></msub><mo>+</mo><mi>Y</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo>+</mo><mi>i</mi><mo stretchy="false">(</mo><mo>−</mo><mi>X</mi><msub><mi>W</mi><mi>i</mi></msub><mo>+</mo><mi>Y</mi><msub><mi>W</mi><mi>r</mi></msub><mo stretchy="false">)</mo></math>$

$=1N∑(XWr−(−Y)Wi)+(−i)(XWi+(−Y)Wr)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><mo data-mjx-texclass="OP">∑</mo><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">(</mo><mi>X</mi><msub><mi>W</mi><mi>r</mi></msub><mo>−</mo><mo stretchy="false">(</mo><mo>−</mo><mi>Y</mi><mo stretchy="false">)</mo><msub><mi>W</mi><mi>i</mi></msub><mo data-mjx-texclass="CLOSE">)</mo></mrow><mo>+</mo><mo stretchy="false">(</mo><mo>−</mo><mi>i</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">(</mo><mi>X</mi><msub><mi>W</mi><mi>i</mi></msub><mo>+</mo><mo stretchy="false">(</mo><mo>−</mo><mi>Y</mi><mo stretchy="false">)</mo><msub><mi>W</mi><mi>r</mi></msub><mo data-mjx-texclass="CLOSE">)</mo></mrow></math>$

$=1N[fft(X−iY)]∗=1N[fft((X+iY)∗)]∗<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><msup><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">[</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">t</mi></mrow><mo stretchy="false">(</mo><mi>X</mi><mo>−</mo><mi>i</mi><mi>Y</mi><mo stretchy="false">)</mo><mo data-mjx-texclass="CLOSE">]</mo></mrow><mo>∗</mo></msup><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><msup><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">[</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">t</mi></mrow><mo stretchy="false">(</mo><mo stretchy="false">(</mo><mi>X</mi><mo>+</mo><mi>i</mi><mi>Y</mi><msup><mo stretchy="false">)</mo><mo>∗</mo></msup><mo stretchy="false">)</mo><mo data-mjx-texclass="CLOSE">]</mo></mrow><mo>∗</mo></msup></math>$

또는

$1Nfft(Y+iX)=1N∑(Y+iX)(Wr+iWi)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mfrac><mn>1</mn><mi>N</mi></mfrac><mrow data-mjx-texclass="ORD"><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">f</mi><mi mathvariant="monospace">t</mi></mrow><mo stretchy="false">(</mo><mi>Y</mi><mo>+</mo><mi>i</mi><mi>X</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><mo data-mjx-texclass="OP">∑</mo><mo stretchy="false">(</mo><mi>Y</mi><mo>+</mo><mi>i</mi><mi>X</mi><mo stretchy="false">)</mo><mo stretchy="false">(</mo><msub><mi>W</mi><mi>r</mi></msub><mo>+</mo><mi>i</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo></math>$

$=1N∑(YWr−XWi)+i(YWi+XWr)=y+ix<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>=</mo><mfrac><mn>1</mn><mi>N</mi></mfrac><mo data-mjx-texclass="OP">∑</mo><mo stretchy="false">(</mo><mi>Y</mi><msub><mi>W</mi><mi>r</mi></msub><mo>−</mo><mi>X</mi><msub><mi>W</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo>+</mo><mi>i</mi><mo stretchy="false">(</mo><mi>Y</mi><msub><mi>W</mi><mi>i</mi></msub><mo>+</mo><mi>X</mi><msub><mi>W</mi><mi>r</mi></msub><mo stretchy="false">)</mo><mspace linebreak="newline"></mspace><mo>=</mo><mi>y</mi><mo>+</mo><mi>i</mi><mi>x</mi></math>$

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

FFT 구현 (0)	2024.07.31
CLAHE (2) (1)	2024.06.26
Approximate Distance Transform (0)	2024.06.02
Graph-based Segmentation (1)	2024.05.26
Linear Least Square Fitting: perpendicular offsets (0)	2024.03.22

FFT 구현

Image Recognition/Fundamental 2024. 7. 31. 09:00

$X m = N / 2 - 1 \sum n = 0 x 2 n W m n N + W m N N / 2 - 1 \sum n = 0 x 2 n + 1 W m n N <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>X</mi><mi>m</mi></msub><mo>=</mo><munderover><mo data-mjx-texclass="OP">\sum</mo><mrow data-mjx-texclass="ORD"><mi>n</mi><mo>=</mo><mn>0</mn></mrow><mrow data-mjx-texclass="ORD"><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo>-</mo><mn>1</mn></mrow></munderover><msub><mi>x</mi><mrow data-mjx-texclass="ORD"><mn>2</mn><mi>n</mi></mrow></msub><msubsup><mi>W</mi><mi>N</mi><mrow data-mjx-texclass="ORD"><mi>m</mi><mi>n</mi></mrow></msubsup><mo>+</mo><msubsup><mi>W</mi><mi>N</mi><mi>m</mi></msubsup><munderover><mo data-mjx-texclass="OP">\sum</mo><mrow data-mjx-texclass="ORD"><mi>n</mi><mo>=</mo><mn>0</mn></mrow><mrow data-mjx-texclass="ORD"><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo>-</mo><mn>1</mn></mrow></munderover><msub><mi>x</mi><mrow data-mjx-texclass="ORD"><mn>2</mn><mi>n</mi><mo>+</mo><mn>1</mn></mrow></msub><msubsup><mi>W</mi><mi>N</mi><mrow data-mjx-texclass="ORD"><mi>m</mi><mi>n</mi></mrow></msubsup></math>$

$X m + N / 2 = N / 2 - 1 \sum n = 0 x 2 n W m n N - W m N N / 2 - 1 \sum n = 0 x 2 n + 1 W m n N <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>X</mi><mrow data-mjx-texclass="ORD"><mi>m</mi><mo>+</mo><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn></mrow></msub><mo>=</mo><munderover><mo data-mjx-texclass="OP">\sum</mo><mrow data-mjx-texclass="ORD"><mi>n</mi><mo>=</mo><mn>0</mn></mrow><mrow data-mjx-texclass="ORD"><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo>-</mo><mn>1</mn></mrow></munderover><msub><mi>x</mi><mrow data-mjx-texclass="ORD"><mn>2</mn><mi>n</mi></mrow></msub><msubsup><mi>W</mi><mi>N</mi><mrow data-mjx-texclass="ORD"><mi>m</mi><mi>n</mi></mrow></msubsup><mo>-</mo><msubsup><mi>W</mi><mi>N</mi><mi>m</mi></msubsup><munderover><mo data-mjx-texclass="OP">\sum</mo><mrow data-mjx-texclass="ORD"><mi>n</mi><mo>=</mo><mn>0</mn></mrow><mrow data-mjx-texclass="ORD"><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo>-</mo><mn>1</mn></mrow></munderover><msub><mi>x</mi><mrow data-mjx-texclass="ORD"><mn>2</mn><mi>n</mi><mo>+</mo><mn>1</mn></mrow></msub><msubsup><mi>W</mi><mi>N</mi><mrow data-mjx-texclass="ORD"><mi>m</mi><mi>n</mi></mrow></msubsup></math>$

$m = 0, 1, . . ., N / 2 - 1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>m</mi><mo>=</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo>,</mo><mo>.</mo><mo>.</mo><mo>.</mo><mo>,</mo><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo>-</mo><mn>1</mn></math>$

Butterfly Structure:

FFT for N=8:

/* sin(), cos() 함수를 $log 2 (N / 4) <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>log</mi><mn>2</mn></msub><mo data-mjx-texclass="NONE"></mo><mo stretchy="false">(</mo><mi>N</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>4</mn><mo stretchy="false">)</mo></math>$ 번만 호출하면 된다. 이마저도 2배각 공식을 쓰면 sqrt() 호출로 대체할 수 있다 */

#define PI	3.1415926535897932384
#define SWAP(a, b) {double temp = (a); (a) = (b); (b)=temp;}
int fft(int n, double x[], double y[]) {
    if ((n !=0) && ((n & (n-1)) != 0) ) return 0;
    // bit-reversal;
    for (int i = 0, j = 0; i < n-1; i++) {
        if (i < j) {
            SWAP(x[i], x[j]); SWAP(y[i], y[j]);
        }
        int k = n >> 1;
        while (k <= j) {
            j -= k; k >>= 1;
        }
        j += k;
    }
    // Danileson-Lanczos section;
    double rotwr = -1, rotwi = 0;
    for (int loop = 1; loop < n; loop <<= 1) {	
        int block = loop << 1;				// block-size;2->4->8->...
        if (loop > 2) {
            // rotation factor of twiddle factor;
            double theta = PI / loop;
            rotwr = cos(theta);
            rotwi = -sin(theta);
        } else if (loop == 2) {
            rotwr = 0;
            rotwi = -1;
        }	
        // starting twiddle factor;
        double wr = 1;
        double wi = 0;
        for (int j = 0; j < loop; ++j) { 
            // 각 block의 같은 위치에서 동일한 twiddle factor가 곱해진다는 
            // 사실을 이용함;
            for (int i = j; i < n; i += block) {
                int ip = i + loop;
                // step-1; X[ip]*W
                double xwre = x[ip] * wr - y[ip] * wi;
                double xwim = x[ip] * wi + y[ip] * wr;
                // step-2; X[ip] = X[i] - X[ip]*W;
                x[ip] = x[i] - xwre;
                y[ip] = y[i] - xwim;
                // step-3; X[i] = X[i] + X[ip]*W;
                x[i] += xwre;
                y[i] += xwim;
            };
            // 각 block의 다음 차례에 곱해지는 twiddle factor 계산; T = W * rotW;
            double tre = wr * rotwr - wi * rotwi;
            double tim = wi * rotwr + wr * rotwi;
            wr = tre;
            wi = tim;
        }
    }
    return 1;
    // 역변환: (1/N)*conjugate[FFT[conjugate(X)]] 
}

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

FFT를 이용한 inverse FFT (0)	2024.07.31
CLAHE (2) (1)	2024.06.26
Approximate Distance Transform (0)	2024.06.02
Graph-based Segmentation (1)	2024.05.26
Linear Least Square Fitting: perpendicular offsets (0)	2024.03.22

CLAHE (2)

Image Recognition/Fundamental 2024. 6. 26. 15:17

픽셀값의 분포가 좁은 구간에 한정된 영상은 낮은 명암 대비(contrast)를 가지게 된다. 이 경우 픽셀값을 가능한 넓은 구간에 재분포하도록 변환시키면 전체적으로 훨씬 높은 contrast를 가지는 영상을 얻을 수 있는데 이러한 기법을 histogram equalization이라고 한다. Histogram equalization은 픽셀 분포의 cdf(cumulative distribution function)가 가능한 직선이 되도록 변형을 시킨다. 그러나 어떤 경우에는 과도한 contrast로 인해서 영상의 질이 더 안 좋아지는 경우가 발생하거나 영상의 일부 영역에서는 오히려 contrast가 감소하는 경향이 발생할 수 있다. 이를 개선하기 위해는 cdf의 과도한 변형을 막고, equaliztion을 국소적으로 다르게 적용할 수 있는 adaptive 알고리즘을 고려해야 한다. 픽셀값의 분포를 연속적 변수로 생각하면 히스토그램 $h (x) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></math>$ 은 픽셀값의 pdf(probability density function)를 형성하고, 히스토그램의 최댓값과 최솟값을 각각 $m = min (h) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>m</mi><mo>=</mo><mtext>min</mtext><mo stretchy="false">(</mo><mi>h</mi><mo stretchy="false">)</mo></math>$ , $M = max (h) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>M</mi><mo>=</mo><mtext>max</mtext><mo stretchy="false">(</mo><mi>h</mi><mo stretchy="false">)</mo></math>$ 라면

$m≤1255∫2550h(x)dx≤M<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>m</mi><mo>≤</mo><mfrac><mn>1</mn><mn>255</mn></mfrac><msubsup><mo data-mjx-texclass="OP">∫</mo><mn>0</mn><mrow data-mjx-texclass="ORD"><mn>255</mn></mrow></msubsup><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>d</mi><mi>x</mi><mo>≤</mo><mi>M</mi></math>$

Histogram equalization은 $m \approx M <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>m</mi><mo>\approx</mo><mi>M</mi></math>$ 이 되도록 $h (x) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></math>$ 을 변환을 시키는 과정에 해당한다. Histogram 함수는 cdf함수의 미분값이므로 cdf 곡선의 접선의 기울기에 해당하고, histogram equation은 cdf 함수의 기울기가 모든 픽셀값에서 일정하게 조정하는 과정이 된다. 과도한 contrast를 갖는 영상을 만들지 않기 위해서 일정한 cdf의 기울기를 요구하지 않고 일정범위를 넘어서는 경우만 제한을 두도록 하자. 제한값은 cdf의 평균변화율

$―s=1255∫2550h(x)dx<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mover><mi>s</mi><mo accent="true">―</mo></mover><mo>=</mo><mfrac><mn>1</mn><mn>255</mn></mfrac><msubsup><mo data-mjx-texclass="OP">∫</mo><mn>0</mn><mrow data-mjx-texclass="ORD"><mn>255</mn></mrow></msubsup><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mi>d</mi><mi>x</mi></math>$

을 기준으로 선택하면 된다. 제한 기준으로 $s l o p e \times ― s <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi><mo>\times</mo><mover><mi>s</mi><mo accent="true">―</mo></mover></math>$ 를 선택하면 변형된 histogram은

$h modified (x) = {s l o p e \times ― s if h (x) \geq s l o p e \times ― s h (x) otherwise <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>h</mi><mtext>modified</mtext></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">{</mo><mtable columnspacing="1em" rowspacing="4pt"><mtr><mtd><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi><mo>\times</mo><mover><mi>s</mi><mo accent="true">―</mo></mover></mtd><mtd><mtext>if</mtext><mtext> </mtext><mtext> </mtext><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>\geq</mo><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi><mo>\times</mo><mover><mi>s</mi><mo accent="true">―</mo></mover></mtd></mtr><mtr><mtd><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></mtd><mtd><mtext>otherwise</mtext></mtd></mtr></mtable><mo data-mjx-texclass="CLOSE" fence="true" stretchy="true" symmetric="true"></mo></mrow></math>$

확률을 보존하기 위해 기준을 넘는 픽셀 분포는 초과분을 histogram의 모든 bin에 고르게 재분배를 하는 과정이 필요하다.

$h modified (x) ⟹ redistribution of excess ˜ h modified (x) <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>h</mi><mtext>modified</mtext></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><munder><mo stretchy="false">⟹</mo><mtext>redistribution of excess</mtext></munder><msub><mrow data-mjx-texclass="ORD"><mover><mi>h</mi><mo stretchy="false">~</mo></mover></mrow><mtext>modified</mtext></msub><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo></math>$

이 새로이 변환된 histogram에 대해 equalization을 적용하여 윈도 중심 픽셀의 값을 설정한다. 이 알고리즘을 contrast limited histogram equalization(CLAHE)라고 부른다. 보통 CLAHE 알고리즘에서는 각각의 픽셀에 대해 윈도를 선택하지 않고, 원본 영상을 겹치는 타일영역으로 분할한 후 각각의 타일에서 제한된 histogram equalization을 변환을 구한다. 그리고 겹치는 인접 타일에서 변환의 선형보간을 이용하면 타일과 타일 사이에서 계단현상을 막을 수 있다. 여기서는 각각의 픽셀에 대해서 슬라이딩 윈도를 설정하는 방법을 사용한다. 윈도 크기는 영상에서 관심 대상을 충분히 커버할 정도의 크기여야 한다. 또한 $s l o p e <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi></math>$ 은 보통 $3 \sim 4 <math xmlns="http://www.w3.org/1998/Math/MathML"><mn>3</mn><mo>\sim</mo><mn>4</mn></math>$ 정도를 선택한다. $s l o p e \to 1 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi><mo accent="false" stretchy="false">\to</mo><mn>1</mn></math>$ 인 경우는 원본 영상과 거의 변화가 없고( $h (x) \approx ― s <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>h</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>\approx</mo><mover><mi>s</mi><mo accent="true">―</mo></mover></math>$ ), $s l o p e ≫ 1 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi><mi>l</mi><mi>o</mi><mi>p</mi><mi>e</mi><mo>≫</mo><mn>1</mn></math>$ 이면 재분배되는 픽셀이 없으므로 마지막 단계에서 equalization이 global equalization과 동일하게 된다.

static const int halfwindow	= 63;
static const double slope	= 3;
void clahe2(BYTE **input, int width, int height, BYTE **output) {
    const int bins = 255;
    int hist[bins+1], clippedHist[bins+1];
    for (int y = 0; y < height; ++y) {
        int top = max(0, y - halfwindow);
        int bottom = min(height-1, y + halfwindow);
        int h = bottom - top + 1;

        /* 픽셀 access를 줄이기 위해 sliding 윈도우 기법을 사용함;*/
        memset(hist, 0, sizeof(hist));
        int right0 = min(width-1, halfwindow);	// x=0일 때 오른쪽;
        for (int yi = top; yi <= bottom; ++yi)
            for (int xi = 0; xi < right0; ++xi) // x=right는 다음 step에서 더해짐;
                ++hist[input[yi][xi]];

        for (int x = 0; x < width; ++x) {
            int left = max(0, x - halfwindow );
            int right = x + halfwindow;
            int w = min(width-1, right) - left + 1;
            int npixs = h * w;	// =number of pixels inside the sliding window;

            int limit = int(slope * npixs / bins + 0.5);  
            // slope >>1 -->hist_eq;
            /* 윈도우가 1픽셀 이동하면 왼쪽 열은 제거; */
            if (left > 0)
                for (int yi = top; yi <= bottom; ++yi)
                    --hist[input[yi][left-1]];						

            /* 윈도우가 움직이면 오른쪽 열을 추가함; */
            if (right < width)
                for (int yi = top; yi <= bottom; ++yi)
                    ++hist[input[yi][right]];						

            /* clip histogram and redistribute excess; */
            memcpy(clippedHist, hist, sizeof(hist));
            int excess = 0, excessBefore;
            do {
                excessBefore = excess;
                excess = 0;
                for (int i = 0; i <= bins; ++i) {
                    int over = clippedHist[i] - limit;
                    if (over > 0) {
                        excess += over;
                        clippedHist[i] = limit;
                    }
                }

                int avgExcess = excess / (bins + 1);
                int residual  = excess % (bins + 1);// 각 bin에 분배하고 남은 나머지;
                for (int i = 0; i <= bins; ++i)	// 먼저 전구간에 avgExcess를 분배;
                    clippedHist[i] += avgExcess;

                if (residual != 0) {
                    int step = bins / residual;	// 나머지는 일정한 간격마다 1씩 분배;
                    for (int i = 0; i <= bins; i += step)
                        ++clippedHist[i];
                }
            } while (excess != excessBefore);

            /* clipped histogram의 cdf 구성;*/
            int hMin = bins;
            for (int i = 0; i < hMin; ++i)
                if (clippedHist[i] != 0) hMin = i;
            int cdfMin = clippedHist[hMin];
            
            int v = input[y][x]; //현 위치의 픽셀 값;
            int cdf = 0;
            for (int i = hMin; i <= v; ++i) // cdf(현 픽셀);
                cdf += clippedHist[i];

            int cdfMax = cdf;
            for (int i = v + 1; i <= bins; ++i) // 총 픽셀;
                cdfMax += clippedHist[i];

            // 현픽셀 윈도의 clipped histogram을 구하고, 
            // 이 clipped histogram을 써서 equaliztion을 함;
            output[y][x] = int((cdf - cdfMin)/double(cdfMax - cdfMin) * 255);
        }
    }
}

https://kipl.tistory.com/291

Contrast Limited Adaptive Histogram Equalization (CLAHE)

Contrast Limited Adaptive Histogram Equalization (CLAHE). CLAHE는 영상의 평탄화 (histogram equalization) 과정에서 contrast가 과도하게 증폭이 되는 것을 방지하도록 설계된 adaptive algorithm의 일종이다. CLAHE algorithm은

kipl.tistory.com

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

FFT를 이용한 inverse FFT (0)	2024.07.31
FFT 구현 (0)	2024.07.31
Approximate Distance Transform (0)	2024.06.02
Graph-based Segmentation (1)	2024.05.26
Linear Least Square Fitting: perpendicular offsets (0)	2024.03.22

이전 1 2 3 4 ··· 25 다음

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

Geometry & Recognition

FFT를 이용한 inverse FFT

'Image Recognition > Fundamental' 카테고리의 다른 글

FFT 구현

'Image Recognition > Fundamental' 카테고리의 다른 글

CLAHE (2)

'Image Recognition > Fundamental' 카테고리의 다른 글

카테고리

태그목록

최근에 올라온 글

최근에 달린 댓글

글 보관함

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역