'Image Recognition/Fundamental' 카테고리의 글 목록 (3 Page)

Cubic Spline Kernel

Image Recognition/Fundamental 2024. 3. 12. 22:11

일정한 간격 $h <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>h</mi></math>$ 마다 샘플링된 데이터 ${(x k, f k)} <math xmlns="http://www.w3.org/1998/Math/MathML"><mo fence="false" stretchy="false">{</mo><mo stretchy="false">(</mo><msub><mi>x</mi><mi>k</mi></msub><mo>,</mo><msub><mi>f</mi><mi>k</mi></msub><mo stretchy="false">)</mo><mo fence="false" stretchy="false">}</mo></math>$ 를 이용해서 이들 데이터를 표현하는 spline를 구해보자. spline은 주어진 샘플링 데이터을 통과할 필요는 없으므로 일반적으로 interpolation 함수는 아니다. 이 spline은 샘플링 데이터와 kernel이라고 불리는 함수의 convolution 형태로 표현할 수 있다.

$g(x)=∑kfkK(x−xkh)<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>g</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><munder><mo data-mjx-texclass="OP">∑</mo><mi>k</mi></munder><msub><mi>f</mi><mi>k</mi></msub><mi>K</mi><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">(</mo><mfrac><mrow><mi>x</mi><mo>−</mo><msub><mi>x</mi><mi>k</mi></msub></mrow><mi>h</mi></mfrac><mo data-mjx-texclass="CLOSE">)</mo></mrow></math>$

이미지의 resampling 과정에서 spline를 이용하는데 이때 사용 가능한 kernel의 형태와 그 효과를 간단히 알아보자.

3차 spline kernel은 중심을 기준으로 반지름이 2인 영역 $(- 2, 1), (- 1, 0), (0, 1), (1, 2) <math xmlns="http://www.w3.org/1998/Math/MathML"><mo stretchy="false">(</mo><mo>-</mo><mn>2</mn><mo>,</mo><mn>1</mn><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mo>-</mo><mn>1</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo stretchy="false">)</mo></math>$ 에서만 0이 아닌 piecewise 삼차함수다. 그리고 이 함수는 우함수의 특성을 갖는다. 따라서 가능한 형태는

처럼 쓸 수 있다. 계수를 완전히 결정하기 위해서는 8개의 조건이 필요한다. 우선 우함수이므로 원점에서 미분값이 제대로 정의되려면 $C 1 = 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msub><mi>C</mi><mn>1</mn></msub><mo>=</mo><mn>0</mn></math>$ 도 만족해야 한다, 그리고 각 node에서 연속성을 요구하면

$s = 1 \pm : A 1 + B 1 + D 1 = A 2 + B 2 + C 2 + D 2 s = 2 \pm : 8 A 2 + 4 B 2 + 2 C 2 + D 2 = 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtable displaystyle="true" columnalign="right left" columnspacing="0em" rowspacing="3pt"><mtr><mtd><mi>s</mi><mo>=</mo><msup><mn>1</mn><mo>\pm</mo></msup><mo>:</mo><mtext> </mtext><mtext> </mtext><mtext> </mtext></mtd><mtd><msub><mi>A</mi><mn>1</mn></msub><mo>+</mo><msub><mi>B</mi><mn>1</mn></msub><mo>+</mo><msub><mi>D</mi><mn>1</mn></msub><mo>=</mo><msub><mi>A</mi><mn>2</mn></msub><mo>+</mo><msub><mi>B</mi><mn>2</mn></msub><mo>+</mo><msub><mi>C</mi><mn>2</mn></msub><mo>+</mo><msub><mi>D</mi><mn>2</mn></msub></mtd></mtr><mtr><mtd><mi>s</mi><mo>=</mo><msup><mn>2</mn><mo>\pm</mo></msup><mo>:</mo><mtext> </mtext><mtext> </mtext><mtext> </mtext></mtd><mtd><mn>8</mn><msub><mi>A</mi><mn>2</mn></msub><mo>+</mo><mn>4</mn><msub><mi>B</mi><mn>2</mn></msub><mo>+</mo><mn>2</mn><msub><mi>C</mi><mn>2</mn></msub><mo>+</mo><msub><mi>D</mi><mn>2</mn></msub><mo>=</mo><mn>0</mn></mtd></mtr></mtable></math>$ 임을 알 수 있다. 또한 각 node에서 부드럽게 연결되기 위해서 1차 도함수가 연속적임을 요구하면

$s = 1 \pm : 3 A 1 + 2 B 1 = 3 A 2 + 2 B 2 + C 2 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtable displaystyle="true" columnalign="right left" columnspacing="0em" rowspacing="3pt"><mtr><mtd><mi>s</mi><mo>=</mo><msup><mn>1</mn><mo>\pm</mo></msup><mo>:</mo><mtext> </mtext><mtext> </mtext><mtext> </mtext></mtd><mtd><mn>3</mn><msub><mi>A</mi><mn>1</mn></msub><mo>+</mo><mn>2</mn><msub><mi>B</mi><mn>1</mn></msub><mo>=</mo><mn>3</mn><msub><mi>A</mi><mn>2</mn></msub><mo>+</mo><mn>2</mn><msub><mi>B</mi><mn>2</mn></msub><mo>+</mo><msub><mi>C</mi><mn>2</mn></msub></mtd></mtr></mtable></math>$

그리고 샘플링된 데이터가 모두 같은 경우 보간함수도 상수함수가 되는 것이 타당하므로

$g(x)=∑kK(x−xkh)=1 if ∀fj=1<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>g</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><munder><mo data-mjx-texclass="OP">∑</mo><mi>k</mi></munder><mi>K</mi><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">(</mo><mfrac><mrow><mi>x</mi><mo>−</mo><msub><mi>x</mi><mi>k</mi></msub></mrow><mi>h</mi></mfrac><mo data-mjx-texclass="CLOSE">)</mo></mrow><mo>=</mo><mn>1</mn><mtext> </mtext><mtext> </mtext><mtext> </mtext><mtext>if</mtext><mtext> </mtext><mtext> </mtext><mi mathvariant="normal">∀</mi><msub><mi>f</mi><mi>j</mi></msub><mo>=</mo><mn>1</mn></math>$

을 만족시켜야 한다. $x j < x < x j + 1 <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>x</mi><mi>j</mi></msub><mo><</mo><mi>x</mi><mo><</mo><msub><mi>x</mi><mrow data-mjx-texclass="ORD"><mi>j</mi><mo>+</mo><mn>1</mn></mrow></msub></math>$ 일 때 $x = x j + s h, (0 < s < 1) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>x</mi><mo>=</mo><msub><mi>x</mi><mi>j</mi></msub><mo>+</mo><mi>s</mi><mi>h</mi><mo>,</mo><mtext> </mtext><mo stretchy="false">(</mo><mn>0</mn><mo><</mo><mi>s</mi><mo><</mo><mn>1</mn><mo stretchy="false">)</mo></math>$ 로 쓸 수 있고, kernel이 반지름이 2인 support를 가지므로

$g (x) = K (s + 1) + K (s) + K (s - 1) + K (s - 2) = 1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>g</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo>+</mo><mn>1</mn><mo stretchy="false">)</mo><mo>+</mo><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo stretchy="false">)</mo><mo>+</mo><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo>-</mo><mn>1</mn><mo stretchy="false">)</mo><mo>+</mo><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo>-</mo><mn>2</mn><mo stretchy="false">)</mo><mo>=</mo><mn>1</mn></math>$

임을 알 수 있다. 위에서 주어진 $K (s) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo stretchy="false">)</mo></math>$ 을 대입해서 정리하면 다음과 같은 항등식을 얻는다.

$- 1 + A 1 + 9 A 2 + B 1 + 5 B 2 + 3 C 2 + 2 D 1 + 2 D 2 + (- 3 A 1 - 9 A 2 - 2 B 1 - 2 B 2) s + (3 A 1 + 9 A 2 + 2 B 1 + 2 B 2) s 2 = 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>-</mo><mn>1</mn><mo>+</mo><msub><mi>A</mi><mn>1</mn></msub><mo>+</mo><mn>9</mn><msub><mi>A</mi><mn>2</mn></msub><mo>+</mo><msub><mi>B</mi><mn>1</mn></msub><mo>+</mo><mn>5</mn><msub><mi>B</mi><mn>2</mn></msub><mo>+</mo><mn>3</mn><msub><mi>C</mi><mn>2</mn></msub><mo>+</mo><mn>2</mn><msub><mi>D</mi><mn>1</mn></msub><mo>+</mo><mn>2</mn><msub><mi>D</mi><mn>2</mn></msub><mspace linebreak="newline"></mspace><mo>+</mo><mo stretchy="false">(</mo><mo>-</mo><mn>3</mn><msub><mi>A</mi><mn>1</mn></msub><mo>-</mo><mn>9</mn><msub><mi>A</mi><mn>2</mn></msub><mo>-</mo><mn>2</mn><msub><mi>B</mi><mn>1</mn></msub><mo>-</mo><mn>2</mn><msub><mi>B</mi><mn>2</mn></msub><mo stretchy="false">)</mo><mi>s</mi><mo>+</mo><mo stretchy="false">(</mo><mn>3</mn><msub><mi>A</mi><mn>1</mn></msub><mo>+</mo><mn>9</mn><msub><mi>A</mi><mn>2</mn></msub><mo>+</mo><mn>2</mn><msub><mi>B</mi><mn>1</mn></msub><mo>+</mo><mn>2</mn><msub><mi>B</mi><mn>2</mn></msub><mo stretchy="false">)</mo><msup><mi>s</mi><mn>2</mn></msup><mo>=</mo><mn>0</mn></math>$

이 항등식의 계수가 0이 되어야 한다는 사실에서 2 개의 추가 조건을 얻으므로 총 8개 계수 중 2개가 미결정 free parameter로 남는다. 보통 이 두 계수는 $D 1 = 1 - B / 3 <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>D</mi><mn>1</mn></msub><mo>=</mo><mn>1</mn><mo>-</mo><mi>B</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>3</mn></math>$ , $D 2 = 4 C + 4 B / 3 <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>D</mi><mn>2</mn></msub><mo>=</mo><mn>4</mn><mi>C</mi><mo>+</mo><mn>4</mn><mi>B</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>3</mn></math>$ 처럼 매개화한다. 이 경우 kernel 함수는

$K(s)=16{(12−9B−6C)|s|3+(−18+12B+6C)|s|2+(6−2B)|s|<1(−B−6C)|s|3+(6B+30C)|s|2+(−12B−48C)|s|+(8B+24C)1≤|s|<20otherwise<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo stretchy="false">)</mo><mo>=</mo><mfrac><mn>1</mn><mn>6</mn></mfrac><mrow data-mjx-texclass="INNER"><mo data-mjx-texclass="OPEN">{</mo><mtable columnspacing="1em" rowspacing="4pt"><mtr><mtd><mo stretchy="false">(</mo><mn>12</mn><mo>−</mo><mn>9</mn><mi>B</mi><mo>−</mo><mn>6</mn><mi>C</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>3</mn></msup><mo>+</mo><mo stretchy="false">(</mo><mo>−</mo><mn>18</mn><mo>+</mo><mn>12</mn><mi>B</mi><mo>+</mo><mn>6</mn><mi>C</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>+</mo><mo stretchy="false">(</mo><mn>6</mn><mo>−</mo><mn>2</mn><mi>B</mi><mo stretchy="false">)</mo></mtd><mtd><mo stretchy="false">|</mo><mi>s</mi><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo><</mo><mn>1</mn></mtd></mtr><mtr><mtd><mo stretchy="false">(</mo><mo>−</mo><mi>B</mi><mo>−</mo><mn>6</mn><mi>C</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>3</mn></msup><mo>+</mo><mo stretchy="false">(</mo><mn>6</mn><mi>B</mi><mo>+</mo><mn>30</mn><mi>C</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>+</mo><mo stretchy="false">(</mo><mo>−</mo><mn>12</mn><mi>B</mi><mo>−</mo><mn>48</mn><mi>C</mi><mo stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo>+</mo><mo stretchy="false">(</mo><mn>8</mn><mi>B</mi><mo>+</mo><mn>24</mn><mi>C</mi><mo stretchy="false">)</mo></mtd><mtd><mn>1</mn><mo>≤</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mi>s</mi><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo><</mo><mn>2</mn></mtd></mtr><mtr><mtd><mn>0</mn></mtd><mtd><mtext>otherwise</mtext></mtd></mtr></mtable><mo data-mjx-texclass="CLOSE" fence="true" stretchy="true" symmetric="true"></mo></mrow></math>$

따라서 cubic spline kernel은 두 개의 파라미터 (B,C)에 의해서 정해진다. 또한 kernel 함수의 적분은 $B, C <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>B</mi><mo>,</mo><mi>C</mi></math>$ 에 상관없이 항상 1이어서 총 가중치의 합이 1임이 자동으로 보증된다.

$\int \infty - \infty K (s) d s = 1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msubsup><mo data-mjx-texclass="OP">\int</mo><mrow data-mjx-texclass="ORD"><mo>-</mo><mi mathvariant="normal">\infty</mi></mrow><mi mathvariant="normal">\infty</mi></msubsup><mi>K</mi><mo stretchy="false">(</mo><mi>s</mi><mo stretchy="false">)</mo><mi>d</mi><mi>s</mi><mo>=</mo><mn>1</mn></math>$

이 중에는 이미지의 resampling에서 많이 사용되는 커널도 있는데, 잘 알려진 경우를 보면

$(B, C) = (0, 1) Cardinal spline (B, C) = (0, 1 / 2) Catmull-Rom spline (B, C) = (0, 3 / 4) used in photoshop (B, C) = (1 / 3, 1 / 3) Mitchell-Netravali spline (B, C) = (1, 0) B-spline <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtable columnspacing="1em" rowspacing="4pt"><mtr><mtd><mo stretchy="false">(</mo><mi>B</mi><mo>,</mo><mi>C</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>1</mn><mo stretchy="false">)</mo></mtd><mtd><mtext>Cardinal spline</mtext></mtd></mtr><mtr><mtd><mo stretchy="false">(</mo><mi>B</mi><mo>,</mo><mi>C</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>1</mn><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn><mo stretchy="false">)</mo></mtd><mtd><mtext>Catmull-Rom spline </mtext></mtd></mtr><mtr><mtd><mo stretchy="false">(</mo><mi>B</mi><mo>,</mo><mi>C</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><mn>3</mn><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>4</mn><mo stretchy="false">)</mo></mtd><mtd><mtext>used in photoshop</mtext></mtd></mtr><mtr><mtd><mo stretchy="false">(</mo><mi>B</mi><mo>,</mo><mi>C</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>1</mn><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>3</mn><mo>,</mo><mn>1</mn><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>3</mn><mo stretchy="false">)</mo></mtd><mtd><mtext> Mitchell-Netravali spline</mtext></mtd></mtr><mtr><mtd><mo stretchy="false">(</mo><mi>B</mi><mo>,</mo><mi>C</mi><mo stretchy="false">)</mo><mo>=</mo><mo stretchy="false">(</mo><mn>1</mn><mo>,</mo><mn>0</mn><mo stretchy="false">)</mo></mtd><mtd><mtext>B-spline</mtext></mtd></mtr></mtable></math>$

$B = 0 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>B</mi><mo>=</mo><mn>0</mn></math>$ 인 경우는 $s = 0 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>s</mi><mo>=</mo><mn>0</mn></math>$ 일 때 1이고, $| s | = 1, 2 <math xmlns="http://www.w3.org/1998/Math/MathML"><mo stretchy="false">|</mo><mi>s</mi><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo>=</mo><mn>1</mn><mo>,</mo><mn>2</mn></math>$ 일 0이므로 interpolation kernel( $K (i - j) = δ i j <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>K</mi><mo stretchy="false">(</mo><mi>i</mi><mo>-</mo><mi>j</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>δ</mi><mrow data-mjx-texclass="ORD"><mi>i</mi><mi>j</mi></mrow></msub></math>$ )에 해당한다. 그리고 $B = 0, C = 1 / 2 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>B</mi><mo>=</mo><mn>0</mn><mo>,</mo><mi>C</mi><mo>=</mo><mn>1</mn><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>2</mn></math>$ 인 경우인 Catmul-Rom spline은 node에서 2차 도함수까지도 연속이므로 샘플링 데이터를 생성한 원 아날로그 함수에 $O (h 3) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>O</mi><mo stretchy="false">(</mo><msup><mi>h</mi><mn>3</mn></msup><mo stretchy="false">)</mo></math>$ 이내에서 가장 유사하게 근사함을 보일 수도 있다.

// Mitchell Netravali Reconstruction Filter
// B = 0    C = 0   - Hermite B-Spline interpolator 
// B = 0,   C = 1/2 - Catmull-Rom spline
// B = 1/3, C = 1/3 - Mitchell Netravali spline
// B = 1,   C = 0   - cubic B-spline
double MitchellNetravali(double x, double B, double C) {
    x = fabs(x);
    if (x >= 2) return 0;
    double xx = x*x;
    if (x >= 1) return ((-B - 6*C)*xx*x 
                + (6*B + 30*C)*xx + (-12*B - 48*C)*x 
                + (8*B + 24*C))/6;
    if (x < 1) return ((12 - 9*B - 6*C)*xx*x +
        (-18 + 12*B + 6*C) * xx + (6 - 2*B))/6;
}

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

Graph-based Segmentation (1)	2024.05.26
Linear Least Square Fitting: perpendicular offsets (0)	2024.03.22
Ellipse Fitting (0)	2024.03.02
Bilateral Filter (0)	2024.02.18
파라미터 공간에서 본 최소자승 Fitting (0)	2023.05.21

Geometry & Recognition 알고리즘,계산기하,물리학,...

Ellipse Fitting

Image Recognition/Fundamental 2024. 3. 2. 17:09

일반적인 conic section 피팅은 주어진 데이터 ${(x i, y i)} <math xmlns="http://www.w3.org/1998/Math/MathML"><mo fence="false" stretchy="false">{</mo><mo stretchy="false">(</mo><msub><mi>x</mi><mi>i</mi></msub><mo>,</mo><msub><mi>y</mi><mi>i</mi></msub><mo stretchy="false">)</mo><mo fence="false" stretchy="false">}</mo></math>$ 를 가장 잘 기술하는 이차식

$F (x, y) = a x 2 + b x y + c y 2 + d x + e y + f = 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>F</mi><mo stretchy="false">(</mo><mi>x</mi><mo>,</mo><mi>y</mi><mo stretchy="false">)</mo><mo>=</mo><mi>a</mi><msup><mi>x</mi><mn>2</mn></msup><mo>+</mo><mi>b</mi><mi>x</mi><mi>y</mi><mo>+</mo><mi>c</mi><msup><mi>y</mi><mn>2</mn></msup><mo>+</mo><mi>d</mi><mi>x</mi><mo>+</mo><mi>e</mi><mi>y</mi><mo>+</mo><mi>f</mi><mo>=</mo><mn>0</mn></math>$

의 계수 $u T = (a, b, c, d, e, f) <math xmlns="http://www.w3.org/1998/Math/MathML"><mrow data-mjx-texclass="ORD"><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup></mrow><mo>=</mo><mo stretchy="false">(</mo><mi>a</mi><mo>,</mo><mi>b</mi><mo>,</mo><mi>c</mi><mo>,</mo><mi>d</mi><mo>,</mo><mi>e</mi><mo>,</mo><mi>f</mi><mo stretchy="false">)</mo></math>$ 을 찾는 문제이다. 이 conic section이 타원이기 위해서는 2차항의 계수 사이에 다음과 같은 조건을 만족해야 한다.

$ellipse constraint: a c - b 2 / 4 > 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtext>ellipse constraint:</mtext><mtext> </mtext><mtext> </mtext><mi>a</mi><mi>c</mi><mo>-</mo><msup><mi>b</mi><mn>2</mn></msup><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mn>4</mn><mo>></mo><mn>0</mn></math>$

그리고 얼마나 잘 피팅되었난가에 척도가 필요한데 여기서는 주어진 데이터의 대수적 거리 $F (x, y) <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>F</mi><mo stretchy="false">(</mo><mi>x</mi><mo>,</mo><mi>y</mi><mo stretchy="false">)</mo></math>$ 을 이용하자. 주어진 점이 타원 위의 점이면 이 값은 정확히 0이 된다. 물론 주어진 점에서 타원까지의 거리를 사용할 수도 있으나 이는 훨씬 복잡한 문제가 된다. 따라서 해결해야 하는 문제는

을 최소화시키는 계수 벡터 $u <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">u</mi></math>$ 를 찾는 것이다. 여기서 제한조건으로 $4 a c - b 2 = 1 = u T C u <math xmlns="http://www.w3.org/1998/Math/MathML"><mn>4</mn><mi>a</mi><mi>c</mi><mo>-</mo><msup><mi>b</mi><mn>2</mn></msup><mo>=</mo><mn>1</mn><mo>=</mo><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi></math>$ 로 설정했다.

$u T <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup></math>$ 에 대해서 미분을 하면

$∂L∂uT=Su−λCu=0<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mfrac><mrow><mi>∂</mi><mi>L</mi></mrow><mrow><mi>∂</mi><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup></mrow></mfrac><mo>=</mo><mi mathvariant="bold">S</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">−</mo><mi>λ</mi><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mn mathvariant="bold">0</mn></math>$

즉, 주어진 제한조건 $4 a c - b 2 = 1 <math xmlns="http://www.w3.org/1998/Math/MathML"><mn>4</mn><mi>a</mi><mi>c</mi><mo>-</mo><msup><mi>b</mi><mn>2</mn></msup><mo>=</mo><mn>1</mn></math>$ 하에서 대수적 거리를 최소화시키는 타원방정식의 계수 $u <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">u</mi></math>$ 를 구하는 문제는 scattering matrix $S = D T D <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">S</mi><mo mathvariant="bold">=</mo><msup><mi mathvariant="bold">D</mi><mi mathvariant="bold">T</mi></msup><mi mathvariant="bold">D</mi></math>$ 에 대한 일반화된 고유값 문제로 환원이 된다.

$S u = λ C u u T C u = 1 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi mathvariant="bold">S</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mi>λ</mi><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi><mspace linebreak="newline"></mspace><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mn mathvariant="bold">1</mn></math>$

이 문제의 풀이는 직전의 포스팅에서 다른 바 있는데 $S <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">S</mi></math>$ 의 제곱근 행렬 $Q = S 1 / 2 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">Q</mi><mo mathvariant="bold">=</mo><msup><mi mathvariant="bold">S</mi><mrow data-mjx-texclass="ORD"><mn mathvariant="bold">1</mn><mrow data-mjx-texclass="ORD"><mo mathvariant="bold">/</mo></mrow><mn mathvariant="bold">2</mn></mrow></msup></math>$ 를 이용하면 된다. 주어진 고유값 $λ <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>λ</mi></math>$ 와 고유벡터 $u <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">u</mi></math>$ 가 구해지면 대수적 거리는 $u T S u = λ <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mi mathvariant="bold">u</mi><mi mathvariant="bold">T</mi></msup><mi mathvariant="bold">S</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mi>λ</mi></math>$

이므로 이를 최소화시키기 위해서는 양의 값을 갖는 고유값 중에 최소에 해당하는 고유벡터를 고르면 된다. 그런데 고유값 $λ <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>λ</mi></math>$ 의 부호별 개수는 $C <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">C</mi></math>$ 의 고유값 부호별 개수와 동일함을 보일 수 있는데 (Sylverster's law of inertia), $C <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">C</mi></math>$ 의 고유값이 ${- 2, - 1, 2, 0, 0, 0} <math xmlns="http://www.w3.org/1998/Math/MathML"><mo fence="false" stretchy="false">{</mo><mo>-</mo><mn>2</mn><mo>,</mo><mo>-</mo><mn>1</mn><mo>,</mo><mn>2</mn><mo>,</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo>,</mo><mn>0</mn><mo fence="false" stretchy="false">}</mo></math>$ 이므로 $λ > 0 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>λ</mi><mo>></mo><mn>0</mn></math>$ 인 고유값은 1개 뿐임을 알 수 있다. 따라서 $S u = λ C u <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">S</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mi>λ</mi><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi></math>$ 를 풀어서 얻은 유일한 양의 고유값에 해당하는 고유벡터가 원하는 답이 된다.

https://kipl.tistory.com/370

Least Squares Fitting of Ellipses

일반적인 이차곡선은 다음의 이차식으로 표현이 된다: $F (x, y) = a x 2 + b x y + c y 2 + d x + e y + f = 0 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>F</mi><mo stretchy="false">(</mo><mi>x</mi><mo>,</mo><mi>y</mi><mo stretchy="false">)</mo><mo>=</mo><mi>a</mi><msup><mi>x</mi><mn>2</mn></msup><mo>+</mo><mi>b</mi><mi>x</mi><mi>y</mi><mo>+</mo><mi>c</mi><msup><mi>y</mi><mn>2</mn></msup><mo>+</mo><mi>d</mi><mi>x</mi><mo>+</mo><mi>e</mi><mi>y</mi><mo>+</mo><mi>f</mi><mo>=</mo><mn>0</mn></math>$ 6개의 계수는 모두 독립적이지 않고 어떤 종류의 이차곡선인가에 따라 제약조건이 들어온다. 주어진

kipl.tistory.com

https://kipl.tistory.com/565

Generalized eigenvalues problem

$S <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">S</mi></math>$ 가 positive definite 행렬이고, $C <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">C</mi></math>$ 는 대칭행렬일 때 아래의 일반화된 eigenvalue 문제를 푸는 방법을 알아보자. $S u = λ C u <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi mathvariant="bold">S</mi><mi mathvariant="bold">u</mi><mo mathvariant="bold">=</mo><mi>λ</mi><mi mathvariant="bold">C</mi><mi mathvariant="bold">u</mi></math>$ 타원을 피팅하는 문제에서 이런 형식의 고유값 문제에 부딛

kipl.tistory.com

Ref: https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/ellipse-pami.pdf

double FitEllipse(std::vector<CPoint>& points, double einfo[6] ) {     
    if ( points.size() < 6 ) return -1;
    double eigvals[6];
    std::vector<double> D(6 * points.size());
    double S[36];/*  S = ~D * D  */
    double C[36];
    double EIGV[36];/* R^T; transposed orthogonal matrix;*/

    double offx = 0, offy = 0;
    /* shift all points to zero */
    for(int i = points.size(); i--> 0; ) {	
        offx += points[i].x;
        offy += points[i].y;        	
    }
    offx /= points.size(); 
    offy /= points.size();

    /* for the sake of numerical stability, scale down to [-1:1];*/
    double smax = points[0].x, smin = points[0].y;
    for (int i = points.size(); i-->1; ) {
        smax = max(smax, max(points[i].x, points[i].y));
        smin = min(smin, min(points[i].x, points[i].y));
    }
    double scale = smax - smin; 
    double invscale = 1 / scale;
    /* ax^2 + bxy + cy^2 + dx + ey + f = 0*/
    /* fill D matrix rows as (x*x, x*y, y*y, x, y, 1 ) */
    for(int i = points.size(); i--> 0; ) {	
        double x = points[i].x - offx; x *= invscale; 
        double y = points[i].y - offy; y *= invscale;
        D[i*6 + 0] = x*x; D[i*6 + 1] = x*y;
        D[i*6 + 2] = y*y; D[i*6 + 3] = x;
        D[i*6 + 4] = y;   D[i*6 + 5] = 1;		
    }			

    /* scattering matrix: S = ~D * D (6x6)*/
    for (int i = 0; i < 6; i++) 
        for (int j = i; j < 6; j++) { /*upper triangle;*/
            double s = 0;
            for (int k = points.size(); k-- > 0; ) 
                s += D[k*6 + i] * D[k*6 + j];
            S[i*6 + j] = s;
        }
    for (int i = 1; i < 6; i++) /*lower triangle;*/
        for (int j = 0; j < i; j++) 	
            S[i*6 + j] = S[j*6 + i] ;
    
    /* fill constraint matrix C */
    for (int i = 0; i < 36 ; i++ ) C[i] = 0;
    C[12] =  2 ;//2x0 
    C[2 ] =  2 ;//0x2 
    C[7 ] = -1 ;//1x1

    /* find eigenvalues/vectors of scattering matrix; */
    double RT[36];	/* each row contains eigenvector; */
    JacobiEigens ( S, RT, eigvals, 6, 0 );
    /* create R and INVQ;*/
    double R[36];
    for (int i = 0; i < 6 ; i++) {
        eigvals[i] = sqrt(eigvals[i]);
        for ( int k = 0; k < 6; k++ ) {
            R[k*6 + i] = RT[i*6 + k];  /* R = orthogonal mat = transpose(RT);*/
            RT[i*6 + k] /= eigvals[i]; /* RT /= sqrt(eigenvalue) row-wise)*/
        }
    }
    /* create INVQ=R*(1/sqrt(eigenval))*RT;*/
    double INVQ[36];
    _MatrixMul(R, RT, 6, INVQ);

    /* create matrix INVQ*C*INVQ */
    double TMP1[36], TMP2[36];
    _MatrixMul(INVQ, C, 6, TMP1 );
    _MatrixMul(TMP1, INVQ, 6, TMP2 );
    
    /* find eigenvalues and vectors of INVQ*C*INVQ:*/
    JacobiEigens ( TMP2, EIGV, eigvals, 6, 0 );
    /* eigvals stores eigenvalues in descending order of abs(eigvals);*/
    /* search for a unique positive eigenvalue;*/
    int index = -1, count = 0;
    for (int i = 0 ; i < 3; i++ ) {
        if (eigvals[i] > 0) {
            index = i; // break;
            count++;
        }
    }
    /* only 3 eigenvalues must be non-zero 
    ** and only one of them must be positive;*/
    if ((count != 1) || (index == -1)) 
        return -1;
     
    /* eigenvector what we want: u = INVQ * v */
    double u[6]; 
    double *vec = &EIGV[index*6];
    for (int i = 0; i < 6 ; i++) {
        double s = 0;
        for (int k = 0; k < 6; k++) s += INVQ[i*6 + k] * vec[k];
        u[i] = s;
    }
    /* extract shape infos;*/
    PoseEllipse(u, einfo);
    /* recover original scale; center(0,1) and radii(2,3)*/
    for (int i = 0; i < 4; i++) einfo[i] *= scale;
    /* recover center */
    einfo[0] += offx; 
    einfo[1] += offy;
    return FitError(points, offx, offy, scale, u);
};

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

Linear Least Square Fitting: perpendicular offsets (0)	2024.03.22
Cubic Spline Kernel (1)	2024.03.12
Bilateral Filter (0)	2024.02.18
파라미터 공간에서 본 최소자승 Fitting (0)	2023.05.21
영상에 Impulse Noise 넣기 (2)	2023.02.09

Geometry & Recognition 알고리즘,계산기하,물리학,...

Bilateral Filter

Image Recognition/Fundamental 2024. 2. 18. 14:16

$BF[I]p=1Wp∑q∈SGσs(||p−q||)Gσr(|Ip−Iq|)IqWp=∑q∈SGσs(||p−q||)Gσr(|Ip−Iq|)Gσ(r)=e−||r||2/2σ2<math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mtable displaystyle="true" columnspacing="1em" rowspacing="3pt"><mtr><mtd><mi>B</mi><mi>F</mi><mo stretchy="false">[</mo><mi>I</mi><msub><mo stretchy="false">]</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow></msub><mo>=</mo><mfrac><mn>1</mn><msub><mi>W</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow></msub></mfrac><munder><mo data-mjx-texclass="OP">∑</mo><mrow data-mjx-texclass="ORD"><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow><mo>∈</mo><mi>S</mi></mrow></munder><msub><mi>G</mi><mrow data-mjx-texclass="ORD"><msub><mi>σ</mi><mi>s</mi></msub></mrow></msub><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow><mo>−</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo><msub><mi>G</mi><mrow data-mjx-texclass="ORD"><msub><mi>σ</mi><mi>r</mi></msub></mrow></msub><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>I</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow></msub><mo>−</mo><msub><mi>I</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow></msub><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo><msub><mi>I</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow></msub></mtd></mtr><mtr><mtd><msub><mi>W</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow></msub><mo>=</mo><munder><mo data-mjx-texclass="OP">∑</mo><mrow data-mjx-texclass="ORD"><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow><mo>∈</mo><mi>S</mi></mrow></munder><msub><mi>G</mi><mrow data-mjx-texclass="ORD"><msub><mi>σ</mi><mi>s</mi></msub></mrow></msub><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow><mo>−</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo><msub><mi>G</mi><mrow data-mjx-texclass="ORD"><msub><mi>σ</mi><mi>r</mi></msub></mrow></msub><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>I</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">p</mi></mrow></msub><mo>−</mo><msub><mi>I</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">q</mi></mrow></msub><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mo stretchy="false">)</mo></mtd></mtr><mtr><mtd><msub><mi>G</mi><mi>σ</mi></msub><mo stretchy="false">(</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">r</mi></mrow><mo stretchy="false">)</mo><mo>=</mo><msup><mi>e</mi><mrow data-mjx-texclass="ORD"><mo>−</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">r</mi></mrow><mrow data-mjx-texclass="ORD"><mo mathvariant="bold" stretchy="false">|</mo></mrow><msup><mrow data-mjx-texclass="ORD"><mo mathvariant="bold" stretchy="false">|</mo></mrow><mn mathvariant="bold">2</mn></msup><mrow data-mjx-texclass="ORD"><mo mathvariant="bold">/</mo></mrow><mn mathvariant="bold">2</mn><msup><mi>σ</mi><mn mathvariant="bold">2</mn></msup></mrow></msup></mtd></mtr></mtable></math>$

smoothing based on the nonlinear heat eq

// sigmar controls the intensity range that is smoothed out. 
// Higher values will lead to larger regions being smoothed out. 
// The sigmar value should be selected with the dynamic range of the image pixel values in mind.
// sigmas controls smoothing factor. Higher values will lead to more smoothing.
// convolution through using lookup tables.
int BilateralFilter(BYTE *image, int width, int height, 
    double sigmas, double sigmar, int ksize, BYTE* out) {
    //const double sigmas = 1.7;
    //const double sigmar = 50.;
    double sigmas_sq = sigmas * sigmas;
    double sigmar_sq = sigmar * sigmar;
    //const int ksize = 7;
    const int hksz = ksize / 2;
    ksize = hksz * 2 + 1;
    std::vector<double> smooth(width * height, 0);
    // LUT for spatial gaussian;
    std::vector<double> spaceKer(ksize * ksize, 0);
    for (int j = -hksz, pos = 0; j <= hksz; j++) 
        for (int i = -hksz; i <= hksz; i++) 
            spaceKer[pos++] = exp(- 0.5 * double(i * i + j * j)/ sigmas_sq); 
    // LUT for image similarity gaussian;
    double pixelKer[256];
    for (int i = 0; i < 256; i++)
        pixelKer[i] = exp(- 0.5 * double(i * i) / sigmar_sq);

    for (int y = 0, imgpos = 0; y < height; y++) {
        int top = y - hksz;
        int bot = y + hksz;
        for (int x = 0; x < width; x++) {
            int left = x - hksz;
            int right = x + hksz;
            // convolution;
            double wsum = 0;
            double fsum = 0; 	
            int refVal = image[imgpos];
            for (int yy = top, kpos = 0; yy <= bot; yy++) {
                for (int xx = left; xx <= right; xx++) {
                    // check whether the kernel rect is inside the image;
                    if ((yy >= 0) && (yy < height) && (xx >= 0) && (xx < width)) {
                        int curVal = image[yy * width + xx];
                        int idiff = curVal - refVal;
                        double weight = spaceKer[kpos] * pixelKer[abs(idiff)];
                        wsum += weight;
                        fsum += weight * curVal;
                    }
                    kpos++;
                }
            }
            smooth[imgpos++] = fsum / wsum;
        }
    }

    for (int k = smooth.size(); k-- > 0;) {
        int a = int(smooth[k]);
        out[k] = a < 0 ? 0: a > 255 ? 255: a;
    }
    return 1;
}

저작자표시 비영리 변경금지

'Image Recognition > Fundamental' 카테고리의 다른 글

Cubic Spline Kernel (1)	2024.03.12
Ellipse Fitting (0)	2024.03.02
파라미터 공간에서 본 최소자승 Fitting (0)	2023.05.21
영상에 Impulse Noise 넣기 (2)	2023.02.09
Canny Edge: Non-maximal suppression (0)	2023.01.11

Geometry & Recognition 알고리즘,계산기하,물리학,...

이전 1 2 3 4 5 6 ··· 25 다음

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

Geometry & Recognition

Cubic Spline Kernel

'Image Recognition > Fundamental' 카테고리의 다른 글

Ellipse Fitting

'Image Recognition > Fundamental' 카테고리의 다른 글

Bilateral Filter

'Image Recognition > Fundamental' 카테고리의 다른 글

카테고리

태그목록

최근에 올라온 글

최근에 달린 댓글

글 보관함

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역