Posts tagged 'Least Square Method'

Least Squares Fitting of Circles

Image Recognition/Fundamental 2020. 11. 11. 10:55

When a point set is fitted with a general quadratic curve, the coefficients of the equation

$$a x^2 + b y^2 + c x y + d x + e y + f = 0$$

must be determined from the given data. In practice, constraints are added according to the curve type (ellipse, parabola, hyperbola, and so on). A circle is a special case of an ellipse and in general requires the constraints $a = b$, $c = 0$. A stricter choice is to impose the additional condition $a = b = 1$. With that normalization, the case where all points lie on a straight line ($a = b = 0$) can no longer be handled. Apart from this exceptional case, the least squares method gives the coefficients very easily, which is why it is widely used.

Problem: for the quadratic curve (circle) that fits the given data,

$$x^2 + y^2 + A x + B y + C = 0,$$

find the coefficients $A, B, C$ using the least squares method.

If the given points lay exactly on a circle, the left-hand side of this equation would vanish at every point. Because real data is exposed to noise during acquisition, it generally does not. The least squares method chooses the coefficients $A, B, C$ that minimize the sum of squared deviations of the points from the circle, where the deviation of a point is measured by the residual of the circle equation. To find the extremum of the sum of squared deviations

$$L = \sum_i \left| x_i^2 + y_i^2 + A x_i + B y_i + C \right|^2,$$

differentiate with respect to $A$, $B$, and $C$:

$$\frac{\partial L}{\partial A} = 2 \sum_i (x_i^2 + y_i^2 + A x_i + B y_i + C)\, x_i = 0,$$

$$\frac{\partial L}{\partial B} = 2 \sum_i (x_i^2 + y_i^2 + A x_i + B y_i + C)\, y_i = 0,$$

$$\frac{\partial L}{\partial C} = 2 \sum_i (x_i^2 + y_i^2 + A x_i + B y_i + C) = 0.$$
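Expanding the three conditions and collecting terms shows that they form a linear system in $A, B, C$. As a sketch, with the shorthand $S_{x^p y^q} = \sum_i x_i^p y_i^q$ (so that $S_x = \sum_i x_i$, $S_y = \sum_i y_i$, and $N$ is the number of points), the system can be written in matrix form:

```latex
\begin{bmatrix}
S_{x^2} & S_{xy}  & S_x \\
S_{xy}  & S_{y^2} & S_y \\
S_x     & S_y     & N
\end{bmatrix}
\begin{bmatrix} A \\ B \\ C \end{bmatrix}
= -
\begin{bmatrix}
S_{x^3} + S_{xy^2} \\
S_{x^2 y} + S_{y^3} \\
S_{x^2} + S_{y^2}
\end{bmatrix}
```

The matrix is symmetric, and whenever $S_x = S_y = 0$ the third row decouples from the other two.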

Solving this system of equations gives $A, B, C$. To simplify the computation and improve numerical stability, carry it out in the frame of the center of mass of the input points,

$$m_x = \frac{1}{N} \sum_i x_i, \quad m_y = \frac{1}{N} \sum_i y_i.$$

To do so, subtract $m_x$ and $m_y$ from the $x$ and $y$ components of each input point and use the results as the coordinates:

$$x_i \to x_i - m_x, \quad y_i \to y_i - m_y$$

In the center-of-mass frame, $S_x = \sum_i x_i = 0$ and $S_y = \sum_i y_i = 0$.

First, the third equation gives

$$C = -\frac{S_{x^2} + S_{y^2}}{N},$$

and the first and second equations give, respectively,

$$S_{x^2} A + S_{xy} B = -(S_{x^3} + S_{xy^2})$$

$$S_{xy} A + S_{y^2} B = -(S_{y^3} + S_{x^2 y}).$$

Solving these yields

$$A = \frac{-S_{y^2}(S_{x^3} + S_{xy^2}) + S_{xy}(S_{y^3} + S_{x^2 y})}{S_{x^2} S_{y^2} - S_{xy}^2}$$

$$B = \frac{-S_{x^2}(S_{y^3} + S_{x^2 y}) + S_{xy}(S_{x^3} + S_{xy^2})}{S_{x^2} S_{y^2} - S_{xy}^2}$$

Here the moments of each order of the given data are computed, in the centered coordinates, as

$$S_{x^p y^q} = \sum_i x_i^p y_i^q, \quad \text{e.g.}\ \ S_{x^2} = \sum_i x_i^2,\ \ S_{xy} = \sum_i x_i y_i,\ \ S_{xy^2} = \sum_i x_i y_i^2.$$

The center $(c_x, c_y)$ of the estimated circle is

$$c_x = -\frac{A}{2}, \qquad c_y = -\frac{B}{2},$$

and the radius follows from

$$r^2 = c_x^2 + c_y^2 - C = c_x^2 + c_y^2 + \frac{1}{N}(S_{x^2} + S_{y^2}).$$

The center is expressed in the center-of-mass frame, so $(m_x, m_y)$ must be added back to obtain it in the original coordinates.

Ref: I. Kasa, A curve fitting procedure and its error analysis. IEEE Trans. Inst. Meas., 25:8-14, 1976

/* Implementation, revised 2024.04.01: typing errors fixed; computation moved to the center-of-mass frame. */
#include <cmath>    /* fabs, sqrt */
#include <vector>
/* CPoint is a point type with x, y members; fitError() is not shown in this post. */
/* Returns the fit error, or -1 if fewer than 3 points are given or the points are collinear. */
double circleFit_LS(std::vector<CPoint> &Q, double &cx, double &cy, double &radius) {
    if (Q.size() < 3) return -1;
    const double N = double(Q.size());
    double sx2 = 0.0, sy2 = 0.0, sxy  = 0.0;
    double sx3 = 0.0, sy3 = 0.0, sx2y = 0.0, sxy2 = 0.0;
    double mx = 0.0, my = 0.0;            /* center of mass */
    for (size_t k = 0; k < Q.size(); ++k) {
        mx += Q[k].x; my += Q[k].y;
    }
    mx /= N; my /= N;
    /* accumulate the moments in coordinates offset by (mx, my) */
    for (size_t k = 0; k < Q.size(); ++k) {
        double x = Q[k].x - mx, xx = x * x;
        double y = Q[k].y - my, yy = y * y;
        sx2  += xx;      sy2  += yy;      sxy  += x * y;
        sx3  += x * xx;  sy3  += y * yy;
        sx2y += xx * y;  sxy2 += yy * x;
    }
    double det = sx2 * sy2 - sxy * sxy;
    if (fabs(det) < 1.e-10) return -1;    /* collinear points */
    /* center in the center-of-mass frame: cx = -A/2, cy = -B/2 */
    double a = sx3 + sxy2;
    double b = sy3 + sx2y;
    cx = (sy2 * a - sxy * b) / det / 2;
    cy = (sx2 * b - sxy * a) / det / 2;
    /* radius squared: cx^2 + cy^2 - C, with C = -(sx2 + sy2) / N */
    double radsq = cx * cx + cy * cy + (sx2 + sy2) / N;
    radius = sqrt(radsq);
    cx += mx; cy += my;                   /* back to the original coordinates */
    return fitError(Q, cx, cy, radius);
}
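The function above returns `fitError(Q, cx, cy, radius)`, whose definition is not included in the post. As a minimal sketch, one plausible choice is the root-mean-square of the radial residuals. The names `Pt` and `fitErrorSketch` are hypothetical stand-ins for illustration, and the author's actual error measure may differ.

```cpp
#include <cmath>
#include <vector>

// Hypothetical point type standing in for the post's CPoint.
struct Pt { double x, y; };

// Assumed error measure: RMS of the radial residuals
// (distance from the point to the center, minus the radius).
// The post's own fitError() is not shown and may be defined differently.
double fitErrorSketch(const std::vector<Pt>& Q, double cx, double cy, double radius) {
    if (Q.empty()) return -1.0;  // same "invalid" sentinel as circleFit_LS
    double sum = 0.0;
    for (const Pt& p : Q) {
        const double d = std::hypot(p.x - cx, p.y - cy) - radius;
        sum += d * d;
    }
    return std::sqrt(sum / Q.size());
}
```

Unlike the algebraic residual minimized by the fit itself, this measure is a geometric distance, so it is expressed in the same units as the input coordinates.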

https://kipl.tistory.com/357

Circle Fitting: Pratt

To fit a circle to a given point set, use the quadratic form $A (x^2 + y^2) + B x + C y + D = 0$. The case $A = 0$ represents a straight line, and $A \neq 0$ represents a circle. One could of course set $A = 1$ ...

https://kipl.tistory.com/32

RANSAC: Circle Fit

The RANSAC algorithm is used to estimate a circle from a given 2D point set. At least three points are needed to define a circle, and they must not lie on a straight line. The circle constructed this way, with the three points as the vertices of a tri...

License: Attribution, NonCommercial, NoDerivatives

Geometry & Recognition: algorithms, computational geometry, physics, ...

Applications of 2D Savitzky-Golay Filters

Image Recognition 2012. 2. 28. 17:45

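The Savitzky-Golay filter operates on one-dimensional data in the same way as a moving average. Unlike a moving average, which gives every point in the window the same weight, it finds by least squares a polynomial that approximates the values in the window and assigns the value of that polynomial to the point in question. (Analysis in the frequency domain reveals more of the filter's properties, for example that peak positions are well preserved.) To use the filter, the degree of the polynomial and the window size must be chosen. (Once the degree is fixed, the minimum window size is determined.)

The same approach applies in two dimensions. In this case the polynomial is a function of the two variables $(x, y)$ and represents a surface defined over the 2D plane. A local filter can be used for 2D images as well, but the filter window can also be taken to be the entire image, giving a surface that approximates the whole region. For an image with non-uniform background illumination, this surface can be used to predict the effect of the illumination, and an image corrected for it helps recognition. (This is useful in many places, such as the non-uniform background produced when scanning documents for character recognition, or background estimation in 2D barcode recognition. More simply, the background variation can be approximated by a uniformly tilted plane.)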

When the image is fitted with a cubic polynomial:

$$\begin{aligned} I(x, y) &= a_{00} \\ &+ a_{10} x + a_{01} y \\ &+ a_{20} x^2 + a_{11} x y + a_{02} y^2 \\ &+ a_{30} x^3 + a_{21} x^2 y + a_{12} x y^2 + a_{03} y^3, \quad (x, y) \in \text{image} \end{aligned}$$

The polynomial is obtained by estimating the 10 filter coefficients $\mathbf{x} = [a_{00}, a_{10}, \ldots, a_{03}]^T$. In addition, Savitzky-Golay makes it easy to obtain derivatives of the image. For the local version of the filter, the filter output is the polynomial value $a_{00}$ at the window center $(x, y) = (0, 0)$. At this point the partial derivative in the $x$-direction is $a_{10}$ and the partial derivative in the $y$-direction is $a_{01}$.

The filter coefficients $\mathbf{x}$ are obtained by least squares. Substituting the coordinates and the pixel value of each pixel of an image with $N (= \text{width} \times \text{height})$ pixels into the polynomial above yields $N$ equations. In matrix form,

$$\mathbf{A} \cdot \mathbf{x} = \mathbf{b}$$

$\mathbf{A}$ is an $N \times 10$ matrix whose rows are built from the pixel coordinates. Following the coefficient order above, the row for the $i$-th pixel, located at $(x_i, y_i)$, is

$$\begin{bmatrix} 1 & x_i & y_i & x_i^2 & x_i y_i & y_i^2 & x_i^3 & x_i^2 y_i & x_i y_i^2 & y_i^3 \end{bmatrix}.$$

$\mathbf{b}$ is an $N$-dimensional column vector that holds the pixel value at each pixel position:

$$\mathbf{b} = \begin{bmatrix} I(x_0, y_0) \\ I(x_1, y_1) \\ I(x_2, y_2) \\ \vdots \end{bmatrix}$$

Applying least squares, the estimated coefficient vector $\mathbf{x}$ is the vector that minimizes $\| \mathbf{A} \cdot \mathbf{x} - \mathbf{b} \|^2$, namely

$$\mathbf{x} = (\mathbf{A}^T \cdot \mathbf{A})^{-1} \cdot \mathbf{A}^T \cdot \mathbf{b}.$$

$\mathbf{A}^T \cdot \mathbf{A}$ is a $10 \times 10$ symmetric matrix, so its inverse is easy to compute.
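The least squares solution above can be sketched in code without ever storing the $N \times 10$ matrix: $\mathbf{A}^T \cdot \mathbf{A}$ and $\mathbf{A}^T \cdot \mathbf{b}$ are accumulated pixel by pixel and the $10 \times 10$ system is then solved by Gauss-Jordan elimination. This is a minimal illustration, not the author's implementation. The names `monomials` and `fitCubicSurface`, the row-major image layout, and the singularity threshold are assumptions.

```cpp
#include <array>
#include <cmath>
#include <cstddef>
#include <utility>
#include <vector>

constexpr int kTerms = 10;  // number of coefficients of a bivariate cubic
using Coeffs = std::array<double, kTerms>;

// Monomials in the order a00, a10, a01, a20, a11, a02, a30, a21, a12, a03.
Coeffs monomials(double x, double y) {
    return {1.0, x, y, x * x, x * y, y * y,
            x * x * x, x * x * y, x * y * y, y * y * y};
}

// Fits the cubic surface to a width x height image (row-major, assumed layout)
// by solving the normal equations (A^T A) c = A^T b.
// Returns false on bad input or a numerically singular system.
bool fitCubicSurface(const std::vector<double>& image, int width, int height,
                     Coeffs& c) {
    if (width <= 0 || height <= 0 ||
        image.size() != static_cast<std::size_t>(width) * height) return false;
    double M[kTerms][kTerms + 1] = {};  // augmented matrix [A^T A | A^T b]
    for (int y = 0; y < height; ++y)
        for (int x = 0; x < width; ++x) {
            const Coeffs row = monomials(x, y);
            const double value = image[static_cast<std::size_t>(y) * width + x];
            for (int i = 0; i < kTerms; ++i) {
                for (int j = 0; j < kTerms; ++j) M[i][j] += row[i] * row[j];
                M[i][kTerms] += row[i] * value;
            }
        }
    // Gauss-Jordan elimination with partial pivoting.
    for (int col = 0; col < kTerms; ++col) {
        int pivot = col;
        for (int r = col + 1; r < kTerms; ++r)
            if (std::fabs(M[r][col]) > std::fabs(M[pivot][col])) pivot = r;
        if (std::fabs(M[pivot][col]) < 1e-12) return false;  // assumed threshold
        for (int j = 0; j <= kTerms; ++j) std::swap(M[col][j], M[pivot][j]);
        for (int r = 0; r < kTerms; ++r) {
            if (r == col) continue;
            const double f = M[r][col] / M[col][col];
            for (int j = col; j <= kTerms; ++j) M[r][j] -= f * M[col][j];
        }
    }
    for (int i = 0; i < kTerms; ++i) c[i] = M[i][kTerms] / M[i][i];
    return true;
}
```

For large images the raw pixel coordinates make the normal equations poorly conditioned, so shifting and scaling the coordinates before accumulating (for example to the range $[-1, 1]$) is a common precaution.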

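The 2D surface estimated this way represents the estimated distribution of background pixel values in the image. Take character recognition as an example, where one usually recognizes black type on a white background. Because of the black type in the scanned image, the estimated surface generally lies lower than the surface formed by the background pixels themselves. Pixels whose values fall below the estimated surface usually belong to black characters. Computing the mean of this difference therefore gives a rough way to decide whether a pixel belongs to the background (the difference from the surface is smaller than the mean and the pixel value lies below the surface) or to a character region (the difference from the surface is larger than the mean and the pixel value lies below the surface).

The estimation is then repeated using this information: this time the pixels classified as text in the first pass are excluded, which yields a surface that describes the background more accurately.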
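The classification rule described above can be sketched as follows. The function takes the pixel values and the first-pass surface evaluated at the same pixels, and returns a mask of the pixels to keep for the second pass. The name `backgroundMask` is hypothetical, and treating pixels at or above the surface as background is an assumption, since the text only discusses pixels below it.

```cpp
#include <cstddef>
#include <vector>

// Returns a mask with true for pixels treated as background.
// Assumption: pixels at or above the surface count as background.
// Returns an empty mask if the two inputs differ in size.
std::vector<bool> backgroundMask(const std::vector<double>& image,
                                 const std::vector<double>& surface) {
    if (image.size() != surface.size()) return {};
    std::vector<bool> isBackground(image.size(), true);
    // Mean gap (surface - pixel) over the pixels lying below the surface.
    double gapSum = 0.0;
    std::size_t below = 0;
    for (std::size_t i = 0; i < image.size(); ++i)
        if (image[i] < surface[i]) {
            gapSum += surface[i] - image[i];
            ++below;
        }
    if (below == 0) return isBackground;
    const double meanGap = gapSum / below;
    // Below the surface by more than the mean gap: classified as text.
    for (std::size_t i = 0; i < image.size(); ++i)
        if (image[i] < surface[i] && surface[i] - image[i] > meanGap)
            isBackground[i] = false;
    return isBackground;
}
```

In the second pass, only pixels whose mask entry is true would contribute rows to $\mathbf{A}$ and $\mathbf{b}$.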
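When the filter is used locally, the filter coefficients can be precomputed into a lookup table, just as in one dimension. When the whole image is the target, however, the matrix becomes very large and so does the amount of computation.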

[Figures not preserved: input image; first-pass estimated background image; second-pass estimated background image]


Least Squares Estimation of Perspective Transformation

Image Recognition 2012. 2. 15. 13:07

The perspective transformation between two images is described by eight parameters $(a, b, c, d, e, f, g, h)$ (see http://kipl.tistory.com/86).

[Equations not preserved in the source: the perspective transformation and its alternative form]

Finding the parameters therefore requires at least four pairs of corresponding points between the two images. Given $N$ pairs of corresponding points, substituting each pair into the transformation and rearranging yields a matrix equation. (In the original figure, every entry in the last column of the left-hand matrix should carry a minus sign.)

[Equations not preserved in the source: the list of corresponding points and the matrix equation]

Written compactly, this is

$$\mathbf{A} \cdot \mathbf{x} = \mathbf{b}.$$

Because of the noise that enters when the corresponding points are located, real data does not satisfy this equality exactly. In practice one must therefore find the $\mathbf{x}$ that minimizes the squared difference between the left-hand and right-hand sides.

$$\mathbf{x}^* = \arg\min_{\mathbf{x}} \| \mathbf{A} \cdot \mathbf{x} - \mathbf{b} \|^2.$$

To find the least squares solution, differentiate with respect to $\mathbf{x}^T$ to obtain

$$(\mathbf{A}^T \cdot \mathbf{A}) \cdot \mathbf{x} = \mathbf{A}^T \cdot \mathbf{b},$$

and solve this equation for $\mathbf{x}^*$. $\mathbf{A}^T \cdot \mathbf{A}$ is an $8 \times 8$ symmetric matrix and can be inverted, provided that at least four of the given points do not lie on a single line. The least squares solution can therefore be written as

$$\mathbf{x}^* = (\mathbf{A}^T \cdot \mathbf{A})^{-1} \cdot (\mathbf{A}^T \cdot \mathbf{b}).$$
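The estimate can be sketched in code under an explicit assumption about the parametrization, since the post's own equations are not reproduced here. The sketch assumes the common form $u = (a x + b y + c)/(g x + h y + 1)$, $v = (d x + e y + f)/(g x + h y + 1)$, which may order the parameters differently from the referenced post. Each correspondence then contributes two rows to $\mathbf{A}$, with minus signs in the columns that multiply $g$ and $h$. The names `Pt2`, `applyPerspective`, and `estimatePerspective` are hypothetical.

```cpp
#include <array>
#include <cmath>
#include <cstddef>
#include <utility>
#include <vector>

struct Pt2 { double x, y; };  // hypothetical point type

// Assumed model: u = (a x + b y + c) / w, v = (d x + e y + f) / w,
// with w = g x + h y + 1 and p = (a, b, c, d, e, f, g, h).
Pt2 applyPerspective(const std::array<double, 8>& p, Pt2 q) {
    const double w = p[6] * q.x + p[7] * q.y + 1.0;
    return {(p[0] * q.x + p[1] * q.y + p[2]) / w,
            (p[3] * q.x + p[4] * q.y + p[5]) / w};
}

// Estimates p from N >= 4 correspondences src[i] -> dst[i] by accumulating
// A^T A and A^T b and solving the 8x8 normal equations.
bool estimatePerspective(const std::vector<Pt2>& src, const std::vector<Pt2>& dst,
                         std::array<double, 8>& p) {
    if (src.size() < 4 || src.size() != dst.size()) return false;
    double M[8][9] = {};  // augmented matrix [A^T A | A^T b]
    for (std::size_t i = 0; i < src.size(); ++i) {
        const double x = src[i].x, y = src[i].y, u = dst[i].x, v = dst[i].y;
        // Two rows of A per correspondence, and the matching entries of b.
        const double rows[2][8] = {{x, y, 1, 0, 0, 0, -x * u, -y * u},
                                   {0, 0, 0, x, y, 1, -x * v, -y * v}};
        const double rhs[2] = {u, v};
        for (int r = 0; r < 2; ++r)
            for (int j = 0; j < 8; ++j) {
                for (int k = 0; k < 8; ++k) M[j][k] += rows[r][j] * rows[r][k];
                M[j][8] += rows[r][j] * rhs[r];
            }
    }
    // Gauss-Jordan elimination with partial pivoting.
    for (int col = 0; col < 8; ++col) {
        int pivot = col;
        for (int r = col + 1; r < 8; ++r)
            if (std::fabs(M[r][col]) > std::fabs(M[pivot][col])) pivot = r;
        if (std::fabs(M[pivot][col]) < 1e-12) return false;  // assumed threshold
        for (int j = 0; j <= 8; ++j) std::swap(M[col][j], M[pivot][j]);
        for (int r = 0; r < 8; ++r) {
            if (r == col) continue;
            const double f = M[r][col] / M[col][col];
            for (int j = col; j <= 8; ++j) M[r][j] -= f * M[col][j];
        }
    }
    for (int i = 0; i < 8; ++i) p[i] = M[i][8] / M[i][i];
    return true;
}
```

With pixel coordinates in the hundreds, the entries of $\mathbf{A}^T \cdot \mathbf{A}$ span many orders of magnitude, so normalizing the point coordinates before solving is a common precaution.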
