'최소자승법' 태그의 글 목록 (3 Page)

Least Squares Estimation of Perspective Transformation

Image Recognition 2012. 2. 15. 13:07

두 영상 사이의 perspective 변환은 8개의 매개변수 $(a, b, c, d, e, f, g, h) <math xmlns="http://www.w3.org/1998/Math/MathML"><mo stretchy="false">(</mo><mi>a</mi><mo>,</mo><mi>b</mi><mo>,</mo><mi>c</mi><mo>,</mo><mi>d</mi><mo>,</mo><mi>e</mi><mo>,</mo><mi>f</mi><mo>,</mo><mi>g</mi><mo>,</mo><mi>h</mi><mo stretchy="false">)</mo></math>$ 에 의해서 다음 식처럼 기술이 된다. (see, http://kipl.tistory.com/86)

또는,

따라서, 매개변수를 찾기 위해서는 두 영상에서 서로 대응하는 점이 4개 이상 주어져야 한다. N개의 대응점들이 주어진 경우

각각의 대응점을 위의 식에 대입해서 정리하면 아래의 행렬식을 얻을 수 있다.(좌변 행렬의 마지막 열은 전부 - 부호가 들어가야 한다)

또는, 간단히

$A \cdot x = b <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi mathvariant="bold">A</mi><mo>\cdot</mo><mi mathvariant="bold">x</mi><mo mathvariant="bold">=</mo><mi mathvariant="bold">b</mi></math>$

로 쓸 수 있다. 그러나 대응점을 찾을 때 들어오는 noise로 인해서 실제 데이터를 이용하는 경우에는 정확히 등호로 주어지지 않는다. 따라서, 실제 문제에서는 좌변과 우변의 차이의 제곱을 최소로 만드는 $x <math xmlns="http://www.w3.org/1998/Math/MathML"><mi mathvariant="bold">x</mi></math>$ 를 찾아야 할 것이다.

$x * = argmin x | A \cdot x - b | 2 . <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">x</mi></mrow><mrow data-mjx-texclass="ORD"><mo>*</mo></mrow></msup><mo>=</mo><munder><mtext>argmin </mtext><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">x</mi></mrow></munder><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">A</mi></mrow><mo>\cdot</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">x</mi></mrow><mo>-</mo><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">b</mi></mrow><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>.</mo></math>$

최소자승해를 찾기 위해 $x T <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi mathvariant="bold">x</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">T</mi></mrow></msup></math>$ 에 대해 미분을 하면

$(A T \cdot A) \cdot x = A T \cdot b, <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo mathvariant="bold" stretchy="false">(</mo><msup><mi mathvariant="bold">A</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">T</mi></mrow></msup><mo>\cdot</mo><mi mathvariant="bold">A</mi><mo mathvariant="bold" stretchy="false">)</mo><mo>\cdot</mo><mi mathvariant="bold">x</mi><mo mathvariant="bold">=</mo><msup><mi mathvariant="bold">A</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">T</mi></mrow></msup><mo>\cdot</mo><mi mathvariant="bold">b</mi><mo mathvariant="bold">,</mo></math>$

를 얻고, 이 식을 풀어서 $x * <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">x</mi></mrow><mo>*</mo></msup></math>$ 을 구하면 된다. $A T \cdot A <math xmlns="http://www.w3.org/1998/Math/MathML"><msup><mi mathvariant="bold">A</mi><mi mathvariant="bold">T</mi></msup><mo>\cdot</mo><mi mathvariant="bold">A</mi></math>$ 는 $8 \times 8 <math xmlns="http://www.w3.org/1998/Math/MathML"><mn>8</mn><mo>\times</mo><mn>8</mn></math>$ 의 대칭 행렬로 역행렬을 구할 수 있다 (주어진 점들 중 한 직선 위에 놓이지 않는 점이 4개 이상이 있어야 한다). 따라서 최소자승해는 다음과 같이 쓸 수 있다:

$x * = (A T \cdot A) - 1 \cdot (A T \cdot b) . <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><msup><mi mathvariant="bold">x</mi><mrow data-mjx-texclass="ORD"><mo mathvariant="bold">*</mo></mrow></msup><mo mathvariant="bold">=</mo><mo mathvariant="bold" stretchy="false">(</mo><msup><mi mathvariant="bold">A</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">T</mi></mrow></msup><mo>\cdot</mo><mi mathvariant="bold">A</mi><msup><mo mathvariant="bold" stretchy="false">)</mo><mrow data-mjx-texclass="ORD"><mo mathvariant="bold">-</mo><mn mathvariant="bold">1</mn></mrow></msup><mo>\cdot</mo><mo mathvariant="bold" stretchy="false">(</mo><msup><mi mathvariant="bold">A</mi><mrow data-mjx-texclass="ORD"><mi mathvariant="bold">T</mi></mrow></msup><mo>\cdot</mo><mi mathvariant="bold">b</mi><mo mathvariant="bold" stretchy="false">)</mo><mo mathvariant="bold">.</mo></math>$

저작자표시 비영리 변경금지

'Image Recognition' 카테고리의 다른 글

2차원 Savitzky-Golay Filters 응용 (0)	2012.02.28
webcam용 QR code detector (0)	2012.02.19
Perspective Transformation (2)	2012.02.14
Integral Image을 이용한 Adaptive Threshold (0)	2012.02.04
Peak Finder (1)	2012.02.02

Geometry & Recognition 알고리즘,계산기하,물리학,...

Savitzky-Golay Smoothing Filter

Image Recognition 2010. 3. 24. 10:55

The Savitzky–Golay method essentially performs a local polynomial regression (of degree $k <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>k</mi></math>$ ) on a series of values (of at least $k + 1 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>k</mi><mo>+</mo><mn>1</mn></math>$ points which are treated as being equally spaced in the series) to determine the smoothed value for each point.

Savitzky–Golay 필터는 일정한 간격으로 주어진 데이터들이 있을 때(이들 데이터는 원래의 정보와 노이즈를 같이 포함한다), 각각의 점에서 주변의 점들을 가장 잘 피팅하는 $k <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>k</mi></math>$ -차의 다항식을 최소자승법으로 찾아서 그 지점에서의 출력값을 결정하는 필터이다. 이 필터는 주어진 데이터에서의 극대나 극소, 또는 봉우리의 폭을 상대적으로 잘 보존한다.(주변 점들에 동등한 가중치를 주는 Moving Average Filter와 비교해 볼 수 있다).

간단한 예로, 2차의 다항식과 5개의 데이터 점

${(- 2, d 0), (- 1, d 1), (0, d 2), (1, d 3), (2, d 4)} <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo fence="false" stretchy="false">{</mo><mo stretchy="false">(</mo><mo>-</mo><mn>2</mn><mo>,</mo><msub><mi>d</mi><mn>0</mn></msub><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mo>-</mo><mn>1</mn><mo>,</mo><msub><mi>d</mi><mn>1</mn></msub><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mn>0</mn><mo>,</mo><msub><mi>d</mi><mn>2</mn></msub><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mn>1</mn><mo>,</mo><msub><mi>d</mi><mn>3</mn></msub><mo stretchy="false">)</mo><mo>,</mo><mo stretchy="false">(</mo><mn>2</mn><mo>,</mo><msub><mi>d</mi><mn>4</mn></msub><mo stretchy="false">)</mo><mo fence="false" stretchy="false">}</mo></math>$

을 이용해서 중앙에서의 값을 결정하는 방법을 살펴보자. 사용하려고 하는 다항식은

$p (x) = a 0 + a 1 x + a 2 x 2 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>p</mi><mo stretchy="false">(</mo><mi>x</mi><mo stretchy="false">)</mo><mo>=</mo><msub><mi>a</mi><mn>0</mn></msub><mo>+</mo><msub><mi>a</mi><mn>1</mn></msub><mi>x</mi><mo>+</mo><msub><mi>a</mi><mn>2</mn></msub><msup><mi>x</mi><mn>2</mn></msup></math>$

이다. 다항식의 계수는 다항식의 값과 실제 데이터의 값과의 차이를 최소화시키도록 선택해야 한다. 즉, 최소자승의 원리를 적용하여서 구하면 된다. 계산된 다항식의 값과 실제 데이터 값 사이의 차의 제곱을 구하면:

$L = | a 0 - 2 a 1 + 4 a 2 - d 0 | 2 + | a 0 - a 1 + a 2 - d | 2 + | a 0 - d 0 | 2 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mi>L</mi><mo>=</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>a</mi><mn>0</mn></msub><mo>-</mo><mn>2</mn><msub><mi>a</mi><mn>1</mn></msub><mo>+</mo><mn>4</mn><msub><mi>a</mi><mn>2</mn></msub><mo>-</mo><msub><mi>d</mi><mn>0</mn></msub><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>+</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>a</mi><mn>0</mn></msub><mo>-</mo><msub><mi>a</mi><mn>1</mn></msub><mo>+</mo><msub><mi>a</mi><mn>2</mn></msub><mo>-</mo><mi>d</mi><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>+</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>a</mi><mn>0</mn></msub><mo>-</mo><msub><mi>d</mi><mn>0</mn></msub><msup><mo stretchy="false">|</mo><mn>2</mn></msup></math>$ $+ | a 0 + a 1 + a 2 - d 3 | 2 + | a 0 + 2 a 1 + 4 a 2 - d 4 | 2 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mo>+</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>a</mi><mn>0</mn></msub><mo>+</mo><msub><mi>a</mi><mn>1</mn></msub><mo>+</mo><msub><mi>a</mi><mn>2</mn></msub><mo>-</mo><msub><mi>d</mi><mn>3</mn></msub><msup><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><mn>2</mn></msup><mo>+</mo><mrow data-mjx-texclass="ORD"><mo stretchy="false">|</mo></mrow><msub><mi>a</mi><mn>0</mn></msub><mo>+</mo><mn>2</mn><msub><mi>a</mi><mn>1</mn></msub><mo>+</mo><mn>4</mn><msub><mi>a</mi><mn>2</mn></msub><mo>-</mo><msub><mi>d</mi><mn>4</mn></msub><msup><mo stretchy="false">|</mo><mn>2</mn></msup></math>$

이 식의 $a 0, a 1, a 2 <math xmlns="http://www.w3.org/1998/Math/MathML"><msub><mi>a</mi><mn>0</mn></msub><mo>,</mo><msub><mi>a</mi><mn>1</mn></msub><mo>,</mo><msub><mi>a</mi><mn>2</mn></msub></math>$ 에 대한 극값을 가질 조건은( $\partial L / \partial a i = 0 <math xmlns="http://www.w3.org/1998/Math/MathML"><mi>\partial</mi><mi>L</mi><mrow data-mjx-texclass="ORD"><mo>/</mo></mrow><mi>\partial</mi><msub><mi>a</mi><mi>i</mi></msub><mo>=</mo><mn>0</mn></math>$ ) $5 a 0 + 10 a 2 = d 0 + d 1 + d 2 + d 3 + d 4 <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mn>5</mn><msub><mi>a</mi><mn>0</mn></msub><mo>+</mo><mn>10</mn><msub><mi>a</mi><mn>2</mn></msub><mo>=</mo><msub><mi>d</mi><mn>0</mn></msub><mo>+</mo><msub><mi>d</mi><mn>1</mn></msub><mo>+</mo><msub><mi>d</mi><mn>2</mn></msub><mo>+</mo><msub><mi>d</mi><mn>3</mn></msub><mo>+</mo><msub><mi>d</mi><mn>4</mn></msub></math>$ $10 a 1 = - 2 d 0 - <math xmlns="http://www.w3.org/1998/Math/MathML" display="block"><mn>10</mn><msub><mi>a</mi><mn>1</mn></msub><mo>=</mo><mo>-</mo><mn>2</mn><msub><mi>d</mi><mn>0</mn></msub><mrow data-mjx-texclass="ORD"><mo>-</mo></mrow><msub><mi>d</mi><mn>1</mn></msub><mo>+</mo><msub><mi>d</mi><mn>3</mn></msub><mo>+</mo><mn>2</mn><msub><mi>d</mi><mn>4</mn></msub></math>$ $10 a_{0} + 34 a_{2} = 4 d_{0} + d_{1} + d_{3} + 4 d_{4}$

이 식을 만족시키는 $a_{0}$ 를 구하면, 필터의 출력(원도 중앙에서 값)이 결정된다.

$필터 출력: a_{0} = (- 3 d_{0} + 12 d_{1} + 17 d_{2} + 12 d_{3} - 3 d_{4}) / 35$

위에서 계수 $a_{0}$ , $a_{1}$ , $a_{2}$ 를 결정하는 방정식은 행렬로 정리하면 아래의 식과 같이 표현할 수 있다.

좌변의 5행 3열 행렬을 $A$ , $a = [a_{0}, a_{1}, a_{2}]^{T}$ , $d = [d_{0}, d_{1}, d_{2}, d_{3}, d_{4}]^{T}$ 로 놓으면, 이 행렬방정식은 $A . a = d$ 형태로 쓸 수 있다. $A$ (design matrix)가 정방행렬이 아니므로 역행렬을 바로 구할 수 없지만, $| A \cdot a - d |^{2}$ 을 최소로 하는 최소제곱해는
$(A^{T} \cdot A) \cdot a = A^{T} \cdot d$ 를 만족시켜야 하므로

$a = (A^{T} \cdot A)^{- 1} \cdot (A^{T} \cdot d)$ 로 주어짐을 알 수 있다.

이 식은 임의의 $k$ -차 다항식을 이용한 경우에도 사용할 수 있다. 이 경우 행렬(scattering matrix) $A^{T} \cdot A$ 는 $(k + 1) \times (k + 1)$ 의 대칭행렬이 된다. 행렬 $A$ 는 다항식의 찻수와 피팅에 사용이 될 데이터의 구간의 크기가 주어지면 정해지므로, 윗 식에서 $(A^{T} \cdot A)^{- 1} \cdot A^{T}$ 의 첫 행 ( $a_{0}$ 을 $d$ 로 표현하는 식의 계수들)을 구하면 코드 내에서 결과를 lookup table로 만들어서 사용할 수 있다. 아래 표는 mathematica 를 이용해서 윈도 크기가 7 (7개 점)인 경우 2차 다항식을 사용할 때 계수를 구하는 과정이다.

2차 다항식일 때, 같은 방식으로 다양한 윈도 크기에 따른 계수를 구할 수 있다.
*크기( $n$ )에 따른 필터값 결정계수 (중앙에 대해 좌우대칭이다);

$\begin{aligned} n = 5; & W [] = {- 3, 12, 17, 12, - 3}; \\ n = 7; & W [] = {- 2, 3, 6, 7, 6, 3, - 2}; \\ n = 9; & W [] = {- 21, 14, 39, 54, 59, 54, 39, 14, - 21}; \\ n = 11; & W [] = {- 36, 9, 44, 69, 84, 89, 84, 69, 44, 9, - 36}; \\ n = 13; & W [] = {- 11, 0, 9, 16, 21, 24, 25, 24, 21, 16, 9, 0, - 11} \\ n = 15; & W [] = {- 78, - 13, 42, 87, 122, 147, 162, 167, 162, 147, 122, 87, 42, - 13, - 78} \\ n = 25; & W [] = {- 253, - 138, - 33, 62, 147, 222, 287, 342, 387, 422, 447, 462, 467, \\ 462, 447, 422, 387, 342, 287, 222, 147, 62, - 33, - 138, - 253} \end{aligned}$

$필터 출력 = \frac{\sum_{i} W [i] d [i]}{\sum_{i} W [i]}$

std::vector<double> SavitzkyGolayFilter(std::vector<double>& data, double W[], int wsz) {
    const int hwsz = wsz >> 1;
    wsz = (hwsz<<1) + 1;
    std::vector<double> padded(data.size() + 2 * hwsz);
    // reflective boundary conditions;   
    // [hw]...[1][0][1][2]...[hw]....[n-1+hw][n-2+hw][n-3+hw]....[n-1];
    for (int i = 0; i < hwsz; i++) { 
        padded[i]                  = data[hwsz-i];
        padded[i+data.size()+hwsz] = data[data.size()-2-i];
    }
    for (int i = data.size(); i-->0;) padded[i+hwsz] = data[i];
    //
    std::vector<double> smoothed(data.size());
    double wsum = 0;
    for (int i = 0; i < wsz; i++) wsum += W[i]; 
    for (int i = data.size(); i-->0;) {
        double *ppad = &padded[i];
        double fsum = 0;
        for (int k = 0; k < wsz; k++) fsum += ppad[k] * W[k];
        smoothed[i] = fsum / wsum;
    }
    return smoothed;
};

$f (x) = 1 - (| x | + 0.01)^{0.01} + N (0, 0.002), - 2 \leq x \leq 2$

일반적인 찻수(order)에 대한 필터 계수 계산;

std::vector<double> sgFilterCoeffs(int width, int order/* = 2*/) {
    // ASSERT(width >= order+1);
    int hwidth = width >> 1;
    width = (hwidth << 1) + 1; // width = odd number;
    // design matrix;
    std::vector<double> D((order+1) * width);
    for (int i = 0; i < width; i++) {
        double x = double(i) - hwidth;  // -hwidth <= x <= hwidth;
        double *pD = &D[i*(order+1)];
        pD[0] = 1;
        for (int k = 1; k <= order; k++) 
            pD[k] = pD[k-1] * x;
    };
    // scattering matrix;
    std::vector<double> S((order+1)*(order+1)); // S=~D.D;
    for (int i = 0; i <= order; i++)
        for (int j = i; j <= order; j++) {
            double s = 0;
            for (int k = 0; k < width; k++)
                s += D[k*(order+1) + i] * D[k*(order+1) + j];
            S[i*(order+1) + j] = s;
        }
    for (int i = 0; i <= order; i++) 
        for (int j = 0; j < i; j++) 
            S[i*(order+1) + j]= S[j*(order+1) + i];
			
    // ccmath library;
    psinv(&S[0], order+1);
    // filter coeffs = 0-th row fo S.~D;
    std::vector<double> filter;
    for (int i = 0; i < width; i++) {
        double s = 0;
        for (int k = 0; k <= order; k++)
            s += S[k] * D[i*(order+1) + k];
        filter.push_back(s);
    }
    return filter;
}

저작자표시 비영리 변경금지

'Image Recognition' 카테고리의 다른 글

Adaboost (0)	2010.12.28
Blur Detection (0)	2010.05.25
Watershed Algorithm 구현 (0)	2010.03.19
Retinex 알고리즘 (11)	2010.02.03
Gaussian Mixture Model & KMeans (4)	2010.01.30

Geometry & Recognition 알고리즘,계산기하,물리학,...

Affine Transformation

Image Recognition 2010. 1. 20. 21:05

물체의 형상은 폴리곤이나 폴리곤의 집합으로 근사적으로 표현할 수 있다. 예를 들면 snake나 active shape model (ASM) 등에서 손 모양이나 얼굴의 윤곽, 또는 의료 영상 등에서 장기의 모양 등을 표현할 때 사용이 된다. 이러한 응용에서 주어진 형상을 기준으로 주어진 형상에 정렬을 시켜야 필요가 생긴다. 일반적으로 카메라를 써서 얻은 각 영상에서 추출한 정보들 사이에는 서로 사영 변환의 관계로 연결된다. 그러나 많은 경우에는 in-plane 변형만 고려해도 충분할 때가 많다. 이 경우에 가장 일반적인 형상의 변형은 affine 변환으로 표현된다. 회전(rotation), 평행 이동(translation), 크기 변환(scale transformation) 그리고 층 밀림(shear)을 허용하는 변환이다. 물론, 간단한 경우로는 shear를 제외할 수도 있고 (similarity transformation), 더 간단하게는 크기 변환을 제외할 수도 있다 (isometric transformation).

$N$ 개의 꼭짓점을 갖는 두 개의 형상 $S = {(x_{1}, y_{1}), (x_{2}, y_{2}), . . ., (x_{N}, y_{N})}$ , $S^{'} = {(x_{1}^{'}, y_{1}^{'}), (x_{2}^{'}, y_{2}^{'}), . . ., (x_{N}^{'}, y_{N}^{'})}$ 이 affine 변환에 의해서 연결이 되는 경우에 각 꼭짓점 사이의 관계는

$\begin{aligned} x_{i}^{'} & = a x_{i} + b y_{i} + t_{x} \\ y_{i}^{'} & = c x_{i} + d y_{i} + t_{y}, (i = 1, 2, . . ., N); \end{aligned}$

의 6개의 매개변수 $(a, b, c, d, t_{x}, t_{y})$ 에 의해서 기술이 된다(평행 이동: $x / y$ 축 방향 2개, 회전: 1개, shear: 1개, 스케일: $x / y$ 축 방향 2개). Affine 변환에 의해서 평행인 두 직선은 변환 후에도 평행인 관계를 유지한다.

꼭짓점 위치는 실제로 다양한 영상처리 과정에 의해서 얻어지므로 필연적으로 노이즈를 포함하게 되어서 일종의 랜덤 변수로 생각해야 한다. 주어진 랜덤 변수에서 최적으로 매개변수를 추출하기 위해 최소자승법을 이용한다. Affine 변환된 좌표와 실제 측정된 좌표 사이의 거리 차이를 최소화하는 매개변수를 찾도록 하자:

$L = \sum_{i} | x_{i}^{'} - a x_{i} - b y_{i} - t_{x} |^{2} + | y_{i}^{'} - c x_{i} - d y_{i} - t_{y} |^{2}$

Affine변환을 규정하는 매개변수를 구하기 위해서는 L을 각 매개변수에 대해서 미분해서 극값을 가질 조건을 구하면 된다:

        ∂L/∂a = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) * x_i ;
        ∂L/∂b = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) * y_i ;
        ∂L/∂c = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y) * x_i ;
        ∂L/∂d = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y) * y_i ;
        ∂L/∂t_x = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) ;
        ∂L/∂t_y = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y);

각 식을 0으로 놓아서 얻어지는 연립방정식을 행렬식으로 다시 정리하면,

$[\begin{array}{ccc} S_{x x} & S_{x y} & S_{x} \\ S_{x y} & S_{y y} & S_{y} \\ S_{x} & S_{y} & N \end{array}] [\begin{array}{ll} a & c \\ b & d \\ t_{x} & t_{y} \end{array}] = [\begin{array}{cc} S_{x x^{'}} & S_{x y^{'}} \\ S_{y x^{'}} & S_{y y^{'}} \\ S_{x^{'}} & S_{y^{'}} \end{array}]$

여기서,
$\begin{aligned} S_{x x} = \sum x^{2}, S_{y y} = \sum y^{2}, S_{x y} = \sum x y, \\ S_{x} = \sum x, S_{y} = \sum y, S_{x^{'}} = \sum x^{'}, S_{y^{'}} = \sum y^{'} \\ S_{x x^{'}} = \sum x x^{'}, S_{x y^{'}} = \sum x y^{'}, S_{y x^{'}} = \sum y x^{'} \end{aligned}$ 이다.

// dst = (A,T)src;
//  [u]  = [ A0 A1 ][x] + A4
//  [v]  = [ A2 A3 ][y] + A5
//
BOOL GetAffineParameter(const std::vector<CPoint> &srcPts, 
                        const std::vector<CPoint> &dstPts, 
                        double AT[6]) 
{
    double Sx, Sy, Sxx, Sxy, Syy;
    double Su, Sv, Sxu, Sxv, Syu, Syv ;
    double A[9], invA[9];
    Sx = Sy = Sxx = Sxy = Syy = 0;
    Su = Sv = Sxu = Sxv = Syu = Syv = 0;
    for (int i = srcPts.size(); i-->0;) {
        double x = srcPts[i].x, y = srcPts[i].y ;
        double u = dstPts[i].x, v = dstPts[i].y ;
        Sx += x;        Sy += y ;
        Sxx += (x * x); Sxy += (x * y); Syy += (y * y);
        Su += u;        Sv += v ;
        Sxu += (x * u); Sxv += (x * v); Syu += (y * u); Syv += (y * v);
    }
    A[0] = Sxx; A[1] = Sxy; A[2] = Sx;
    A[3] = Sxy; A[4] = Syy; A[5] = Sy;
    A[6] = Sx ; A[7] = Sy ; A[8] = srcPts.size() ;
    double det = (A[0]*(A[4]*A[8]-A[5]*A[7])-\
                  A[1]*(A[3]*A[8]-A[5]*A[6])+\
                  A[2]*(A[3]*A[7]-A[4]*A[6]));
    if (det != 0.) {
        det = 1. / det; 
        invA[0] = (A[4]*A[8] - A[5]*A[7]) * det;
        invA[1] = (A[2]*A[7] - A[1]*A[8]) * det;
        invA[2] = (A[1]*A[5] - A[2]*A[4]) * det;
        invA[3] = (A[5]*A[6] - A[3]*A[8]) * det;
        invA[4] = (A[0]*A[8] - A[2]*A[6]) * det;
        invA[5] = (A[2]*A[3] - A[0]*A[5]) * det;
        invA[6] = (A[3]*A[7] - A[4]*A[6]) * det;
        invA[7] = (A[1]*A[6] - A[0]*A[7]) * det;
        invA[8] = (A[0]*A[4] - A[1]*A[3]) * det;
    }
    else return FALSE;

    AT[0] = invA[0] * Sxu + invA[1] * Syu + invA[2] * Su;
    AT[1] = invA[3] * Sxu + invA[4] * Syu + invA[5] * Su;
    AT[4] = invA[6] * Sxu + invA[7] * Syu + invA[8] * Su;
    AT[2] = invA[0] * Sxv + invA[1] * Syv + invA[2] * Sv;
    AT[3] = invA[3] * Sxv + invA[4] * Syv + invA[5] * Sv;
    AT[5] = invA[6] * Sxv + invA[7] * Syv + invA[8] * Sv;
    return TRUE ;
};

아래의 그림은 지문에서 얻은 특징점을 가지고 변환을 한 것이다. 밑에 그림이 기준 template (붉은 점)이고 윗 그림은 이 기준 template와 입력된 지문의 특징점(노란 점+ 녹색점) 사이에 서로 메칭이 되는 특징점(노란색)을 찾고, 그것을 기준으로 두 지문 영상 간의 affine 파라미터를 찾아서 기준 template을 변환시킨 것이다. 이렇게 하면 새로 찾은 특징점 중에서 기준 template에 없는 특징점(녹색점)을 발견할 수 있고, 이 특징점을 기준 template에 추가하여서 좀 더 넓은 범위를 커버할 수 있는 template을 만들 수 있다. 물론 추가된 녹색점이 신뢰할 수 있는 것인가에 대한 판단을 하기 위해서는 추가적인 정보가 더 요구된다.

저작자표시 비영리 변경금지

'Image Recognition' 카테고리의 다른 글

Image Morphing (0)	2010.01.24
Fant's Algorithm (0)	2010.01.22
Color Counting (0)	2010.01.18
Isometric Transformation (0)	2010.01.11
Active Shape Model (3) (0)	2009.12.30

Geometry & Recognition 알고리즘,계산기하,물리학,...

이전 1 2 3 4 5 다음

내 블로그 - 관리자 홈 전환	`Q` `Q`
새 글 쓰기	`W` `W`

글 수정 (권한 있는 경우)	`E` `E`
댓글 영역으로 이동	`C` `C`

이 페이지의 URL 복사	`S` `S`
맨 위로 이동	`T` `T`
티스토리 홈 이동	`H` `H`
단축키 안내	`Shift` + `/` `⇧` + `/`

Geometry & Recognition

Least Squares Estimation of Perspective Transformation

'Image Recognition' 카테고리의 다른 글

Savitzky-Golay Smoothing Filter

'Image Recognition' 카테고리의 다른 글

Affine Transformation

'Image Recognition' 카테고리의 다른 글

카테고리

태그목록

최근에 올라온 글

최근에 달린 댓글

글 보관함

티스토리툴바

개인정보

단축키

내 블로그

블로그 게시글

모든 영역