
Isometric Transformation

Image Recognition 2010. 1. 11. 16:07

When the origin of a reference coordinate system is shifted and its axes are rotated, the coordinates of a point change in the new system. The transformation that relates the original coordinates to the new ones is the isometric transformation (isometry). In the plane, it is determined by two translation parameters and one rotation angle. If an isometry with rotation angle $\theta$ and translation $(t_x, t_y)$ maps the point $(x, y)$ to $(u, v)$, the relation is:

$$u = \cos(\theta)\, x - \sin(\theta)\, y + t_x$$

$$v = \sin(\theta)\, x + \cos(\theta)\, y + t_y$$
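
The forward mapping can be sketched directly in code. This is only an illustration: `Pt` and `applyIsometry` are hypothetical names that do not appear in the original post, and the angle is assumed to be in radians.

```cpp
#include <cmath>

// Hypothetical point type for this sketch (not from the original post).
struct Pt { double x, y; };

// Map (x, y) to (u, v): rotate by theta (radians), then translate by (tx, ty).
Pt applyIsometry(Pt p, double theta, double tx, double ty) {
    const double c = std::cos(theta), s = std::sin(theta);
    return { c * p.x - s * p.y + tx,
             s * p.x + c * p.y + ty };
}
```

For instance, rotating $(1, 0)$ by a quarter turn and shifting by $(2, 3)$ lands on $(2, 4)$ up to rounding.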

Thus two point correspondences $\{(x_1, y_1) \to (u_1, v_1),\ (x_2, y_2) \to (u_2, v_2)\}$ are enough to determine these three parameters exactly. In practice, however, the measured point positions are inevitably affected by noise, so in general no single transformation maps every given point exactly onto its counterpart. In that case the isometry parameters are estimated by least squares.
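
For the noise-free two-point case, a minimal sketch might look like the following. It assumes the two pairs are exactly related by an isometry (so the segment lengths agree) and that the two source points are distinct; `Pt`, `Isometry`, and `isometryFromTwoPairs` are hypothetical names introduced here.

```cpp
#include <cmath>

struct Pt { double x, y; };                 // hypothetical point type
struct Isometry { double theta, tx, ty; };  // hypothetical result type

// p1 -> q1 and p2 -> q2 are assumed to be exact correspondences.
Isometry isometryFromTwoPairs(Pt p1, Pt p2, Pt q1, Pt q2) {
    // The rotation angle is the angle between segment p1->p2 and segment q1->q2.
    const double theta = std::atan2(q2.y - q1.y, q2.x - q1.x)
                       - std::atan2(p2.y - p1.y, p2.x - p1.x);
    const double c = std::cos(theta), s = std::sin(theta);
    // The translation follows from requiring p1 to land on q1.
    return { theta,
             q1.x - (c * p1.x - s * p1.y),
             q1.y - (s * p1.x + c * p1.y) };
}
```

The returned angle is a difference of two `atan2` values, so it may lie outside $(-\pi, \pi]$; it still represents the same rotation.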

To apply least squares, it is more convenient to write the equations in terms of the new parameters $a = \cos\theta$ and $b = \sin\theta$ than in terms of the rotation angle $\theta$ itself. These parameters are not independent, however: they must satisfy the constraint $a^2 + b^2 = 1$.

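The translation parameters are settled by the isometry relation between the two centers of mass. If the whole computation is carried out in coordinates whose origin is the center of mass of each point set, the translation drops out and only the rotation has to be computed.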
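
Why centering removes the translation can be seen in one step. The following is a sketch in which $L$ temporarily includes the translation terms, and $(x_c, y_c)$ and $(u_c, v_c)$ denote the two centers of mass:

$$L = \sum_i \left[ (a x_i - b y_i + t_x - u_i)^2 + (b x_i + a y_i + t_y - v_i)^2 \right]$$

$$\frac{\partial L}{\partial t_x} = 0 \;\Rightarrow\; t_x = u_c - (a x_c - b y_c), \qquad \frac{\partial L}{\partial t_y} = 0 \;\Rightarrow\; t_y = v_c - (b x_c + a y_c)$$

$$a x_i - b y_i + t_x - u_i = a (x_i - x_c) - b (y_i - y_c) - (u_i - u_c)$$

The second residual reduces in the same way, so only the centered coordinates and the rotation remain.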

Following the least-squares principle, we look for the parameters $a, b, \lambda$ that minimize, subject to the constraint, the sum $L$ of squared distances between the isometry image of each input point and its corresponding point:

$$L = \sum_i \left[ (a x_i - b y_i - u_i)^2 + (b x_i + a y_i - v_i)^2 \right] + \lambda (a^2 + b^2 - 1)$$

Here $\lambda$ is the Lagrange multiplier that enforces the constraint $a^2 + b^2 = 1$. To find the extremum, differentiate $L$ with respect to $a$, $b$, and $\lambda$, which gives the conditions:

$$\sum_i \left[ (a x_i - b y_i - u_i)\, x_i + (b x_i + a y_i - v_i)\, y_i \right] + \lambda a = 0$$

$$\sum_i \left[ (a x_i - b y_i - u_i)(-y_i) + (b x_i + a y_i - v_i)\, x_i \right] + \lambda b = 0$$

$$a^2 + b^2 = 1$$

Solving these equations for $a, b, \lambda$ gives:

$$a = \frac{\sum_i (x_i u_i + y_i v_i)}{\sum_i (x_i^2 + y_i^2) + \lambda}$$

$$b = \frac{\sum_i (x_i v_i - y_i u_i)}{\sum_i (x_i^2 + y_i^2) + \lambda}$$

The Lagrange multiplier $\lambda$ is fixed as follows. Setting

$$A = \sum_i (x_i u_i + y_i v_i)$$

$$B = \sum_i (x_i v_i - y_i u_i)$$

the constraint $a^2 + b^2 = 1$ gives

$$\sum_i (x_i^2 + y_i^2) + \lambda = \sqrt{A^2 + B^2}$$

The rotation angle is therefore given by

$$\cos\theta = a = \frac{A}{\sqrt{A^2 + B^2}}$$

$$\sin\theta = b = \frac{B}{\sqrt{A^2 + B^2}}$$
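
As a check on the choice of the positive square root, the cost can be expanded on the constraint $a^2 + b^2 = 1$. This is a short side derivation using the same $A$ and $B$:

$$\sum_i \left[ (a x_i - b y_i - u_i)^2 + (b x_i + a y_i - v_i)^2 \right] = \sum_i (x_i^2 + y_i^2) + \sum_i (u_i^2 + v_i^2) - 2 (a A + b B)$$

The sum is smallest when $aA + bB$ is largest, which happens when $(a, b)$ points along $(A, B)$; equivalently, $\theta = \operatorname{atan2}(B, A)$. The negative root would pick out the maximum instead.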

The center of mass of the original (uncentered) coordinates $(x, y)$ and that of $(u, v)$ are related by the same isometry, and this relation determines the translation parameters $(t_x, t_y)$:

$$(x_c, y_c) \to (u_c, v_c)$$

$$u_c = a x_c - b y_c + t_x$$

$$v_c = b x_c + a y_c + t_y$$

Note:
** affine transformation = similarity transformation + shear;
** similarity transformation = isometry transformation + overall scaling;

/* struct CfPt { double x, y;};
*      u = T[0] * x + T[1] * y +T[4] ;
*      v = T[2] * x + T[3] * y + T[5] ; 
*/
BOOL IsometryTransform(std::vector<CfPt> &A, std::vector<CfPt> &U, double T[6]) {
    // A and U must be non-empty and hold the same number of points.
    if (A.empty() || A.size() != U.size()) return FALSE;
    double cx = 0, cy = 0;
    double ux = 0, uy = 0;
    for (int i = A.size(); i-->0;) {
        cx += A[i].x ;  cy += A[i].y ;
        ux += U[i].x ;  uy += U[i].y ;
    };
    //center of mass ;
    cx /= A.size(); cy /= A.size();
    ux /= A.size(); uy /= A.size();

    // compute in the centered coordinate system;
    double dot = 0 , cross = 0;
    for (int i = A.size(); i-->0;) {
        double x = A[i].x - cx, y = A[i].y - cy;
        double u = U[i].x - ux, v = U[i].y - uy;
        dot += (x * u + y * v);
        cross += ( x * v - y * u) ;
    };
    double norm = sqrt(dot * dot + cross * cross) ;
    if (norm == 0) return FALSE;  // rotation angle is undetermined
    double a = dot / norm ;
    double b = cross / norm ;

    T[0] = a ; T[1] = -b ; T[2] = b; T[3] = a; 
    T[4] = ux - (a * cx - b * cy) ;
    T[5] = uy - (b * cx + a * cy) ;  // v_c = b * x_c + a * y_c + t_y
    return TRUE;
} ;
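
For readers without the Windows headers, the routine above can be restated in portable C++. This is a sketch rather than the author's code: `Pt` and `fitIsometry` are hypothetical names, `std::array` replaces the raw `T[6]` output, and `bool` replaces `BOOL`.

```cpp
#include <array>
#include <cmath>
#include <cstddef>
#include <vector>

struct Pt { double x, y; };  // hypothetical stand-in for CfPt

// Least-squares isometry dst ~ R(theta) * src + t.
// On success: u = T[0]*x + T[1]*y + T[4],  v = T[2]*x + T[3]*y + T[5].
bool fitIsometry(const std::vector<Pt>& src, const std::vector<Pt>& dst,
                 std::array<double, 6>& T) {
    const std::size_t n = src.size();
    if (n == 0 || dst.size() != n) return false;

    // Centers of mass of both point sets.
    double cx = 0, cy = 0, ux = 0, uy = 0;
    for (std::size_t i = 0; i < n; ++i) {
        cx += src[i].x; cy += src[i].y;
        ux += dst[i].x; uy += dst[i].y;
    }
    cx /= n; cy /= n; ux /= n; uy /= n;

    // A and B from the derivation, accumulated in centered coordinates.
    double A = 0, B = 0;
    for (std::size_t i = 0; i < n; ++i) {
        const double x = src[i].x - cx, y = src[i].y - cy;
        const double u = dst[i].x - ux, v = dst[i].y - uy;
        A += x * u + y * v;
        B += x * v - y * u;
    }
    const double norm = std::hypot(A, B);
    if (norm == 0) return false;  // rotation angle is undetermined
    const double a = A / norm, b = B / norm;

    T = { a, -b, b, a,
          ux - (a * cx - b * cy),
          uy - (b * cx + a * cy) };
    return true;
}
```

If the angle itself is needed, it can be read back as `std::atan2(T[2], T[0])`.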

License: Attribution, NonCommercial, NoDerivatives

Similarity Transformation

Image Recognition 2009. 12. 14. 20:04

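Among the geometric transformations of a 2D image, the similarity transformation is the one that combines translation, rotation, and a change of overall size. It preserves the angle between two lines and keeps ratios of lengths unchanged, so an object has the same shape after a similarity transformation as before. A more general 2D geometric transformation is the affine transformation, which additionally allows shear along one direction. Two parallel lines remain parallel after an affine transformation.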

A similarity transformation needs four parameters in total: one scale parameter ($s$) that changes the overall size, one rotation angle ($\theta$), and two parameters ($t_x$, $t_y$) for the translation along the $x$ and $y$ axes. If these parameters map the pixel $(x, y)$ of the source image to the pixel $(u, v)$ of the transformed image, the relation between them is:

$$u = s\cos(\theta)\, x - s\sin(\theta)\, y + t_x$$

$$v = s\sin(\theta)\, x + s\cos(\theta)\, y + t_y$$

Hence the parameters $(s, \theta, t_x, t_y)$ are determined uniquely once the correspondences for two points of the source image are given:

$$(x_1, y_1) \to (u_1, v_1)$$

$$(x_2, y_2) \to (u_2, v_2)$$
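
One compact way to carry out the two-point solve is to treat each point as a complex number, $z = x + iy$ and $w = u + iv$, so that the transformation reads $w = m z + t$ with $m = s\cos\theta + i\, s\sin\theta$. The sketch below takes that route; it is not from the original post, and `Similarity` and `similarityFromTwoPairs` are hypothetical names.

```cpp
#include <complex>

// a = s*cos(theta), b = s*sin(theta); (tx, ty) is the translation.
struct Similarity { double a, b, tx, ty; };  // hypothetical result type

// z1 -> w1 and z2 -> w2 are the two correspondences, written as complex numbers.
bool similarityFromTwoPairs(std::complex<double> z1, std::complex<double> z2,
                            std::complex<double> w1, std::complex<double> w2,
                            Similarity& out) {
    if (z1 == z2) return false;                       // source points must be distinct
    const std::complex<double> m = (w2 - w1) / (z2 - z1);  // scale and rotation
    const std::complex<double> t = w1 - m * z1;            // translation
    out = { m.real(), m.imag(), t.real(), t.imag() };
    return true;
}
```

The scale and angle can be recovered afterwards as `std::hypot(out.a, out.b)` and `std::atan2(out.b, out.a)`.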

In many cases, however, correspondences are supplied for $N\ (\geq 2)$ points of the source image to allow for errors in locating the reference points. The transformation equations are then overdetermined and may have no exact solution. In that case, least squares is used to find the parameters that minimize the difference between the given target points and the values predicted by the transformation.

$$L = \sum_i \left[ \left| u_i - (s\cos(\theta)\, x_i - s\sin(\theta)\, y_i + t_x) \right|^2 + \left| v_i - (s\sin(\theta)\, x_i + s\cos(\theta)\, y_i + t_y) \right|^2 \right]$$

$$(s, \theta, t_x, t_y) = \operatorname{argmin}(L)$$
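
To make the cost concrete, it can be evaluated for any trial parameters. The helper below is only an illustrative sketch; `Pt` and `similarityCost` are hypothetical names, and only index pairs present in both vectors are summed.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

struct Pt { double x, y; };  // hypothetical point type

// Sum of squared residuals L for the similarity (s, theta, tx, ty).
double similarityCost(const std::vector<Pt>& src, const std::vector<Pt>& dst,
                      double s, double theta, double tx, double ty) {
    const double a = s * std::cos(theta), b = s * std::sin(theta);
    double L = 0;
    for (std::size_t i = 0; i < src.size() && i < dst.size(); ++i) {
        const double du = dst[i].x - (a * src[i].x - b * src[i].y + tx);
        const double dv = dst[i].y - (b * src[i].x + a * src[i].y + ty);
        L += du * du + dv * dv;
    }
    return L;
}
```

A perfect fit gives a cost of zero; any other parameter choice gives a larger value.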

With $a = s\cos(\theta)$ and $b = s\sin(\theta)$, the minimizing parameters follow from the stationarity conditions with respect to $a, b, t_x, t_y$:

$$\frac{\partial L}{\partial a} = 0: \quad \sum_i \left[ (u_i - (a x_i - b y_i + t_x))(-x_i) + (v_i - (b x_i + a y_i + t_y))(-y_i) \right] = 0$$

$$\frac{\partial L}{\partial b} = 0: \quad \sum_i \left[ (u_i - (a x_i - b y_i + t_x))(y_i) + (v_i - (b x_i + a y_i + t_y))(-x_i) \right] = 0$$

$$\frac{\partial L}{\partial t_x} = 0: \quad \sum_i (u_i - (a x_i - b y_i + t_x)) = 0$$

$$\frac{\partial L}{\partial t_y} = 0: \quad \sum_i (v_i - (b x_i + a y_i + t_y)) = 0$$

Therefore, with the sums $S_u = \sum_i u_i$, $S_v = \sum_i v_i$, $S_{ux} = \sum_i u_i x_i$, $S_{uy} = \sum_i u_i y_i$, $S_{vx} = \sum_i v_i x_i$, $S_{vy} = \sum_i v_i y_i$, $S_x = \sum_i x_i$, $S_y = \sum_i y_i$, $S_{xx} = \sum_i x_i^2$, $S_{xy} = \sum_i x_i y_i$, $S_{yy} = \sum_i y_i^2$, the conditions become:

$$-S_{ux} + a S_{xx} + t_x S_x - S_{vy} + a S_{yy} + t_y S_y = 0$$

$$S_{uy} + b S_{yy} - t_x S_y - S_{vx} + b S_{xx} + t_y S_x = 0$$

$$S_u - a S_x + b S_y - t_x N = 0$$

$$S_v - b S_x - a S_y - t_y N = 0$$

These four equations form a linear system in $(a, b, t_x, t_y)$, which can be solved directly or written in matrix form as:

$$\mathbf{A} \cdot \mathbf{x} = \mathbf{b}$$
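
Written out, one consistent choice of $\mathbf{A}$, $\mathbf{x}$, and $\mathbf{b}$ is shown here. The row order (the two translation equations first) is an assumption, picked so that it agrees with the inverse matrix and the right-hand side used in the code of this post:

$$\begin{bmatrix} S_x & -S_y & N & 0 \\ S_y & S_x & 0 & N \\ S_{xx} + S_{yy} & 0 & S_x & S_y \\ 0 & S_{xx} + S_{yy} & -S_y & S_x \end{bmatrix} \begin{bmatrix} a \\ b \\ t_x \\ t_y \end{bmatrix} = \begin{bmatrix} S_u \\ S_v \\ S_{ux} + S_{vy} \\ S_{vx} - S_{uy} \end{bmatrix}$$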

The inverse of the $4 \times 4$ matrix $\mathbf{A}$ is easy to obtain:

$$\mathbf{A}^{-1} = \frac{1}{S_x^2 + S_y^2 - N (S_{xx} + S_{yy})} \begin{bmatrix} S_x & S_y & -N & 0 \\ -S_y & S_x & 0 & -N \\ -(S_{xx} + S_{yy}) & 0 & S_x & -S_y \\ 0 & -(S_{xx} + S_{yy}) & S_y & S_x \end{bmatrix}$$

The code below implements this. When $N = 2$, of course, the parameters are determined uniquely and are given by an even simpler formula.

// dst = (S|T)(src)
BOOL SimilarTransParams(std::vector<CPoint>& src, std::vector<CPoint>& dst, double ST[4]) {
    double Sx = 0, Sy = 0, Sxx = 0, Syy = 0;
    double Su = 0, Sv = 0, Sxu = 0, Sxv = 0, Syu = 0, Syv = 0;
    for (int i = src.size(); i-->0;) {
        double x = src[i].x, y = src[i].y;
        double u = dst[i].x, v = dst[i].y;
        Sx  += x;        Sy  += y;
        Sxx += (x * x);  Syy += (y * y);
        Su  += u;        Sv  += v;
        Sxu += (x * u);  Syv += (y * v);
        Sxv += (x * v);  Syu += (y * u);  // needed for b[3] below
    }
    double Z = Sxx + Syy;
    double denorm = Sx * Sx + Sy * Sy - src.size() * Z;
    // det = (denorm)^2;
    if (denorm == 0) return FALSE;
    invA[16] = { Sx, Sy, -src.size(),           0,
                -Sy, Sx,           0, -src.size(),
                -Z,   0,          Sx,         -Sy,
                 0,  -Z,          Sy,          Sx};
    for (int i = 0; i < 16; i++) invA[i] /= denorm;
    //
    double b[4] = {Su, Sv, Sxu + Syv, Sxv - Syu};
    for (int i = 0; i < 4; i++) {
    	double s = 0;
        for (int j = 0; j < 4; j++) 
            s += invA[i * 4 + j] * b[j];
        ST[i] = s;
    }
    return TRUE ;
};
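
Since the least-squares solution generally does not map every point exactly, it can help to check how well the estimated parameters fit. The following is a minimal sketch, not part of the original code: `Pt` is a hypothetical stand-in for `CPoint`, `SimilarityRmsError` is a name chosen for illustration, and the parameter order `ST = {a, b, tx, ty}` with u = a*x - b*y + tx, v = b*x + a*y + ty is an assumption read off from the solver above.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for CPoint so the sketch compiles without MFC.
struct Pt { double x, y; };

// RMS distance between dst[i] and the transformed src[i], assuming
// ST = {a, b, tx, ty} with u = a*x - b*y + tx, v = b*x + a*y + ty.
// Returns -1 when the inputs are empty or differ in size.
double SimilarityRmsError(const std::vector<Pt>& src,
                          const std::vector<Pt>& dst,
                          const double ST[4]) {
    if (src.empty() || src.size() != dst.size()) return -1.0;
    double sum = 0;
    for (std::size_t i = 0; i < src.size(); i++) {
        double u = ST[0] * src[i].x - ST[1] * src[i].y + ST[2];
        double v = ST[1] * src[i].x + ST[0] * src[i].y + ST[3];
        double du = u - dst[i].x, dv = v - dst[i].y;
        sum += du * du + dv * dv;   // squared residual of one pair
    }
    return std::sqrt(sum / double(src.size()));
}
```

A value that is large compared with the expected localization noise suggests mismatched pairs or a transform that is not a similarity.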

InvertMatrix4x4_d() computes the inverse of a 4x4 matrix (taken from OpenCV).

Attached file: similarity_trans.nb (0.01MB)

BOOL InvertMatrix4x4_d(double* srcMatr, double* dstMatr) {
    double di = srcMatr[0];
    double d = 1.0 / di;

    dstMatr[0] = d;
    dstMatr[4] = srcMatr[4] * -d;
    dstMatr[8] = srcMatr[8] * -d;
    dstMatr[12] = srcMatr[12] * -d;
    dstMatr[1] = srcMatr[1] * d;
    dstMatr[2] = srcMatr[2] * d;
    dstMatr[3] = srcMatr[3] * d;
    dstMatr[5] = srcMatr[5] + dstMatr[4] * dstMatr[1] * di;
    dstMatr[6] = srcMatr[6] + dstMatr[4] * dstMatr[2] * di;
    dstMatr[7] = srcMatr[7] + dstMatr[4] * dstMatr[3] * di;
    dstMatr[9] = srcMatr[9] + dstMatr[8] * dstMatr[1] * di;
    dstMatr[10] = srcMatr[10] + dstMatr[8] * dstMatr[2] * di;
    dstMatr[11] = srcMatr[11] + dstMatr[8] * dstMatr[3] * di;
    dstMatr[13] = srcMatr[13] + dstMatr[12] * dstMatr[1] * di;
    dstMatr[14] = srcMatr[14] + dstMatr[12] * dstMatr[2] * di;
    dstMatr[15] = srcMatr[15] + dstMatr[12] * dstMatr[3] * di;
    di = dstMatr[5];
    dstMatr[5] = d = 1.0 / di;
    dstMatr[1] *= -d;
    dstMatr[9] *= -d;
    dstMatr[13] *= -d;
    dstMatr[4] *= d;
    dstMatr[6] *= d;
    dstMatr[7] *= d;
    dstMatr[0] += dstMatr[1] * dstMatr[4] * di;
    dstMatr[2] += dstMatr[1] * dstMatr[6] * di;
    dstMatr[3] += dstMatr[1] * dstMatr[7] * di;
    dstMatr[8] += dstMatr[9] * dstMatr[4] * di;
    dstMatr[10] += dstMatr[9] * dstMatr[6] * di;
    dstMatr[11] += dstMatr[9] * dstMatr[7] * di;
    dstMatr[12] += dstMatr[13] * dstMatr[4] * di;
    dstMatr[14] += dstMatr[13] * dstMatr[6] * di;
    dstMatr[15] += dstMatr[13] * dstMatr[7] * di;
    di = dstMatr[10];
    dstMatr[10] = d = 1.0 / di;
    dstMatr[2] *= -d;
    dstMatr[6] *= -d;
    dstMatr[14] *= -d;
    dstMatr[8] *= d;
    dstMatr[9] *= d;
    dstMatr[11] *= d;
    dstMatr[0] += dstMatr[2] * dstMatr[8] * di;
    dstMatr[1] += dstMatr[2] * dstMatr[9] * di;
    dstMatr[3] += dstMatr[2] * dstMatr[11] * di;
    dstMatr[4] += dstMatr[6] * dstMatr[8] * di;
    dstMatr[5] += dstMatr[6] * dstMatr[9] * di;
    dstMatr[7] += dstMatr[6] * dstMatr[11] * di;
    dstMatr[12] += dstMatr[14] * dstMatr[8] * di;
    dstMatr[13] += dstMatr[14] * dstMatr[9] * di;
    dstMatr[15] += dstMatr[14] * dstMatr[11] * di;
    di = dstMatr[15];
    dstMatr[15] = d = 1.0 / di;
    dstMatr[3] *= -d;
    dstMatr[7] *= -d;
    dstMatr[11] *= -d;
    dstMatr[12] *= d;
    dstMatr[13] *= d;
    dstMatr[14] *= d;
    dstMatr[0] += dstMatr[3] * dstMatr[12] * di;
    dstMatr[1] += dstMatr[3] * dstMatr[13] * di;
    dstMatr[2] += dstMatr[3] * dstMatr[14] * di;
    dstMatr[4] += dstMatr[7] * dstMatr[12] * di;
    dstMatr[5] += dstMatr[7] * dstMatr[13] * di;
    dstMatr[6] += dstMatr[7] * dstMatr[14] * di;
    dstMatr[8] += dstMatr[11] * dstMatr[12] * di;
    dstMatr[9] += dstMatr[11] * dstMatr[13] * di;
    dstMatr[10] += dstMatr[11] * dstMatr[14] * di;
    return TRUE;
}

When only two point correspondences are given, $(x_1, y_1), (x_2, y_2) \to (u_1, v_1), (u_2, v_2)$:

bool SimilarTransParams(double x1, double y1, double x2, double y2, 
                        double u1, double v1, double u2, double v2,
                        double ST[4]) {
    double x21 = x2 - x1, y21 = y2 - y1;
    double u21 = u2 - u1, v21 = v2 - v1;
    double det = x21 * x21 + y21 * y21;
    if (det == 0.) return false;
    double a = (x21 * u21 + y21 * v21) / det ;
    double b = (x21 * v21 - y21 * u21) / det ;
    double tx = u1 - a * x1 + b * y1;
    double ty = v1 - b * x1 - a * y1;
    ST[0] = a; ST[1] = b; ST[2] = tx; ST[3] = ty;
    return true;
};
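
The closed form used above can be written out as follows. This is a sketch that assumes the parameterization $u = a x - b y + t_x$, $v = b x + a y + t_y$ with `ST = {a, b, tx, ty}`, which is read off from the code rather than stated in the text. Subtracting the equations of the two points eliminates the translation:

$\Delta u = a \Delta x - b \Delta y, \quad \Delta v = b \Delta x + a \Delta y, \quad \Delta x = x_2 - x_1, \ \Delta y = y_2 - y_1, \ \Delta u = u_2 - u_1, \ \Delta v = v_2 - v_1$

Solving this 2x2 system and substituting back into the equations of the first point gives:

$a = \frac{\Delta x \Delta u + \Delta y \Delta v}{\Delta x^2 + \Delta y^2}, \quad b = \frac{\Delta x \Delta v - \Delta y \Delta u}{\Delta x^2 + \Delta y^2}$

$t_x = u_1 - a x_1 + b y_1, \quad t_y = v_1 - b x_1 - a y_1$

If one further assumes $a = s \cos\theta$ and $b = s \sin\theta$, the scale and rotation angle follow as $s = \sqrt{a^2 + b^2}$ and $\theta = \operatorname{atan2}(b, a)$. The denominator vanishes only when the two source points coincide, which is the case the code rejects.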

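Example of use: aligning faces to build a training data set for face recognition:
- Eye position transform: (70,93), (114,84) --> (30,45), (100,45) (using linear interpolation).
- The transform actually applied has to find the src location that maps to each position of the fixed dst region, so the inverse transform is needed.
- The required inverse transform is easily obtained by simply swapping the roles of src and dst.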
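
The steps in the list above can be sketched as follows. This is an illustration under stated assumptions, not the author's alignment code: `TwoPointSimilarity` repeats the two-point formulas under a different name so that the block is self-contained, while `ApplySimilarity`, `SampleBilinear` and `WarpSimilarity` are hypothetical helpers, and the image is assumed to be 8-bit grayscale stored row by row.

```cpp
#include <cmath>

// Two-point solver with the same formulas as the two-point SimilarTransParams.
// ST = {a, b, tx, ty} with u = a*x - b*y + tx, v = b*x + a*y + ty.
bool TwoPointSimilarity(double x1, double y1, double x2, double y2,
                        double u1, double v1, double u2, double v2,
                        double ST[4]) {
    double x21 = x2 - x1, y21 = y2 - y1;
    double u21 = u2 - u1, v21 = v2 - v1;
    double det = x21 * x21 + y21 * y21;
    if (det == 0.) return false;
    double a = (x21 * u21 + y21 * v21) / det;
    double b = (x21 * v21 - y21 * u21) / det;
    ST[0] = a; ST[1] = b;
    ST[2] = u1 - a * x1 + b * y1;
    ST[3] = v1 - b * x1 - a * y1;
    return true;
}

// Applies ST to (x, y) and writes the result to (u, v).
void ApplySimilarity(const double ST[4], double x, double y, double& u, double& v) {
    u = ST[0] * x - ST[1] * y + ST[2];
    v = ST[1] * x + ST[0] * y + ST[3];
}

// Bilinear sample of a row-major gray image (w x h).
// Returns 0 when the 2x2 neighborhood does not lie fully inside the image.
double SampleBilinear(const unsigned char* img, int w, int h, double x, double y) {
    int x0 = (int)std::floor(x), y0 = (int)std::floor(y);
    if (x0 < 0 || y0 < 0 || x0 + 1 >= w || y0 + 1 >= h) return 0;
    double fx = x - x0, fy = y - y0;
    const unsigned char* p = img + y0 * w + x0;
    return (1 - fx) * (1 - fy) * p[0] + fx * (1 - fy) * p[1]
         + (1 - fx) * fy * p[w] + fx * fy * p[w + 1];
}

// Fills dst (dw x dh) by mapping every dst pixel back into src;
// inv holds the dst -> src parameters.
void WarpSimilarity(const unsigned char* src, int sw, int sh,
                    unsigned char* dst, int dw, int dh, const double inv[4]) {
    for (int v = 0; v < dh; v++)
        for (int u = 0; u < dw; u++) {
            double x, y;
            ApplySimilarity(inv, u, v, x, y);
            dst[v * dw + u] = (unsigned char)(SampleBilinear(src, sw, sh, x, y) + 0.5);
        }
}
```

For the eye positions quoted above, the dst -> src parameters would be obtained by swapping the roles of the point pairs, `TwoPointSimilarity(30, 45, 100, 45, 70, 93, 114, 84, inv)`, and then passed to `WarpSimilarity`.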

RANSAC: Circle Fit

Image Recognition 2008. 7. 21. 09:32

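The RANSAC algorithm is used to estimate a circle from a given set of 2D points. At least three points are needed to define a circle, and they must not be collinear. The circle defined this way is the circumcircle of the triangle that has the three points as vertices. For each such circumcircle, the inliers, that is, the points that do not deviate much from it, are collected (optionally, a step can be added that re-estimates the center and radius from these inliers by least squares to get a more precise estimate). This procedure is repeated for randomly chosen triples of points, and among the resulting circles the one with the most inliers is taken as the result.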
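
How many random triples are needed can be estimated from the inlier ratio; this is the quantity `num_sampling3` computes in the code below. As a sketch, assume each point is an inlier independently with probability $w$. A triple then consists only of inliers with probability $w^3$, so the probability that $n$ triples all contain at least one outlier is $(1 - w^3)^n$. Requiring this to be at most $p_{fail}$ gives:

$n \ge \frac{\log p_{fail}}{\log (1 - w^3)}$

For example, with $p_{fail} = 0.01$ and $w = 0.5$: $n \ge \log(0.01) / \log(0.875) \approx 34.5$, which the `int` cast in the code truncates to 34.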

// Reference: http://en.wikipedia.org/wiki/RANSAC

// Rewritten 2024.5.31;
double dist_deviate(const CfPt& pts, double cparam[3]) {
    double dx = pts.x - cparam[0];
    double dy = pts.y - cparam[1];
    return fabs(hypot(dx, dy) - cparam[2]);
}
int circumcircle(CfPt pts[3], double cparam[3]) {
    double x1 = pts[0].x, x2 = pts[1].x, x3 = pts[2].x;
    double y1 = pts[0].y, y2 = pts[1].y, y3 = pts[2].y;
    double bax = x2 - x1, bay = y2 - y1;
    double cax = x3 - x1, cay = y3 - y1;
    double E = bax * (x1 + x2) + bay * (y1 + y2);
    double F = cax * (x1 + x3) + cay * (y1 + y3);
    double G = 2. * (bax * (y3 - y2) - bay * (x3 - x2));
    if (G == 0.) return 0;    //error;
    //assert(fabs(G)>small_epsilon); //to prevent collinear or degenerate case;
    cparam[0] = (cay * E - bay * F) / G; //cx;
    cparam[1] = (bax * F - cax * E) / G; //cy;
    cparam[2] = hypot(cparam[0]-x1, cparam[1]-y1); //rad;
    return 1;
};
int num_sampling3(double prob_fail, double inlier_ratio) {
    return int(log(prob_fail)/log(1-pow(inlier_ratio, 3))); 
}
std::vector<int> Ransac_CircleFit(std::vector<CfPt>& points, double circle_param[3]) {
    if (points.size() < 3)
        return std::vector<int> (); //return null_vector;

    CfPt center; double inv_scale;
    // normalize input points for the sake of numerical stability;
    std::vector<CfPt> nor_pts = normalize(points, inv_scale, center);
    // distance threshold;
    double distance_thresh = sqrt(double(points.size())) * inv_scale;
    //ransac
    int sample_num = 1000;	//number of sample
    int ransac_count = 0;
    const double prob_fail = 0.01;
    double best_cparam[3] = {0};
    std::vector<int> best_inliers;
    while (sample_num > ransac_count) {
        // pick random 3 indices:[0,points.size()-1];
        int triple[3];
        random_triple(points.size()-1, triple);
        CfPt selected[3];
        for (int i = 0; i < 3; i++) 
            selected[i] = nor_pts[triple[i]];
        // circumcircle of 3 points;
        if (circumcircle(selected, circle_param)) {
            // find inliers;
            std::vector<int> inliers;
            inliers.reserve(points.size());
            for (int i = nor_pts.size(); i-->0;) {
                // error measure = geometric distance: |distance to center - radius|;
                double deviation = dist_deviate(nor_pts[i], circle_param);
                if (fabs(deviation) < distance_thresh)
                    inliers.push_back(i);
            }
            if (inliers.size() > best_inliers.size()) {			
                // update sampling_num;
                sample_num = num_sampling3(prob_fail, double(inliers.size())/points.size());
                // update best_inliers;
                best_inliers.swap(inliers);
                // update best circle param;
                for (int i = 0; i < 3; i++) 
                    best_cparam[i] = circle_param[i];
            }
        }
        if (++ransac_count > 1500) {
            TRACE("error! ransac_count exceed!\n");
            break;
        }
    }
    // recover original coordinate and scale;
    denormalize(best_cparam, best_cparam, inv_scale, center);
    if (best_cparam[0] > 0 && best_cparam[1] > 0) {
        for (int i = 0; i < 3; i++)
            circle_param[i] = best_cparam[i];
        TRACE("circle_found(%d, %d)\n", sample_num, ransac_count);
        // more accurate estimation needed at this stage;
    } else 
        best_inliers.clear();
    return best_inliers;
}
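
The refinement step mentioned in the text ("more accurate estimation needed at this stage") could look like the following. This is a minimal sketch using the algebraic (Kasa) least-squares circle fit, which is one common choice and not necessarily the method the author uses; `Pt` is a hypothetical stand-in for `CfPt` and `RefineCircleLSQ` is a name chosen for illustration.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for CfPt.
struct Pt { double x, y; };

// Algebraic (Kasa) least-squares circle fit of the points pts[idx[k]].
// Writes {cx, cy, radius} to cparam. Returns false for fewer than
// 3 points or for (nearly) collinear points.
bool RefineCircleLSQ(const std::vector<Pt>& pts, const std::vector<int>& idx,
                     double cparam[3]) {
    const std::size_t n = idx.size();
    if (n < 3) return false;
    // work in coordinates centered at the mean for numerical stability
    double mx = 0, my = 0;
    for (int i : idx) { mx += pts[i].x; my += pts[i].y; }
    mx /= double(n); my /= double(n);
    double Suu = 0, Svv = 0, Suv = 0, Suuu = 0, Svvv = 0, Suvv = 0, Svuu = 0;
    for (int i : idx) {
        double u = pts[i].x - mx, v = pts[i].y - my;
        Suu += u * u;       Svv += v * v;       Suv += u * v;
        Suuu += u * u * u;  Svvv += v * v * v;
        Suvv += u * v * v;  Svuu += v * u * u;
    }
    // normal equations: [Suu Suv; Suv Svv] (uc, vc)^T = (r1, r2)^T
    double det = Suu * Svv - Suv * Suv;
    double scale = (Suu + Svv) * (Suu + Svv);
    if (std::fabs(det) <= 1e-12 * scale) return false;
    double r1 = 0.5 * (Suuu + Suvv), r2 = 0.5 * (Svvv + Svuu);
    double uc = (r1 * Svv - r2 * Suv) / det;
    double vc = (r2 * Suu - r1 * Suv) / det;
    cparam[0] = uc + mx;
    cparam[1] = vc + my;
    cparam[2] = std::sqrt(uc * uc + vc * vc + (Suu + Svv) / double(n));
    return true;
}
```

In `Ransac_CircleFit` this would be called with the original (denormalized) points and `best_inliers`. The algebraic fit tends to underestimate the radius when the inliers cover only a short arc, so a geometric fit may be preferable in that case.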

Related posts:
- Least Squares Fitting of Circles: https://kipl.tistory.com/207
- Circle Fitting: Pratt: https://kipl.tistory.com/357
