Affine Transformation

Image Recognition 2010. 1. 20. 21:05

The shape of an object can be approximated by a polygon or a set of polygons. Snakes and active shape models (ASM), for example, use this representation for hand shapes, facial contours, or the outlines of organs in medical images. Such applications need to align a given shape to a reference shape. In general, the features extracted from images taken by a camera are related to one another by a projective transformation. In many cases, however, it is enough to consider in-plane deformation only. The most general shape deformation in that case is the affine transformation, which allows rotation, translation, scaling, and shear. Simpler models drop the shear (similarity transformation) or additionally drop the scaling (isometric transformation).

Suppose two shapes with $N$ vertices each, $S = \{(x_1, y_1), (x_2, y_2), \ldots, (x_N, y_N)\}$ and $S' = \{(x'_1, y'_1), (x'_2, y'_2), \ldots, (x'_N, y'_N)\}$, are related by an affine transformation. Corresponding vertices then satisfy

$\begin{aligned} x'_i &= a x_i + b y_i + t_x \\ y'_i &= c x_i + d y_i + t_y, \quad (i = 1, 2, \ldots, N), \end{aligned}$

which is described by the six parameters $(a, b, c, d, t_x, t_y)$ (translation: 2, along the $x$/$y$ axes; rotation: 1; shear: 1; scale: 2, along the $x$/$y$ axes). Under an affine transformation, two parallel lines remain parallel after the transformation.
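As a minimal sketch of the mapping above, the following applies the six parameters to every vertex of a shape. The `Pt` struct and the `ApplyAffine` name are hypothetical helpers introduced only for illustration; the parameter order `{a, b, c, d, tx, ty}` is chosen to match the `AT[6]` layout described in the comment of the post's own listing.

```cpp
#include <vector>

// Hypothetical plain point type (the post's own listing uses MFC's CPoint).
struct Pt { double x, y; };

// Applies x' = a*x + b*y + tx and y' = c*x + d*y + ty to every vertex.
// AT = {a, b, c, d, tx, ty}.
std::vector<Pt> ApplyAffine(const std::vector<Pt>& shape, const double AT[6]) {
    std::vector<Pt> out;
    out.reserve(shape.size());
    for (const Pt& p : shape) {
        out.push_back({AT[0] * p.x + AT[1] * p.y + AT[4],
                       AT[2] * p.x + AT[3] * p.y + AT[5]});
    }
    return out;
}
```

With `AT = {2, 0, 0, 3, 1, -1}`, for instance, the vertex (1, 1) maps to (3, 2): a pure axis-aligned scaling followed by a translation.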

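Vertex positions are obtained through various image-processing steps, so they inevitably contain noise and should be treated as random variables. To extract the best parameters from these noisy measurements, we use the least-squares method. Let us find the parameters that minimize the squared distance between the affine-transformed coordinates and the measured coordinates: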

$L = \sum_i \left( \left| x'_i - a x_i - b y_i - t_x \right|^2 + \left| y'_i - c x_i - d y_i - t_y \right|^2 \right)$
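The cost can be evaluated directly, which is handy for checking a candidate set of parameters. This is only a sketch: `Pt` and `AffineCost` are hypothetical names, the parameters are packed as `{a, b, c, d, tx, ty}`, and the points are assumed to be paired by index.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical plain point type used for illustration.
struct Pt { double x, y; };

// Sum of squared residuals L for AT = {a, b, c, d, tx, ty}.
// Assumes src[i] corresponds to dst[i]; only the common prefix is used.
double AffineCost(const std::vector<Pt>& src, const std::vector<Pt>& dst,
                  const double AT[6]) {
    const std::size_t n = src.size() < dst.size() ? src.size() : dst.size();
    double L = 0.0;
    for (std::size_t i = 0; i < n; ++i) {
        const double rx = dst[i].x - (AT[0] * src[i].x + AT[1] * src[i].y + AT[4]);
        const double ry = dst[i].y - (AT[2] * src[i].x + AT[3] * src[i].y + AT[5]);
        L += rx * rx + ry * ry;
    }
    return L;
}
```

A cost of zero means the parameters reproduce every measured vertex exactly; with noisy data, the least-squares solution is the parameter set for which this value is smallest.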

To find the parameters that define the affine transformation, differentiate $L$ with respect to each parameter and require an extremum:

        ∂L/∂a = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) * x_i ;
        ∂L/∂b = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) * y_i ;
        ∂L/∂c = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y) * x_i ;
        ∂L/∂d = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y) * y_i ;
        ∂L/∂t_x = -2 * ∑ (x'_i - a * x_i - b * y_i - t_x) ;
        ∂L/∂t_y = -2 * ∑ (y'_i - c * x_i - d * y_i - t_y);

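Setting each expression to zero and rewriting the resulting simultaneous equations in matrix form gives

$\begin{pmatrix} S_{xx} & S_{xy} & S_x \\ S_{xy} & S_{yy} & S_y \\ S_x & S_y & N \end{pmatrix} \begin{pmatrix} a \\ b \\ t_x \end{pmatrix} = \begin{pmatrix} S_{xx'} \\ S_{yx'} \\ S_{x'} \end{pmatrix}, \qquad \begin{pmatrix} S_{xx} & S_{xy} & S_x \\ S_{xy} & S_{yy} & S_y \\ S_x & S_y & N \end{pmatrix} \begin{pmatrix} c \\ d \\ t_y \end{pmatrix} = \begin{pmatrix} S_{xy'} \\ S_{yy'} \\ S_{y'} \end{pmatrix}$

where

$S_x = \sum_i x_i, \quad S_y = \sum_i y_i, \quad S_{xx} = \sum_i x_i^2, \quad S_{xy} = \sum_i x_i y_i, \quad S_{yy} = \sum_i y_i^2,$
$S_{x'} = \sum_i x'_i, \quad S_{y'} = \sum_i y'_i, \quad S_{xx'} = \sum_i x_i x'_i, \quad S_{yx'} = \sum_i y_i x'_i, \quad S_{xy'} = \sum_i x_i y'_i, \quad S_{yy'} = \sum_i y_i y'_i.$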

#include <vector>

// Least-squares estimate of the affine map dst = (A,T)src:
//  [u]  = [ A0 A1 ][x] + A4
//  [v]  = [ A2 A3 ][y] + A5
// (x, y) are the source points and (u, v) the destination points, i.e.
// (x', y') in the text. CPoint and BOOL are the MFC/ATL types.
// Returns FALSE if the point counts differ or the normal matrix is singular
// (fewer than three points, or collinear source points).
BOOL GetAffineParameter(const std::vector<CPoint> &srcPts,
                        const std::vector<CPoint> &dstPts,
                        double AT[6])
{
    // Each source point needs a corresponding destination point.
    if (srcPts.size() != dstPts.size()) return FALSE;
    const int n = static_cast<int>(srcPts.size());

    // Accumulate the sums that appear in the normal equations.
    double Sx = 0, Sy = 0, Sxx = 0, Sxy = 0, Syy = 0;
    double Su = 0, Sv = 0, Sxu = 0, Sxv = 0, Syu = 0, Syv = 0;
    for (int i = 0; i < n; ++i) {
        const double x = srcPts[i].x, y = srcPts[i].y;
        const double u = dstPts[i].x, v = dstPts[i].y;
        Sx += x;        Sy += y;
        Sxx += (x * x); Sxy += (x * y); Syy += (y * y);
        Su += u;        Sv += v;
        Sxu += (x * u); Sxv += (x * v); Syu += (y * u); Syv += (y * v);
    }

    // Symmetric 3x3 normal matrix, stored row-major.
    double A[9], invA[9];
    A[0] = Sxx; A[1] = Sxy; A[2] = Sx;
    A[3] = Sxy; A[4] = Syy; A[5] = Sy;
    A[6] = Sx ; A[7] = Sy ; A[8] = n;

    double det = (A[0]*(A[4]*A[8]-A[5]*A[7])-
                  A[1]*(A[3]*A[8]-A[5]*A[6])+
                  A[2]*(A[3]*A[7]-A[4]*A[6]));
    if (det == 0.) return FALSE;

    // Inverse = adjugate / determinant.
    det = 1. / det;
    invA[0] = (A[4]*A[8] - A[5]*A[7]) * det;
    invA[1] = (A[2]*A[7] - A[1]*A[8]) * det;
    invA[2] = (A[1]*A[5] - A[2]*A[4]) * det;
    invA[3] = (A[5]*A[6] - A[3]*A[8]) * det;
    invA[4] = (A[0]*A[8] - A[2]*A[6]) * det;
    invA[5] = (A[2]*A[3] - A[0]*A[5]) * det;
    invA[6] = (A[3]*A[7] - A[4]*A[6]) * det;
    invA[7] = (A[1]*A[6] - A[0]*A[7]) * det;
    invA[8] = (A[0]*A[4] - A[1]*A[3]) * det;

    // First row of the map (a, b, t_x) from the u sums.
    AT[0] = invA[0] * Sxu + invA[1] * Syu + invA[2] * Su;
    AT[1] = invA[3] * Sxu + invA[4] * Syu + invA[5] * Su;
    AT[4] = invA[6] * Sxu + invA[7] * Syu + invA[8] * Su;
    // Second row of the map (c, d, t_y) from the v sums.
    AT[2] = invA[0] * Sxv + invA[1] * Syv + invA[2] * Sv;
    AT[3] = invA[3] * Sxv + invA[4] * Syv + invA[5] * Sv;
    AT[5] = invA[6] * Sxv + invA[7] * Syv + invA[8] * Sv;
    return TRUE;
}
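For readers without MFC, the same least-squares fit can be sketched with standard C++ only. This is an illustrative variant, not the post's routine: `Pt` and `FitAffine` are hypothetical names, and the relative tolerance used to reject a near-singular system is an assumption that the original (which tests `det` against exact zero) does not make.

```cpp
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical plain point type standing in for CPoint.
struct Pt { double x, y; };

// Least-squares affine fit dst ~ A*src + T, with AT = {a, b, c, d, tx, ty}.
// Returns false on size mismatch, fewer than three points, or a
// (near-)singular normal matrix such as collinear source points.
bool FitAffine(const std::vector<Pt>& src, const std::vector<Pt>& dst, double AT[6]) {
    if (src.size() != dst.size() || src.size() < 3) return false;
    double Sx = 0, Sy = 0, Sxx = 0, Sxy = 0, Syy = 0;
    double Su = 0, Sv = 0, Sxu = 0, Sxv = 0, Syu = 0, Syv = 0;
    for (std::size_t i = 0; i < src.size(); ++i) {
        const double x = src[i].x, y = src[i].y, u = dst[i].x, v = dst[i].y;
        Sx += x; Sy += y; Sxx += x * x; Sxy += x * y; Syy += y * y;
        Su += u; Sv += v; Sxu += x * u; Sxv += x * v; Syu += y * u; Syv += y * v;
    }
    const double n = static_cast<double>(src.size());
    // Cofactors of the symmetric matrix [[Sxx,Sxy,Sx],[Sxy,Syy,Sy],[Sx,Sy,n]].
    const double c00 = Syy * n - Sy * Sy;
    const double c01 = Sx * Sy - Sxy * n;
    const double c02 = Sxy * Sy - Syy * Sx;
    const double c11 = Sxx * n - Sx * Sx;
    const double c12 = Sxy * Sx - Sxx * Sy;
    const double c22 = Sxx * Syy - Sxy * Sxy;
    const double det = Sxx * c00 + Sxy * c01 + Sx * c02;
    // Assumed relative tolerance; tune it for the coordinate range in use.
    if (std::fabs(det) <= 1e-12 * (Sxx * Syy * n + 1.0)) return false;
    // Solve both 3x3 systems with the shared inverse (cofactors / det).
    AT[0] = (c00 * Sxu + c01 * Syu + c02 * Su) / det;  // a
    AT[1] = (c01 * Sxu + c11 * Syu + c12 * Su) / det;  // b
    AT[4] = (c02 * Sxu + c12 * Syu + c22 * Su) / det;  // tx
    AT[2] = (c00 * Sxv + c01 * Syv + c02 * Sv) / det;  // c
    AT[3] = (c01 * Sxv + c11 * Syv + c12 * Sv) / det;  // d
    AT[5] = (c02 * Sxv + c12 * Syv + c22 * Sv) / det;  // ty
    return true;
}
```

Feeding it point pairs generated from known parameters should return those parameters up to rounding error, which makes a convenient sanity check before using the fit on noisy data.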

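The figure below shows the transformation applied to feature points extracted from fingerprints. The lower image is the reference template (red points). In the upper image, the feature points that match between this reference template and the input fingerprint's feature points (yellow and green points) are found first (yellow points); the affine parameters between the two fingerprint images are then estimated from those matches, and the reference template is transformed accordingly. This reveals newly found feature points that are absent from the reference template (green points), and adding them to the reference template yields a template that covers a wider area. Deciding whether an added green point is reliable, of course, requires additional information.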
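One way to pick out the candidate new points described above is a simple distance test against the already transformed template. The sketch below is an assumption for illustration, not the method used for the figure: `Pt`, `FindUnmatched`, and the matching radius `tol` are all hypothetical, and real minutiae matching would normally use more than position alone.

```cpp
#include <vector>

// Hypothetical plain point type used for illustration.
struct Pt { double x, y; };

// Returns the input points that have no transformed template point within
// distance tol. tol is an assumed matching radius in pixels.
std::vector<Pt> FindUnmatched(const std::vector<Pt>& warpedTemplate,
                              const std::vector<Pt>& input, double tol) {
    std::vector<Pt> fresh;
    for (const Pt& q : input) {
        bool matched = false;
        for (const Pt& p : warpedTemplate) {
            const double dx = q.x - p.x, dy = q.y - p.y;
            if (dx * dx + dy * dy <= tol * tol) { matched = true; break; }
        }
        if (!matched) fresh.push_back(q);
    }
    return fresh;
}
```

The returned points correspond to the green points in the description; whether they should actually be merged into the template is a separate decision that this sketch does not address.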

License: Attribution, NonCommercial, NoDerivatives
