Robust Line Fitting

Image Recognition 2008. 7. 8. 23:50

Suppose a set of points $\{(x_i, y_i) \mid i = 1, 2, \dots, N\}$ has been observed in an image. To fit the line $y = a + b x$ to these points, the usual choice is the least-squares method: find the intercept $a$ and slope $b$ of the line that minimize the sum of squared differences (square deviation) between the $y$ values predicted by the line and the $y$ values actually observed:

$$\chi^2(a, b) = \sum_i \left| y_i - (b x_i + a) \right|^2$$
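Because $\chi^2$ is quadratic in $a$ and $b$, setting its two partial derivatives to zero gives a pair of linear equations that can be solved directly. The derivation below is a sketch; the shorthand $S_x$, $S_y$, $S_{xx}$, $S_{xy}$ and $\Delta$ is notation introduced here, chosen to mirror the variables `sx`, `sy`, `sxx`, `sxy` and `det` in `FitLine_LS`.

```latex
\begin{aligned}
& S_x = \sum_i x_i, \quad S_y = \sum_i y_i, \quad
  S_{xx} = \sum_i x_i^2, \quad S_{xy} = \sum_i x_i y_i, \\
& \frac{\partial \chi^2}{\partial a} = 0 \;\Rightarrow\; N a + b\, S_x = S_y, \\
& \frac{\partial \chi^2}{\partial b} = 0 \;\Rightarrow\; a\, S_x + b\, S_{xx} = S_{xy}, \\
& \Delta = N S_{xx} - S_x^2, \\
& a = \frac{S_{xx} S_y - S_x S_{xy}}{\Delta}, \qquad
  b = \frac{N S_{xy} - S_x S_y}{\Delta}.
\end{aligned}
```

The solution exists only when $\Delta \neq 0$, that is, when the $x_i$ are not all equal.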

During measurement, each observed value $y_i$ picks up random noise, which can be assumed to be normally distributed around the true value $y(x)$. If every measurement has noise with the same standard deviation $\sigma$, and the individual measurements are assumed to be IID, the probability (likelihood) of observing the $N$ data points takes the form

$$P = \prod_i \exp\left( -\frac{\left| y_i - (b x_i + a) \right|^2}{2 \sigma^2} \right)$$

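Least squares is therefore the method that finds the parameters maximizing this likelihood. It has the advantage that the fit parameters can be written in closed form from the data, but it is very vulnerable to outliers (see the results at the end), because even a small number of outliers can make a large contribution to $\chi^2$. To make the fit more robust, the contribution of outliers to the likelihood must be reduced, which means replacing the exponent of the likelihood with a form that responds less sensitively to large errors. One of the simplest ways to do this is to use the absolute deviation instead of the square deviation: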

$$\text{absolute deviation} = \sum_i \left| y_i - (b x_i + a) \right|.$$

With the absolute deviation, however, unlike least squares, the intercept $a$ and slope $b$ cannot be expressed in closed form from the data $\{(x_i, y_i)\}$ and must be found iteratively.
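To make the difference in outlier sensitivity concrete, here is a minimal sketch that measures how much of the total cost comes from the single largest residual under each choice of cost. The helper name `LargestResidualShare` and the sample residuals are assumptions made for illustration; they are not part of the original code.

```cpp
#include <cassert>
#include <cmath>
#include <vector>

// Fraction of the total cost contributed by the residual with the largest
// magnitude. power == 2 uses the squared deviation, power == 1 the absolute
// deviation. (Hypothetical helper, for illustration only.)
double LargestResidualShare(const std::vector<double>& r, int power) {
    assert(!r.empty() && (power == 1 || power == 2));
    double total = 0, largest = 0;
    for (double v : r) {
        const double c = (power == 2) ? v * v : std::fabs(v);
        total += c;
        if (c > largest) largest = c;
    }
    return total > 0 ? largest / total : 0.0;
}
```

For the residuals {1, -1, 1, -1, 10}, the one large residual accounts for 100/104 of the squared cost but only 10/14 of the absolute cost, which is why it pulls a least-squares fit much harder.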

The approach uses the fact that, for a sequence $\{c_i\}$, the sum $\sum_i |c_i - a|$ is minimized when $a$ is the median of the sequence. (Proof: to find the extremum, differentiate with respect to $a$, which gives $0 = \left(\sum_{c_i > a} 1\right) - \left(\sum_{c_i < a} 1\right)$, where the sum has been split into the terms with $c_i$ greater than $a$ and the terms with $c_i$ less than $a$. For this to vanish, the two groups must contain the same number of terms, hence $a = \text{median}\{c_i\}$. q.e.d.) For a fixed slope $b$, the intercept $a$ that minimizes the absolute deviation is therefore

$$a = \text{median}\{y_i - b x_i\}$$

Next, differentiating the absolute deviation with respect to the slope $b$ gives

$$0 = \sum_i x_i \, \text{sign}\left( y_i - (b x_i + a) \right)$$

Substituting the intercept $a$ found above into this equation and applying bracketing and bisection yields the slope $b$. (The left-hand side is a discontinuous function of $b$, so general-purpose root finders are risky here.) The listing that follows (FitLine_LS, RhoFunc, FitLine_MAD) implements this procedure.
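Before the full listing, the median property used in the derivation can be checked numerically with a small, separate sketch. The helper names `SumAbsDev` and `Median` are assumptions for illustration; the median convention (averaging the two middle values for an even count) is the same one `RhoFunc` uses.

```cpp
#include <algorithm>
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Sum of absolute deviations of the values c_i from a candidate a.
double SumAbsDev(const std::vector<double>& c, double a) {
    double s = 0;
    for (double v : c) s += std::fabs(v - a);
    return s;
}

// Median by sorting a copy; averages the two middle values when the
// count is even.
double Median(std::vector<double> c) {
    assert(!c.empty());
    std::sort(c.begin(), c.end());
    const std::size_t m = c.size() / 2;
    return (c.size() & 1) ? c[m] : (c[m] + c[m - 1]) / 2;
}
```

For c = {1, 2, 3, 4, 100} the median is 3 and the sum of absolute deviations there is 101, while at the mean (22) it is 156. The outlier at 100 moves the mean a long way but leaves the median untouched.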

#include <algorithm>
#include <cfloat>
#include <cmath>
#include <vector>

// Least-squares line fit of y = a + b * x.
// Returns sigma[dy] / sigma[x], or -1 if the fit is degenerate.
double FitLine_LS(std::vector<double>& x, std::vector<double>& y, double& a, double& b) {
    double sx = 0, sy = 0, sxx = 0, sxy = 0;
    for (int i = x.size(); i-->0;) {
        sx  += x[i];        sy  += y[i];
        sxx += x[i] * x[i]; sxy += x[i] * y[i];
    }
    const int n = x.size();
    double det = n * sxx - sx * sx;
    if (det == 0.) return -1;                   // vertical line (all x equal);
    a = (sxx * sy - sx * sxy) / det;
    b = (n * sxy - sx * sy) / det;
    double chi2 = 0;
    for (int i = x.size(); i-->0;) {
        double t = y[i] - (a + b * x[i]);       // a, b are references, not pointers
        chi2 += t * t;
    }
    det /= n;         // det -> var(x) * n;
    // chi2 = var(dy) * n;
    // (ratio of the spread of dy to the spread of x)
    return std::sqrt(chi2 / det);
}

// Given the slope bb, estimates the y-intercept aa (as a median) and
// computes the absolute deviation abdev for that (aa, bb).
double RhoFunc(std::vector<double>& x, std::vector<double>& y,
               double bb, double& aa, double& abdev) {
    std::vector<double> h(x.size());
    for (int i = x.size(); i-->0;) h[i] = y[i] - bb * x[i];
    std::sort(h.begin(), h.end());
    // median;
    const int med = h.size()/2;
    aa = (h.size() & 1) ? h[med] : (h[med] + h[med-1])/2;

    double sum = 0;
    abdev = 0;
    for (int i = x.size(); i-->0;) {
        double d = y[i] - (bb * x[i] + aa);
        abdev += std::fabs(d);
        // rescale by |y_i| to remove the ambiguity of the sign function at the origin;
        if (y[i] != 0.) d /= std::fabs(y[i]);
        if (std::fabs(d) > DBL_EPSILON) // count only residuals clearly away from zero,
            sum += (d >= 0 ? x[i] : -x[i]);
    }
    return sum; // return sum{xi * sign(yi - (b * xi + a))}
}
// y = a + b * x ;
// Least Absolute Deviation:
// returns the mean absolute deviation, or -1 if the least-squares
// starting estimate is degenerate (all x equal).
double FitLine_MAD (std::vector<double>& x, std::vector<double>& y,
                    double& a, double& b) {
    // least-squares estimates for (aa, bb), used as the starting point;
    double aa, bb, abdev;
    double sigb = FitLine_LS(x, y, aa, bb);   // estimate: y=aa + bb*x;
    if (sigb < 0) return -1;                  // aa, bb were not set;
    double b1 = bb;
    double f1 = RhoFunc(x, y, b1, aa, abdev);
    /* bracket 3-sigma away in the downhill direction;*/
    double b2 = std::fabs(3 * sigb);
    b2 = bb + (f1 < 0 ? -b2 : b2);
    double f2 = RhoFunc(x, y, b2, aa, abdev);

    // if conditional added to take care of the case of a
    // line input into this function. It is needed to avoid an
    // infinite loop when (b1 == b2) (within floating point error)
    if (std::fabs(b2 - b1) > (sigb + 0.005)) {
        // bracketing;
        while ((f1 * f2) > 0) {
            bb = 2 * b2 - b1;
            b1 = b2; b2 = bb;
            f1 = f2; f2 = RhoFunc(x, y, b2, aa, abdev);
        }
    }
    // refine until the error is a negligible number of std;
    sigb *= 0.01;
    while (std::fabs(b2 - b1) > sigb) {
        // bisection;
        bb = (b1 + b2) / 2.;
        if ((bb == b1) || (bb == b2)) break;
        double f = RhoFunc(x, y, bb, aa, abdev);
        if ((f * f1) >= 0) {
            f1 = f; b1 = bb;
        } else {
            f2 = f; b2 = bb;
        }
    }
    a = aa; b = bb;
    return (abdev/x.size());
}

// Red line --> least-squares fit: very vulnerable to outliers.
// Blue line --> least-absolute-deviation fit: very robust to outliers.
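As an independent cross-check on the iterative routine, the following is a brute-force sketch. It is not the method used in this post: it relies on the known property that, when the $x_i$ are not all equal, some line minimizing the absolute deviation passes through at least two data points, so it simply tries every pair. The function name `FitLine_LAD_BruteForce` is hypothetical, and the O(N^3) cost makes it suitable only for small inputs.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <limits>
#include <vector>

// Brute-force least-absolute-deviation fit of y = a + b * x.
// Tries the line through every pair of points with distinct x and keeps
// the one with the smallest total absolute deviation.
// Returns that total, or -1 if no such pair exists (all x equal).
double FitLine_LAD_BruteForce(const std::vector<double>& x,
                              const std::vector<double>& y,
                              double& a, double& b) {
    assert(x.size() == y.size());
    double best = std::numeric_limits<double>::infinity();
    for (std::size_t i = 0; i < x.size(); ++i) {
        for (std::size_t j = i + 1; j < x.size(); ++j) {
            if (x[i] == x[j]) continue;          // vertical pair: skip
            const double bb = (y[j] - y[i]) / (x[j] - x[i]);
            const double aa = y[i] - bb * x[i];
            double dev = 0;
            for (std::size_t k = 0; k < x.size(); ++k)
                dev += std::fabs(y[k] - (aa + bb * x[k]));
            if (dev < best) { best = dev; a = aa; b = bb; }
        }
    }
    return std::isinf(best) ? -1 : best;
}
```

With x = {0, 1, 2, 3, 4} and y = {1, 3, 5, 7, 100}, where the last point is an outlier on the line y = 1 + 2x, this returns a = 1 and b = 2 with a total deviation of 91, all of it coming from the outlier.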
