Backpropagation

8월 24, 2017

오차역전파법. 풀어쓰자면 Backward Propagation Of Errors 라고 할 수 있다. 즉, 오차를 역으로(반대방향으로) 전파하는 방법이다.
오차역전파법은 계산 그래프로 이해해 볼 수 있다.

계산 그래프

어파인 = Affine 계층

class Affine:
    def __init__(self, W, b):
        self.W = W
        self.b = b
        
        self.x = None
        self.original_x_shape = None
        # 가중치와 편향 매개변수의 미분
        self.dW = None
        self.db = None

    def forward(self, x):
        # 텐서 대응
        self.original_x_shape = x.shape
        x = x.reshape(x.shape[0], -1)
        self.x = x

        out = np.dot(self.x, self.W) + self.b

        return out

    def backward(self, dout):
        dx = np.dot(dout, self.W.T)
        self.dW = np.dot(self.x.T, dout)
        self.db = np.sum(dout, axis=0)
        
        dx = dx.reshape(*self.original_x_shape)  # 입력 데이터 모양 변경(텐서 대응)
        return dx

Softmax-with-Loss

손실 함수

출처

기울기 확인 = Gradient Check

이 블로그 검색

태그

Bakbang's Moments

Backpropagation

댓글

댓글 쓰기

이 블로그의 인기 게시물

RMSprop

SGD = Stochastic Gradient Descent