Understanding PyTorch Layers

Loading the Example

import torch
from torchvision import datasets, transforms

import numpy as np
import matplotlib.pyplot as plt

%matplotlib inline

train_loader = torch.utils.data.DataLoader(
    datasets.MNIST('dataset', train=True, download=True,
                   transform=transforms.Compose([
                       transforms.ToTensor()
                   ])),
    batch_size=1)

image, label = next(iter(train_loader))

image.shape, label.shape

(torch.Size([1, 1, 28, 28]), torch.Size([1]))

plt.imshow(image[0,0, :, :], 'gray')
plt.show()

Layer-by-Layer Walkthrough

Preparing to build a network

import torch
import torch.nn as nn                 # layers that own learnable weights
import torch.nn.functional as F       # stateless functions: pooling, activation functions

Convolution

in_channels: number of channels in the input
out_channels: number of channels the layer produces
kernel_size: size of the kernel (weights) to create

nn.Conv2d(in_channels = 1, out_channels = 20, kernel_size = 5, stride = 1)

Conv2d(1, 20, kernel_size=(5, 5), stride=(1, 1))

# The same convolution layer as above, written with positional arguments
layer = nn.Conv2d(1, 20, 5, 1).to(torch.device('cpu'))
layer

Conv2d(1, 20, kernel_size=(5, 5), stride=(1, 1))

Inspecting and visualizing the weights

weight = layer.weight
weight.shape

torch.Size([20, 1, 5, 5])

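The weight is a learnable parameter (requires_grad=True), so it cannot be converted to NumPy directly.
detach() returns a tensor that is cut off from the computation graph, so no gradient flows through it.

# weight = weight.numpy()  # raises an error before detach()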
weight = weight.detach().numpy()

weight.shape

(20, 1, 5, 5)
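The detach() behaviour described above can be checked with a small sketch. It assumes a freshly created layer with the same arguments as in this post; the variable names `raised` and `weight_arr` are only for illustration.

```python
import torch
import torch.nn as nn

layer = nn.Conv2d(1, 20, 5, 1)
weight = layer.weight            # nn.Parameter with requires_grad=True

# Calling .numpy() on a tensor that requires grad raises a RuntimeError.
try:
    weight.numpy()
    raised = False
except RuntimeError:
    raised = True

# detach() returns a tensor outside the autograd graph;
# the original parameter itself stays trainable.
weight_arr = weight.detach().numpy()

print(raised)
print(weight.requires_grad, weight.detach().requires_grad)
print(weight_arr.shape)
```

Note that detach() does not modify the layer: only the returned tensor stops tracking gradients.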

plt.imshow(weight[0,0, :,:], 'jet')
plt.colorbar()
plt.show()

Converting the output to NumPy for visualization

output_data = layer(image)

output = output_data.cpu().detach().numpy()

output.shape

(1, 20, 24, 24)
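The spatial size of 24 in this output can be worked out by hand. The following is a sketch of the usual output-size formula for a convolution, assuming no dilation, where $p$ is the padding (0 by default in nn.Conv2d), $k$ the kernel size, and $s$ the stride:

```latex
H_{\text{out}} = \left\lfloor \frac{H_{\text{in}} + 2p - k}{s} \right\rfloor + 1
               = \left\lfloor \frac{28 + 2 \cdot 0 - 5}{1} \right\rfloor + 1
               = 24
```

The width follows the same calculation, and the 20 channels come from out_channels.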

Converting the input image to NumPy

image_arr = image.numpy()
image_arr.shape

(1, 1, 28, 28)

plt.figure(figsize = (15, 30))
plt.subplot(131)
plt.title('input')
plt.imshow(np.squeeze(image_arr), 'gray')

plt.subplot(132)
plt.title('Weight')
plt.imshow(weight[0,0, :, :], 'jet')

plt.subplot(133)
plt.title('Output')
plt.imshow(output[0,0, :, :], 'gray')
plt.show()

# The output is the input filtered by the convolution layer's weights

Pooling

Pass the input first, followed by the kernel size and the stride, in that order.

image.shape

torch.Size([1, 1, 28, 28])

pool = F.max_pool2d(image, 2, 2)
pool.shape

torch.Size([1, 1, 14, 14])
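To see what a kernel size of 2 and a stride of 2 actually do, here is a minimal pure-Python sketch of max pooling on a small grid. The helper `max_pool2d` below is a hypothetical stand-in written for illustration, not the PyTorch function, and it assumes no padding.

```python
def max_pool2d(grid, kernel, stride):
    """Max pooling over a 2D list of numbers, without padding."""
    out_h = (len(grid) - kernel) // stride + 1
    out_w = (len(grid[0]) - kernel) // stride + 1
    return [
        [
            # Largest value inside the kernel x kernel window at (r, c).
            max(
                grid[r * stride + i][c * stride + j]
                for i in range(kernel)
                for j in range(kernel)
            )
            for c in range(out_w)
        ]
        for r in range(out_h)
    ]


grid = [
    [1, 2, 5, 6],
    [3, 4, 7, 8],
    [9, 8, 1, 0],
    [7, 6, 3, 2],
]
pooled = max_pool2d(grid, kernel=2, stride=2)
print(pooled)  # [[4, 8], [9, 3]]
```

Each non-overlapping 2x2 window is reduced to its maximum, which is why a 28x28 input becomes 14x14.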

The input image does not require gradients and max pooling has no weights, so numpy() can be called on the result directly.

pool_arr = pool.numpy()
pool_arr.shape

(1, 1, 14, 14)

image_arr.shape

(1, 1, 28, 28)

plt.figure(figsize = (10,15))
plt.subplot(121)
plt.title('Input')
plt.imshow(np.squeeze(image_arr), 'gray')
plt.subplot(122)
plt.title('Output')
plt.imshow(np.squeeze(pool_arr), 'gray')
plt.show()

Linear

nn.Linear transforms only the last dimension, so the 2D image must be flattened into a 1D feature vector with .view() first.

flatten = image.view(1, 28 * 28)     # (batch_size, flatten_size)
flatten.shape

torch.Size([1, 784])

lin = nn.Linear(784, 10)(flatten)
lin.shape

torch.Size([1, 10])
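As a sketch of the flattening step, the snippet below compares a few equivalent ways to reach a (batch_size, 784) tensor. It uses a random tensor as a stand-in for the MNIST batch, and the names `flat_view`, `flat_auto`, `flat_func`, and `fc` are illustrative only.

```python
import torch
import torch.nn as nn

image = torch.rand(1, 1, 28, 28)     # stand-in for one MNIST batch

# Three equivalent ways to get a (batch_size, 784) tensor.
flat_view = image.view(1, 28 * 28)
flat_auto = image.view(image.size(0), -1)      # -1 infers 784 from the remaining dims
flat_func = torch.flatten(image, start_dim=1)  # keep dim 0 (batch), flatten the rest

fc = nn.Linear(784, 10)
print(fc(flat_func).shape)

# nn.Linear acts on the last dimension only; leading dimensions pass through.
print(fc(torch.rand(4, 3, 784)).shape)
```

Using `image.size(0)` or `torch.flatten` avoids hard-coding the batch size, which helps once batch_size is larger than 1.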

lin

tensor([[ 0.1256, -0.0309,  0.0668,  0.0870, -0.3929,  0.1598, -0.0255,  0.2403,
          0.0621, -0.1201]], grad_fn=<AddmmBackward>)

plt.imshow(lin.detach().numpy(), 'jet')
plt.show()

Softmax

To pull the result out as NumPy, gradient tracking for the weight-bearing Linear layer has to be switched off; torch.no_grad() does this.

with torch.no_grad():
    flatten = image.view(1, 28 * 28)
    lin = nn.Linear(784,10)(flatten)
    softmax = F.softmax(lin, dim=1)

softmax

tensor([[0.1040, 0.0873, 0.1269, 0.0959, 0.0567, 0.0720, 0.0840, 0.1195, 0.1198,
         0.1340]])

np.sum(softmax.numpy())

1.0
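The sum of 1.0 follows from how softmax is defined: each value is exponentiated and divided by the total. A minimal pure-Python sketch, with made-up logits and a hypothetical helper named `softmax` (not the PyTorch function):

```python
import math


def softmax(logits):
    """Softmax over a list of floats, shifted by the max for numerical stability."""
    shift = max(logits)
    exps = [math.exp(v - shift) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


logits = [0.5, -1.0, 2.0, 0.0]   # illustrative values, not taken from the model
probs = softmax(logits)

print(probs)
print(sum(probs))
```

The largest logit always receives the largest probability, so taking the argmax before or after softmax gives the same class.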

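Stacking Layers

Example source: https://pytorch.org/tutorials/beginner/pytorch_with_examples.html#id23

The difference between nn and nn.functional

nn provides layers as modules that hold their learnable parameters.
A simple way to think of nn.functional is as the version without learnable parameters.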
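The distinction can be illustrated with a short sketch, assuming the same Conv2d arguments used earlier in this post and a random input. The module stores its own weight and bias, while the functional form expects them to be passed in explicitly.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

conv = nn.Conv2d(1, 20, 5, 1)

# The module owns its parameters: 20*1*5*5 weights + 20 biases.
n_params = sum(p.numel() for p in conv.parameters())
print(n_params)  # 520

x = torch.randn(1, 1, 28, 28)   # random stand-in for an image batch

# The functional version is stateless: weight and bias are arguments.
out_module = conv(x)
out_functional = F.conv2d(x, conv.weight, conv.bias, stride=1)
same = torch.allclose(out_module, out_functional)
print(same)

# Parameter-free operations such as ReLU also exist as modules,
# but they hold nothing to learn.
relu_params = len(list(nn.ReLU().parameters()))
print(relu_params)  # 0
```

This is why pooling and activations are typically called through F inside forward(), while convolution and linear layers are created in __init__.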

class Net(nn.Module):
    def __init__(self):
        super(Net, self).__init__()
        self.conv1 = nn.Conv2d(1,20,5,1)
        self.conv2 = nn.Conv2d(20, 50, 5, 1)
        self.fc1 = nn.Linear(4*4*50, 500)
        self.fc2 = nn.Linear(500, 10)
        
    def forward(self, x):
        # Feature Extraction
        x = F.relu(self.conv1(x))
        x = F.max_pool2d(x, 2, 2)
        x = F.relu(self.conv2(x))
        x = F.max_pool2d(x, 2, 2)
        
        print(x.shape)         # check the shape before flattening
        # Fully Connected (Classification)
        x = x.view(-1, 4*4*50)     # (batch_size, flatten_size)
        x = F.relu(self.fc1(x))
        x = self.fc2(x)
        return F.log_softmax(x, dim=1)

Feeding the image into the model and checking the result

model = Net()

result = model(image)

torch.Size([1, 50, 4, 4])

result

tensor([[-2.3085, -2.2816, -2.2852, -2.3429, -2.3122, -2.3436, -2.2811, -2.2661,
         -2.2934, -2.3144]], grad_fn=<LogSoftmaxBackward>)

image.shape

torch.Size([1, 1, 28, 28])

model.conv1(image).shape

torch.Size([1, 20, 24, 24])
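The 4*4*50 passed to fc1 can also be derived without running the model. The sketch below traces the spatial size through the layers of Net, assuming no padding and no dilation; `conv_out` and `pool_out` are hypothetical helpers written for this illustration.

```python
def conv_out(size, kernel, stride=1, padding=0):
    """Spatial output size of a convolution (no dilation)."""
    return (size + 2 * padding - kernel) // stride + 1


def pool_out(size, kernel, stride):
    """Spatial output size of a pooling layer (no padding)."""
    return (size - kernel) // stride + 1


trace = [28]                                   # MNIST input is 28x28
trace.append(conv_out(trace[-1], kernel=5))    # conv1
trace.append(pool_out(trace[-1], 2, 2))        # max_pool2d
trace.append(conv_out(trace[-1], kernel=5))    # conv2
trace.append(pool_out(trace[-1], 2, 2))        # max_pool2d

size = trace[-1]
flatten_size = size * size * 50                # 50 = out_channels of conv2

print(trace)         # [28, 24, 12, 8, 4]
print(flatten_size)  # 800
```

This matches the torch.Size([1, 50, 4, 4]) printed inside forward().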
