[Pytorch] Module Practice

Tensor 초기화

텐서 생성

데이터로부터 텐서 생성

import torch
import numpy as np

data = [[1,2],[3,4]]
x_data = torch.tensor(data)

x_data

tensor([[1, 2],
        [3, 4]])

Numpy 배열로부터 생성하기

np_array = np.array(data)
x_np = torch.from_numpy(np_array)

x_np

tensor([[1, 2],
        [3, 4]])

다른 텐서 모양 따오기

x_ones = torch.ones_like(x_data)
x_ones

tensor([[1, 1],
        [1, 1]])

x_rand = torch.rand_like(x_data,dtype = torch.float)
x_rand

tensor([[0.2966, 0.8495],
        [0.4505, 0.5238]])

x = torch.randn(3,4)
x

tensor([[-1.1679, -0.0261, -0.9898,  0.2107],
        [ 1.6062,  1.5683, -0.2032,  2.2659],
        [ 3.0382,  0.5172,  0.0423, -1.8426]])

텐서 연산

텐서 사칙연산

텐서 사칙연산

+ : torch.add
- : torch.sub
* : torch.mul
/ : torch.div

a = torch.tensor([5])
b = torch.tensor([7])

torch.add(a,b)

tensor([12])

a + b

tensor([12])

피 연산자중 하나라고 tensor 로 입력되면 결과는 tensor

a + 7

tensor([12])

텐서 인덱싱

torch.index_select - PyTorch 공식 문서

torch.index_select(input, dim, index, *, out=None) → Tensor

tensor를 원하는 모양으로 바꾸고 싶을때 torch.Tensor.view라는 함수를 사용하자

A = torch.Tensor([[1, 2],
                  [3, 4]])

# TODO : [1, 3]을 만드세요!

# torch.index_select 함수를 써서 해보세요!
output = torch.index_select(A.t(),0,torch.tensor([0]))

output

tensor([[1., 3.]])

A = torch.Tensor([[1, 2],
                  [3, 4]])

# torch.gather 함수를 써서 해보세요!
output = torch.gather(A,0,torch.tensor([[0,1]]))

torch.nn

nn.Linear

nn.linear -> linear transform 구현

공식 문서에는 들어오는 float32 형 input 데이터에 대해 y = wx + b 형태의 선형 변환을 수행하는 메소드이다

parameter

in-features : size of each input sample
out-features: size of each output sample
bias : if set to False the layer will not learn an additive bias default = True

그냥 차원을 맞춰주는 함수가 아닌가?

from torch import nn

X = torch.Tensor([[1,2],[3,4]])
m = nn.Linear(2,5)
output = m(X)
print(output.size())

torch.Size([2, 5])

nn.Module

nn.Module이라는 상자에 기능들을 가득 모아놓은 경우 basic building block
nn.Module이라는 상자에 basic building block인 nn.Module들을 가득 모아놓은 경우 딥러닝 모델
nn.Module이라는 상자에 딥러닝 모델인 nn.Module들을 가득 모아놓은 경우 더욱 큰 딥러닝 모델

pytorch 를 통한 신경망 모델 설계

Design yout model using class with variables
construct loss and optim
Train cycle(forward,backward,update)

torch.nn.Module 상속 해야한다
__init__ forward() override

__init__ : 모델에서 사용될 module(nn.Linear,nn.Conv2d), activation function 정의
forward(): 실행할 계산을 정의 backward()는 알아서 해준다고 함
input 을 넣어 어떤 계산을 진행하여 output을 출력할지 정의한다고 생각

nn.Module
pytorch 의 nn 라이브러리는 모든 신경망 모델의 baseclass

init(self): 이 모델에 사용될 구성품들을 연결하는 메소드
forward(self,x): init에서 정의된 구성품들을 연결하는 메소드

모델의 클래스를 통한 구현

import torch
import torch.nn as nn
import torch.nn.functional as F

#model = nn.Linear(1,1) 의 단순 선형 모델을 class 로 구현해보자
class LinearRegressionModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(1,1)# 선형 회귀 함수의 정의

    def forward(self,x):
        return self.linear(x)#__init__에서 선언한 연산 실행 

model = LinearRegressionModel()

x 로부터 예측된 y 를 얻는 것을 forward 연산이라고 한다

class MLP(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(3,1)

    def forward(self,x):
        return self.linear(x)

단순 선형 회귀 클래스로 구현하기

nn.parameter

Tensor 객체의 상속 개체
nn.Module 내에 attribute 될때는 required_grad = True로 지정되어 학습 대상이 되는 tensor
def__init__ 의 in_features 객체에 들어가 있는 것이 weight 의 개수가 된다

x_train = torch.FloatTensor([[1],[2],[3]])
y_train = torch.FloatTensor([[2],[4],[6]])

class LinearRegressionModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(1,1)

    def forward(self,x):
        return self.linear(x)

optimizer 설정 -> 경사하강법 sgd 를 사용하고 learning rate 는 0.01 로 설정

model = LinearRegressionModel()
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)

nb_epochs : 반복 횟수 prediction 은 생성한 model 안에 train data 를 넣으면 나온다

mse.loss 는 pytorch 내에서 지원하는 평균 제곱 오차 함수
이걸 사용해 loss 함수를 구한 뒤에 loss 함수에 대해 backward 진행한다

epoch 내의 기본적으로 있어야 하는 3가지

optimizer.zero_grad() -> gradient 를 초기화 해준다
초기화 해주어야 grad가 쌓이지 않는다
cost.backward() -> backward 연산
optimizer.step() -> backward 연산으로 구한 grad 를 업데이트 해준다

nb_epochs = 200
for epoch in range(nb_epochs+1):
    prediction = model(x_train)

    cost = F.mse_loss(prediction,y_train)

    optimizer.zero_grad()

    cost.backward()

    optimizer.step()

    if epoch % 10 == 0:
        print('epoch {:4d}/{} cost: {:.6f}'.format(epoch,nb_epochs,cost.item()))

epoch    0/200 cost: 28.346628
epoch   10/200 cost: 2.699617
epoch   20/200 cost: 0.257504
epoch   30/200 cost: 0.024947
epoch   40/200 cost: 0.002783
epoch   50/200 cost: 0.000653
epoch   60/200 cost: 0.000432
epoch   70/200 cost: 0.000394
epoch   80/200 cost: 0.000374
epoch   90/200 cost: 0.000356
epoch  100/200 cost: 0.000339
epoch  110/200 cost: 0.000323
epoch  120/200 cost: 0.000308
epoch  130/200 cost: 0.000294
epoch  140/200 cost: 0.000280
epoch  150/200 cost: 0.000267
epoch  160/200 cost: 0.000254
epoch  170/200 cost: 0.000242
epoch  180/200 cost: 0.000231
epoch  190/200 cost: 0.000220
epoch  200/200 cost: 0.000210

다중 선형 회귀 클래스로 구현하기

data

x_train = torch.FloatTensor([[73, 80, 75],
                             [93, 88, 93],
                             [89, 91, 90],
                             [96, 98, 100],
                             [73, 66, 70]])
y_train = torch.FloatTensor([[152], [185], [180], [196], [142]])

model 생성

class MultivariateLinearRegressionModel(nn.Module):
    def __init__(self):
        super().__init__()
        self.linear = nn.Linear(3,1)

    def forward(self,x):
        return self.linear(x)

model 선언 및 optimizer 설정

model = MultivariateLinearRegressionModel()
optimizer = torch.optim.SGD(model.parameters(),lr = 1e-5)

train

nb_epochs = 200
for epoch in range(nb_epochs + 1):

    prediction = model(x_train)
    cost = F.mse_loss(prediction,y_train)

    optimizer.zero_grad()
    cost.backward()
    optimizer.step()

    if epoch % 10 == 0:
        print('epoch {:4d}/{} cost: {:.6f}'.format(epoch,nb_epochs,cost.item()))

epoch    0/200 cost: 0.535162
epoch   10/200 cost: 0.534934
epoch   20/200 cost: 0.534713
epoch   30/200 cost: 0.534487
epoch   40/200 cost: 0.534268
epoch   50/200 cost: 0.534037
epoch   60/200 cost: 0.533809
epoch   70/200 cost: 0.533590
epoch   80/200 cost: 0.533369
epoch   90/200 cost: 0.533141
epoch  100/200 cost: 0.532918
epoch  110/200 cost: 0.532680
epoch  120/200 cost: 0.532469
epoch  130/200 cost: 0.532238
epoch  140/200 cost: 0.532019
epoch  150/200 cost: 0.531801
epoch  160/200 cost: 0.531575
epoch  170/200 cost: 0.531355
epoch  180/200 cost: 0.531141
epoch  190/200 cost: 0.530917
epoch  200/200 cost: 0.530699

DDOING

[Pytorch] Module Practice

Tensor 초기화

텐서 생성

텐서 연산

텐서 사칙연산

텐서 인덱싱

torch.nn

nn.Linear

nn.Module

모델의 클래스를 통한 구현

단순 선형 회귀 클래스로 구현하기

다중 선형 회귀 클래스로 구현하기

참고

2024.04.30
[Paper] Paint by Example: Exemplar-based Image Editing with Diffusion Models

2024.04.17
[Paper] DEVA: Anything with Decoupled Video Segmentation

2024.04.14
[Paper] Video Panoptic Segmentation

2024.04.12
[Paper] Video Object Segmentation with Episodic Graph Memory Networks

DDOING

Tensor 초기화

텐서 생성

텐서 연산

텐서 사칙연산

텐서 인덱싱

torch.nn

nn.Linear

nn.Module

모델의 클래스를 통한 구현

단순 선형 회귀 클래스로 구현하기

다중 선형 회귀 클래스로 구현하기

참고

2024.04.30 [Paper] Paint by Example: Exemplar-based Image Editing with Diffusion Models

2024.04.17 [Paper] DEVA: Anything with Decoupled Video Segmentation

2024.04.14 [Paper] Video Panoptic Segmentation

2024.04.12 [Paper] Video Object Segmentation with Episodic Graph Memory Networks

2024.04.30
[Paper] Paint by Example: Exemplar-based Image Editing with Diffusion Models

2024.04.17
[Paper] DEVA: Anything with Decoupled Video Segmentation

2024.04.14
[Paper] Video Panoptic Segmentation

2024.04.12
[Paper] Video Object Segmentation with Episodic Graph Memory Networks