DL in Python/TensorFlow 2.x Basics
[TensorFlow 2.x Basics - 5] Going deeper into optimization, loss functions, and training with the MNIST data
SuHawn
2020. 9. 1. 17:03
TensorFlow: Optimization & Training (Expert)¶
https://www.tensorflow.org/
Let's work through the Expert version described on the official site.
In [20]:
import tensorflow as tf
from tensorflow.keras import layers
from tensorflow.keras import datasets
from tensorflow.keras.optimizers import Adam, SGD
Reviewing the Training Process¶
Build Model¶
In [3]:
input_shape = (28, 28, 1)
num_classes = 10
In [4]:
inputs = layers.Input(input_shape, dtype=tf.float64)
net = layers.Conv2D(32, (3, 3), padding='SAME')(inputs)
net = layers.Activation('relu')(net)
net = layers.Conv2D(32, (3, 3), padding='SAME')(net)
net = layers.Activation('relu')(net)
net = layers.MaxPooling2D(pool_size=(2, 2))(net)
net = layers.Dropout(0.5)(net)
net = layers.Conv2D(64, (3, 3), padding='SAME')(net)
net = layers.Activation('relu')(net)
net = layers.Conv2D(64, (3, 3), padding='SAME')(net)
net = layers.Activation('relu')(net)
net = layers.MaxPooling2D(pool_size=(2, 2))(net)
net = layers.Dropout(0.5)(net)
net = layers.Flatten()(net)
net = layers.Dense(512)(net)
net = layers.Activation('relu')(net)
net = layers.Dropout(0.5)(net)
net = layers.Dense(num_classes)(net)
net = layers.Activation('softmax')(net)
model = tf.keras.Model(inputs=inputs, outputs=net, name='Basic_CNN')
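With the model assembled, a model.summary() call (added here as a quick check, not part of the original post) confirms the layer stack and parameter counts before moving on:
In [ ]:
# Quick check: print the Basic_CNN architecture and parameter counts
model.summary()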
Preprocess¶
The expert approach described on the official TensorFlow site:
- Use tf.data
In [5]:
mnist = tf.keras.datasets.mnist
# Load Data from MNIST
(x_train, y_train), (x_test, y_test) = mnist.load_data()
# Add a channel dimension
x_train = x_train[..., tf.newaxis]
x_test = x_test[..., tf.newaxis]
# Data Normalization
x_train, x_test = x_train / 255.0, x_test / 255.0
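A quick sanity check (a sketch added here, not in the original post) shows what the two steps above did: each image gained a trailing channel axis and the pixel values were scaled into [0, 1]:
In [ ]:
# Verify shapes and value range after adding the channel axis and normalizing
print(x_train.shape, x_test.shape)                  # (60000, 28, 28, 1) (10000, 28, 28, 1)
print(x_train.dtype, x_train.min(), x_train.max())  # float64 0.0 1.0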
tf.data¶
- from_tensor_slices()
- shuffle()
- batch()
In [6]:
train_ds = tf.data.Dataset.from_tensor_slices((x_train, y_train))
train_ds = train_ds.shuffle(1000)
train_ds = train_ds.batch(32)
test_ds = tf.data.Dataset.from_tensor_slices((x_test, y_test))
test_ds = test_ds.batch(32)
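To make shuffle(1000) and batch(32) concrete, here is a toy pipeline (an illustration added here, not from the original notebook): shuffle fills a buffer of the given size and draws random elements from it, and batch groups consecutive elements into fixed-size chunks.
In [ ]:
# Toy illustration of shuffle() and batch() on ten integers
toy_ds = tf.data.Dataset.from_tensor_slices(tf.range(10))
toy_ds = toy_ds.shuffle(buffer_size=10).batch(4)
for batch in toy_ds:
    print(batch.numpy())   # e.g. [7 2 9 0] [4 1 8 3] [6 5] -- order changes every run
In practice the real pipeline could also end with .prefetch(tf.data.experimental.AUTOTUNE) to overlap data preparation with training, but the code above works as written.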
Visualize Data¶
Import matplotlib and visualize the data.
In [7]:
import matplotlib.pyplot as plt
%matplotlib inline
train_ds.take(n) returns a new dataset containing the first n batches; each image batch here has shape (32, 28, 28, 1).
In [9]:
# Take two batches from the dataset and display the first image of each
for image, label in train_ds.take(2):
    plt.title(str(label[0].numpy()))
    plt.imshow(image[0, :, :, 0], 'gray')
    plt.show()
In [10]:
# Take one batch (32 samples) from the dataset
image, label = next(iter(train_ds))
In [11]:
image.shape, label.shape
Out[11]:
Training (Keras)¶
Training with Keras works the same as before; since train_ds is a tf.data.Dataset (an iterable that yields batches, much like a generator), it can be passed to fit() directly.
In [12]:
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy')
model.fit(train_ds, epochs=1)
Out[12]:
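A slightly fuller variant of the same call (a sketch, not from the original post) also tracks accuracy and validates on test_ds each epoch, since fit() accepts a Dataset for validation_data too:
In [ ]:
# Sketch: add an accuracy metric and validate on the test dataset while training
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',
              metrics=['accuracy'])
model.fit(train_ds, validation_data=test_ds, epochs=1)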
Optimization¶
- Loss Function
- Optimizer
In [28]:
loss_object = tf.keras.losses.SparseCategoricalCrossentropy()
optimizer = tf.keras.optimizers.Adam()
- A place to accumulate the loss values
- Metrics
In [22]:
train_loss = tf.keras.metrics.Mean(name='train_loss')
train_accuracy = tf.keras.metrics.SparseCategoricalAccuracy(name='train_accuracy')
test_loss = tf.keras.metrics.Mean(name='test_loss')
test_accuracy = tf.keras.metrics.SparseCategoricalAccuracy(name='test_accuracy')
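To see why these metric objects are convenient in a custom loop, here is a small illustration (added here, not in the original post) of how a Mean metric accumulates values across calls and is cleared with reset_states():
In [ ]:
# Illustration: Mean keeps a running average until reset_states() clears it
demo_loss = tf.keras.metrics.Mean(name='demo_loss')   # 'demo_loss' is just an example name
demo_loss(2.0)
demo_loss(4.0)
print(demo_loss.result().numpy())   # 3.0 -- mean of the values seen so far
demo_loss.reset_states()            # typically called at the start of each epoch
print(demo_loss.result().numpy())   # 0.0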
Training¶
@tf.function - instead of running eagerly, the decorated function is traced into a graph (much like the old session-based style) and that graph is executed once training starts.
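A minimal sketch of that tracing behaviour (added here for illustration; square() is just a made-up example function): the Python body runs while the graph is traced, and later calls with the same input signature reuse the compiled graph.
In [ ]:
# The print() is a Python side effect, so it fires only during tracing
@tf.function
def square(x):
    print('tracing with', x)
    return x * x

print(square(tf.constant(2.0)).numpy())   # traces the graph, then prints 4.0
print(square(tf.constant(3.0)).numpy())   # same signature: no retracing, prints 9.0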
In [32]:
@tf.function
def train_step(images, labels):
    with tf.GradientTape() as tape:
        predictions = model(images, training=True)  # training=True so Dropout is active
        loss = loss_object(labels, predictions)
    gradients = tape.gradient(loss, model.trainable_variables)
    optimizer.apply_gradients(zip(gradients, model.trainable_variables))
    train_loss(loss)
    train_accuracy(labels, predictions)
In [36]:
@tf.function
def test_step(images, labels):
    predictions = model(images, training=False)  # inference mode: Dropout disabled
    t_loss = loss_object(labels, predictions)
    test_loss(t_loss)
    test_accuracy(labels, predictions)
In [37]:
for epoch in range(2):
    print('Start Training')
    # Reset the metric accumulators so each epoch reports its own statistics
    train_loss.reset_states()
    train_accuracy.reset_states()
    test_loss.reset_states()
    test_accuracy.reset_states()
    for images, labels in train_ds:
        train_step(images, labels)
    for test_images, test_labels in test_ds:
        test_step(test_images, test_labels)
    template = 'Epoch {}, Loss: {}, Accuracy: {}, Test Loss: {}, Test Accuracy: {}'
    print(template.format(epoch + 1, train_loss.result(), train_accuracy.result() * 100,
                          test_loss.result(), test_accuracy.result() * 100))