728x90

optimizer 3

[Deep Learning] ํ”„๋ ˆ์ž„์›Œํฌ ํ™•์žฅ ์ฝ”๋“œ ๊ตฌํ˜„

์ด์ „ deep learning ํด๋ž˜์Šค ์ฝ”๋“œ ๊ตฌํ˜„๊ณผ ์ด์–ด์ง€๋Š” ๋‚ด์šฉ์ด๋ฏ€๋กœ, ์ด์ „ ๊ธ€ ๋จผ์ € ํ™•์ธํ•˜๋Š” ๊ฒƒ์ด ์ข‹์Šต๋‹ˆ๋‹ค. https://heejins.tistory.com/36 float: # ๊ฐ ํ–‰(๊ด€์ฐฐ์— ํ•ด๋‹น)์— softmax ํ•จ์ˆ˜ ์ ์šฉ softmax_preds = softmax(self.prediction, axis = 1) # ์†์‹ค๊ฐ’์ด ๋ถˆ์•ˆ์ •ํ•ด์ง€๋Š” ๊ฒƒ์„ ๋ง‰๊ธฐ ์œ„ํ•ด softmax ํ•จ์ˆ˜์˜ ์ถœ๋ ฅ๊ฐ’ ๋ฒ”์œ„๋ฅผ ์ œํ•œ self.softmax_preds = np.clip(softmax_preds, self.eps, 1 - self.eps) # ์‹ค์ œ ์†์‹ค๊ฐ’ ๊ณ„์‚ฐ ์ˆ˜ํ–‰ softmax_cross_entropy_loss = ( -1.0 * self.target * np.log(self.softmax_preds) - (1.0 - s..

Deep Learning 2022.11.24

[Deep Learning] ๋”ฅ๋Ÿฌ๋‹ ํด๋ž˜์Šค ์ฝ”๋“œ ๊ตฌํ˜„(์—ฐ์‚ฐ, layer, neuralnetwork, ๋ฐฐ์น˜ํ•™์Šต, optimizer)

import numpy as np from numpy import ndarray from typing import * def assert_same_shape(array: ndarray, array_grad: ndarray): assert array.shape == array_grad.shape, \ f""" ๋‘ ndarray์˜ ๋ชจ์–‘์ด ๊ฐ™์•„์•ผ ํ•˜๋Š”๋ฐ, ์ฒซ ๋ฒˆ์งธ ndarray์˜ ๋ชจ์–‘์€ {tuple(array_grad.shape)}์ด๊ณ , ๋‘ ๋ฒˆ์งธ ndarray์˜ ๋ชจ์–‘์€ {typle(array.shape)}์ด๋‹ค. """ return None - ์‹ ๊ฒฝ๋ง ๊ตฌ์„ฑ ์š”์†Œ: ์—ฐ์‚ฐ Operation ํด๋ž˜์Šค class Operation(object): """ ์‹ ๊ฒฝ๋ง ๋ชจ๋ธ์˜ ์—ฐ์‚ฐ ์—ญํ• ์„ ํ•˜๋Š” ๊ธฐ๋ฐ˜ ํด๋ž˜์Šค """ def __in..

Deep Learning 2022.11.23

[Deep Learning]Optimizer ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ฐฑ์‹ 

๋งค๊ฐœ๋ณ€์ˆ˜ ๊ฐฑ์‹  ์‹ ๊ฒฝ๋ง ํ•™์Šต์˜ ๋ชฉ์ ์€ ์†์‹ค ํ•จ์ˆ˜์˜ ๊ฐ’์„ ๊ฐ€๋Šฅํ•œ ํ•œ ๋‚ฎ์ถ”๋Š” ๋งค๊ฐœ๋ณ€์ˆ˜๋ฅผ ์ฐพ๋Š” ๊ฒƒ ์ด๊ฒƒ์€ ๋งค๊ฐœ๋ณ€์ˆ˜์˜ ์ตœ์ ๊ฐ’์„ ์ฐพ๋Š” ๋ฌธ์ œ์ด๋ฉฐ, ์ด๋Ÿฌํ•œ ๋ฌธ์ œ๋ฅผ ํ‘ธ๋Š” ๊ฒƒ์„ ์ตœ์ ํ™”๋ผ ํ•จ. ํ™•๋ฅ ์  ๊ฒฝ์‚ฌ ํ•˜๊ฐ•๋ฒ•(SGD) class SGD: def __init__(self, lr = 0.01): self.lr = lr def update(self, params, grads): for key in params.keys(): params[key] -= self.lr * grads[key] optimizer๋Š” '์ตœ์ ํ™”๋ฅผ ํ–‰ํ•˜๋Š” ์ž'๋ผ๋Š” ๋œป ๋งค๊ฐœ๋ณ€์ˆ˜ ๊ฐฑ์‹ ์€ optimizer๊ฐ€ ์ฑ…์ž„์ง€๊ณ  ์ˆ˜ํ–‰ํ•˜๋‹ˆ optimizer์— ๋งค๊ฐœ๋ณ€์ˆ˜์™€ ๊ธฐ์šธ๊ธฐ ์ •๋ณด๋งŒ ๋„˜๊ฒจ์ฃผ๋ฉด ๋จ. SGD์˜ ๋‹จ์  ํ•จ์ˆ˜์˜ ๊ทธ๋ž˜ํ”„์™€ ๋“ฑ๊ณ ์„  [SGD์— ์˜ํ•œ ์ตœ์ ํ™” ๊ฐฑ์‹  ๊ฒฝ๋กœ: ์ตœ์†Ÿ๊ฐ’..

Deep Learning 2022.11.13
728x90