
Adam: Adaptive Moment Estimation


** Original Paper

- ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION (Kingma & Ba, ICLR 2015)


** A well-organized overview (blog)

http://shuuki4.github.io/deep%20learning/2016/05/20/Gradient-Descent-Algorithm-Overview.html


** Explanation

Adam is an algorithm that combines the RMSProp and Momentum methods.

Like Momentum, it keeps an exponential moving average of the gradient:

\( m_{t} = \beta_{1}m_{t-1} + (1 - \beta_{1})\nabla_{\theta}J(\theta) \)

Like RMSProp, it keeps an exponential moving average of the squared gradient:

\( v_{t} = \beta_{2}v_{t-1} + (1 - \beta_{2})(\nabla_{\theta}J(\theta))^2 \)

Because \( m_{0} = 0 \) and \( v_{0} = 0 \), both averages stay close to zero early in training, so bias-corrected estimates are used:

\( \hat{m}_{t} = \frac{m_{t}}{1-\beta_{1}^{t}} \)

\( \hat{v}_{t} = \frac{v_{t}}{1-\beta_{2}^{t}} \)
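
Using these bias-corrected estimates, the parameter update in the paper is

\( \theta_{t+1} = \theta_{t} - \frac{\eta}{\sqrt{\hat{v}_{t}} + \epsilon}\hat{m}_{t} \)

Below is a minimal NumPy sketch of a single Adam step; the function and variable names are illustrative, not taken from the paper or any particular framework.

import numpy as np

def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    # Exponential moving average of the gradient (Momentum-style)
    m = beta1 * m + (1 - beta1) * grad
    # Exponential moving average of the squared gradient (RMSProp-style)
    v = beta2 * v + (1 - beta2) * grad ** 2
    # Bias correction; t is the 1-based iteration count
    m_hat = m / (1 - beta1 ** t)
    v_hat = v / (1 - beta2 ** t)
    # Parameter update
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v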


** Usage examples

* LeNet

base_lr: 0.001
momentum: 0.9
momentum2: 0.999

(In Caffe's Adam solver, momentum and momentum2 correspond to \( \beta_{1} \) and \( \beta_{2} \).)


* DCGAN

lr = 0.0002
beta1 = 0.5
beta2 = 0.999
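
As a quick check, these settings can be dropped into the Adam step sketch above (a hypothetical usage line, not the DCGAN authors' code):

theta, m, v = adam_step(theta, grad, m, v, t, lr=0.0002, beta1=0.5, beta2=0.999)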


* lenet_solver_adam.prototxt

# The train/test net protocol buffer definition
# this follows "ADAM: A METHOD FOR STOCHASTIC OPTIMIZATION"
net: "examples/mnist/lenet_train_test.prototxt"
# test_iter specifies how many forward passes the test should carry out.
# In the case of MNIST, we have test batch size 100 and 100 test iterations,
# covering the full 10,000 testing images.
test_iter: 100
# Carry out testing every 500 training iterations.
test_interval: 500
# All parameters are from the cited paper above
base_lr: 0.001
momentum: 0.9
momentum2: 0.999
# since Adam dynamically changes the learning rate, we set the base learning
# rate to a fixed value
lr_policy: "fixed"
# Display every 100 iterations
display: 100
# The maximum number of iterations
max_iter: 10000
# snapshot intermediate results
snapshot: 5000
snapshot_prefix: "examples/mnist/lenet"
# solver mode: CPU or GPU
type: "Adam"
solver_mode: GPU
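
Assuming the standard Caffe source tree, this solver can be run with the caffe command-line tool, e.g. ./build/tools/caffe train --solver=examples/mnist/lenet_solver_adam.prototxt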
