Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
|
 |
Efficient BackProp
| |
|
Yann LeCun6 , Leon Bottou6 , Genevieve B. Orr7 and Klaus -Robert Müller8 
| (6) |
Image Processing Research Department AT&T Labs - Research, 100 Schulz Drive, RedBank, NJ 07701-7033, USA |
| (7) |
Willamette University, 900 State Street, Salem, OR 97301, USA |
| (8) |
GMD FIRST, Rudower Chaussee 5, 12489 Berlin, Germany |
Abstract
The convergence of back-propagation learning is analyzed so as to explain common phenomenon observedb y practitioners. Many
undesirable behaviors of backprop can be avoided with tricks that are rarely exposedin serious technical publications. This
paper gives some of those tricks, ando.ers explanations of why they work. Many authors have suggested that second-order optimization
methods are advantageous for neural net training. It is shown that most “classical” second-order methods are impractical for
large neural networks. A few methods are proposed that do not have these limitations.
Fulltext Preview (Small, Large)
 References secured to subscribers.
|
|
|
|
|
|