Welcome!
To use the personalized features of this site, please log in or register.
If you have forgotten your username or password, we can help.
My Menu
Saved Items

2. Efficient BackProp

Yann LeCunContact Information, Leon BottouContact Information, Genevieve B. OrrContact Information and Klaus -Robert MüllerContact Information

(6)  Image Processing Research Department AT&T Labs - Research, 100 Schulz Drive, RedBank, NJ 07701-7033, USA
(7)  Willamette University, 900 State Street, Salem, OR 97301, USA
(8)  GMD FIRST, Rudower Chaussee 5, 12489 Berlin, Germany
Abstract
The convergence of back-propagation learning is analyzed so as to explain common phenomenon observedb y practitioners. Many undesirable behaviors of backprop can be avoided with tricks that are rarely exposedin serious technical publications. This paper gives some of those tricks, ando.ers explanations of why they work. Many authors have suggested that second-order optimization methods are advantageous for neural net training. It is shown that most “classical” second-order methods are impractical for large neural networks. A few methods are proposed that do not have these limitations.

Contact Information Yann LeCun
Email: yann@research.att.com

Contact Information Leon Bottou
Email: leonb@research.att.com

Contact Information Genevieve B. Orr
Email: gorr@willamette.edu

Contact Information Klaus -Robert Müller
Email: klaus@first.gmd.de
Fulltext Preview (Small, Large)
Image of the first page of the fulltext

References secured to subscribers.



Export this chapter
Export this chapter as RIS | Text
 
Referenced by
10 newer articles

  1. (2006) Writer Independent Online Handwritten Character Recognition Using a Simple Approach . Information Technology Journal 5(3)
    [CrossRef]
  2. Stack, J.R. (2009) . IEEE Transactions on Neural Networks 20(3)
    [CrossRef]
  3. Chen, Lei (2008) Convergence analysis of convex incremental neural networks. Annals of Mathematics and Artificial Intelligence
    [CrossRef]
  4. Liang, Nan-Ying (2006) . IEEE Transactions on Neural Networks 17(6)
    [CrossRef]
  5. García-Pedrajas, Nicolás (2008) Construction of classifier ensembles by means of artificial immune systems. Journal of Heuristics 14(3)
    [CrossRef]
  6. Sy, Nestor L (2006) Modelling the infiltration process with a multi-layer perceptron artificial neural network / Modelisation du processus dinfiltration avec un reseau de neurones artificiel de type perceptron multi-couches. Hydrological Sciences Journal/Journal des Sciences Hydrologiques 51(1)
    [CrossRef]
  7. Wan, Sheng (2006) . IEEE Transactions on Neural Networks 17(6)
    [CrossRef]
  8. Feng Ning (2005) . IEEE Transactions on Image Processing 14(9)
    [CrossRef]
  9. Garcia-Pedrajas, N. (2005) Cooperative Coevolution of Artificial Neural Network Ensembles for Pattern Classification. IEEE Transactions on Evolutionary Computation 9(3)
    [CrossRef]
  10. Gutierrez-Osuna, R. (2002) Pattern analysis for machine olfaction: a review. IEEE Sensors Journal 2(3)
    [CrossRef]
Remote Address: 38.107.191.107 • Server: mpweb20
HTTP User Agent: CCBot/1.0 (+http://www.commoncrawl.org/bot.html)