| Neural network | |
|---|---|
| Name | Neural Network |
| Inspired by | Biological neurons |
| Inventor | Warren McCulloch, Walter Pitts |
| Year invented | 1943 |
| Related concepts | Machine learning, Deep learning, Artificial intelligence |
Neural network. A computational model inspired by the structure and functional aspects of biological nervous systems, designed to recognize patterns and solve complex problems. These networks form a foundational component of modern machine learning and artificial intelligence, enabling advancements in fields from computer vision to natural language processing. Their architecture typically consists of interconnected layers of simple processing units that collectively learn from data.
A neural network is a system of algorithms that attempts to identify underlying relationships in a set of data through a process that mimics the operation of the human brain. The core structure consists of layers of interconnected nodes, or "neurons," which process input data to produce an output. This architecture is central to many contemporary AI systems developed by organizations like Google Brain, DeepMind, and OpenAI. The field has evolved significantly from early theoretical models to powering practical applications such as the recommendation algorithms at Netflix and the image recognition systems at Facebook.
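The individual node described above can be sketched in a few lines of Python: each neuron computes a weighted sum of its inputs plus a bias, then applies a nonlinear activation. The specific weights, bias, and choice of a sigmoid activation below are illustrative assumptions, not taken from any particular system:

```python
import math

def neuron(inputs, weights, bias):
    """A single artificial neuron: weighted sum of inputs plus a bias,
    squashed into (0, 1) by a sigmoid activation."""
    z = sum(w * x for w, x in zip(weights, inputs)) + bias
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical inputs and parameters, for illustration only.
out = neuron([1.0, 0.5], [0.4, -0.2], 0.1)  # z = 0.4, sigmoid(0.4) ~ 0.599
```

A network stacks many such units, with each neuron's output feeding the inputs of neurons in the next layer.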
The design of artificial neural networks is directly inspired by the biological neural networks found in animal brains, particularly the cerebral cortex. Key biological concepts like the neuron, axon, dendrite, and synapse provide the blueprint for artificial units and their connections. Pioneering work by Santiago Ramón y Cajal in illustrating the nervous system's structure provided early insights. The fundamental idea is that complex cognitive functions emerge from networks of simple, interconnected units, a principle observed in the neural circuitry of organisms studied from Aplysia to Homo sapiens.
Numerous architectural models have been developed, each suited to different types of data and tasks. The feedforward neural network, including the multilayer perceptron, is the basic and most widely used model. For sequential data like speech or text, recurrent neural networks and their more advanced variants, such as long short-term memory (LSTM) networks introduced by Sepp Hochreiter and Jürgen Schmidhuber, are essential. Convolutional neural networks, greatly advanced by researchers like Yann LeCun, have become the standard for processing grid-like data such as images, leading to breakthroughs in competitions like the ImageNet Large Scale Visual Recognition Challenge.
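A minimal feedforward network of the multilayer-perceptron kind can be sketched by chaining fully connected layers, where every neuron in one layer receives every output of the previous layer. The layer sizes and weight values below are arbitrary illustrations:

```python
import math

def layer(inputs, weights, biases):
    """Fully connected layer: each output neuron takes a weighted sum
    of all inputs plus its bias, then applies a sigmoid activation."""
    def sigmoid(z):
        return 1.0 / (1.0 + math.exp(-z))
    return [sigmoid(sum(w * x for w, x in zip(ws, inputs)) + b)
            for ws, b in zip(weights, biases)]

# Hypothetical 2-input -> 2-hidden -> 1-output network with fixed weights.
hidden = layer([0.5, -1.0], [[0.8, 0.2], [-0.4, 0.9]], [0.0, 0.1])
output = layer(hidden, [[1.5, -1.1]], [0.3])
```

Recurrent and convolutional architectures replace this dense connectivity with, respectively, connections across time steps and small filters shared across spatial positions.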
Learning, or training, is the process by which a network adjusts its internal parameters based on data. Supervised learning, which uses labeled datasets, is common for tasks like classification and is typically optimized with the backpropagation algorithm. Unsupervised learning methods, such as the generative adversarial networks (GANs) introduced by Ian Goodfellow, find patterns in unlabeled data. Reinforcement learning, in which an agent learns by interacting with an environment, has been successfully combined with deep neural networks by teams at DeepMind to master complex games like Go and StarCraft II.
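As a minimal illustration of supervised training, the sketch below fits a single sigmoid neuron to the logical-OR function by gradient descent on a cross-entropy loss; this is backpropagation reduced to its one-neuron special case. The dataset, learning rate, and epoch count are all arbitrary assumptions for the example:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

# Toy labeled dataset: inputs and targets for logical OR.
data = [([0, 0], 0), ([0, 1], 1), ([1, 0], 1), ([1, 1], 1)]
w, b, lr = [0.0, 0.0], 0.0, 0.5  # weights, bias, learning rate

for _ in range(2000):
    for x, target in data:
        y = sigmoid(w[0] * x[0] + w[1] * x[1] + b)
        # For cross-entropy loss, the gradient w.r.t. the pre-activation
        # is simply (y - target); update parameters downhill.
        grad = y - target
        w = [wi - lr * grad * xi for wi, xi in zip(w, x)]
        b -= lr * grad

predictions = [round(sigmoid(w[0] * x[0] + w[1] * x[1] + b)) for x, _ in data]
```

After training, the rounded outputs reproduce the OR truth table. In deep networks, backpropagation applies the chain rule to push this same error signal through every layer.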
Neural networks have transformative applications across nearly every industry. In computer vision, they enable facial recognition systems and autonomous vehicle navigation by companies like Tesla and Waymo. For natural language processing, models like BERT from Google and GPT-3 from OpenAI power advanced translation, chatbots, and content generation. In the sciences, they assist in protein structure prediction, as demonstrated by DeepMind's AlphaFold, and in finance they are used for algorithmic trading on exchanges such as the New York Stock Exchange.
The conceptual foundation was laid in 1943 with the formal model of a threshold logic unit by Warren McCulloch and Walter Pitts. The perceptron was introduced by Frank Rosenblatt in 1958, but limitations highlighted by Marvin Minsky and Seymour Papert in their book "Perceptrons" led to a period known as the "AI winter." A resurgence began in the 1980s with the popularization of the backpropagation algorithm by researchers including David Rumelhart and Geoffrey Hinton. The modern era of "deep learning" was catalyzed by increased computational power from NVIDIA GPUs and large datasets, leading to landmark achievements like IBM's Watson winning Jeopardy! and AlphaGo defeating champion Lee Sedol.
Category:Artificial intelligence Category:Machine learning Category:Computational neuroscience