Artificial intelligence

From KYNNpedia

Template:Artificial intelligence

Artificial intelligence (AI) is the intelligence of machines or software, as opposed to the intelligence of living beings, primarily of humans. It is a field of study in computer science that develops and studies intelligent machines. Such machines may be called AIs.

AI technology is widely used throughout industry, government, and science. Some high-profile applications are: advanced web search engines (e.g., Google Search), recommendation systems (used by YouTube, Amazon, and Netflix), interacting via human speech (e.g., Google Assistant, Siri, and Alexa), self-driving cars (e.g., Waymo), generative and creative tools (e.g., ChatGPT and AI art), and superhuman play and analysis in strategy games (e.g., chess and Go).<ref name="FOOTNOTEGoogle2016">Google (2016).</ref>

Alan Turing was the first person to conduct substantial research in the field that he called machine intelligence.<ref name="turing"/> Artificial intelligence was founded as an academic discipline in 1956.<ref name="Dartmouth workshop"/> The field went through multiple cycles of optimism,<ref name="AI in the 60s"/><ref name="AI in the 80s"/> followed by periods of disappointment and loss of funding, known as AI winter.<ref name="First AI winter" /><ref name="Second AI winter"/> Funding and interest vastly increased after 2012 when deep learning surpassed all previous AI techniques,<ref name="Deep learning revolution"/> and after 2017 with the transformer architecture.<ref name="FOOTNOTEToews2023">Toews (2023).</ref> This led to the AI spring of the early 2020s, with companies, universities, and laboratories overwhelmingly based in the United States pioneering significant advances in artificial intelligence.<ref name="FOOTNOTEFrank2023">Frank (2023).</ref>

The growing use of artificial intelligence in the 21st century is influencing a societal and economic shift towards increased automation, data-driven decision-making, and the integration of AI systems into various economic sectors and areas of life, impacting job markets, healthcare, government, industry, and education. This raises questions about the ethical implications and risks of AI, prompting discussions about regulatory policies to ensure the safety and benefits of the technology.

The various sub-fields of AI research are centered around particular goals and the use of particular tools. The traditional goals of AI research include reasoning, knowledge representation, planning, learning, natural language processing, perception, and support for robotics.<ref name="Problems of AI" group="lower-alpha"></ref> General intelligence (the ability to complete any task performable by a human) is among the field's long-term goals.<ref name="AGI"/>

To solve these problems, AI researchers have adapted and integrated a wide range of problem-solving techniques, including search and mathematical optimization, formal logic, artificial neural networks, and methods based on statistics, operations research, and economics.<ref name="Tools of AI" group="lower-alpha"></ref> AI also draws upon psychology, linguistics, philosophy, neuroscience and other fields.<ref name="AI influences">Russell & Norvig (2021, §1.2).</ref>

Goals

The general problem of simulating (or creating) intelligence has been broken into sub-problems. These consist of particular traits or capabilities that researchers expect an intelligent system to display. The traits described below have received the most attention and cover the scope of AI research.<ref name="Problems of AI" group="lower-alpha">This list of intelligent traits is based on the topics covered by the major AI textbooks, including: Russell & Norvig (2021), Luger & Stubblefield (2004), Poole, Mackworth & Goebel (1998) and Nilsson (1998)</ref>

Reasoning, problem-solving

Early researchers developed algorithms that imitated step-by-step reasoning that humans use when they solve puzzles or make logical deductions.<ref> Problem solving, puzzle solving, game playing and deduction:

</ref> By the late 1980s and 1990s, methods were developed for dealing with uncertain or incomplete information, employing concepts from probability and economics.<ref> Uncertain reasoning:

</ref>

Many of these algorithms are insufficient for solving large reasoning problems because they experience a "combinatorial explosion": they became exponentially slower as the problems grew larger.<ref name="Intractability"> Intractability and efficiency and the combinatorial explosion:

</ref> Even humans rarely use the step-by-step deduction that early AI research could model. They solve most of their problems using fast, intuitive judgments.<ref name="Psychological evidence of sub-symbolic reasoning"> Psychological evidence of the prevalence sub-symbolic reasoning and knowledge:

</ref> Accurate and efficient reasoning is an unsolved problem.

Knowledge representation

An ontology represents knowledge as a set of concepts within a domain and the relationships between those concepts.

Knowledge representation and knowledge engineering<ref> Knowledge representation and knowledge engineering:

</ref> allow AI programs to answer questions intelligently and make deductions about real-world facts. Formal knowledge representations are used in content-based indexing and retrieval,<ref name="FOOTNOTESmoliarZhang1994">Smoliar & Zhang (1994).</ref> scene interpretation,<ref name="FOOTNOTENeumannMöller2008">Neumann & Möller (2008).</ref> clinical decision support,<ref name="FOOTNOTEKupermanReichleyBailey2006">Kuperman, Reichley & Bailey (2006).</ref> knowledge discovery (mining "interesting" and actionable inferences from large databases),<ref name="FOOTNOTEMcGarry2005">McGarry (2005).</ref> and other areas.<ref name="FOOTNOTEBertiniDel BimboTorniai2006">Bertini, Del Bimbo & Torniai (2006).</ref>

A knowledge base is a body of knowledge represented in a form that can be used by a program. An ontology is the set of objects, relations, concepts, and properties used by a particular domain of knowledge.<ref name="FOOTNOTERussellNorvig2021272">Russell & Norvig (2021), pp. 272.</ref> Knowledge bases need to represent things such as: objects, properties, categories and relations between objects;<ref name="Representing categories and relations"> Representing categories and relations: Semantic networks, description logics, inheritance (including frames and scripts):

</ref> situations, events, states and time;<ref name="Representing time">Representing events and time:Situation calculus, event calculus, fluent calculus (including solving the frame problem):

</ref> causes and effects;<ref name="Representing causation"> Causal calculus:

</ref> knowledge about knowledge (what we know about what other people know);<ref name="Representing knowledge about knowledge"> Representing knowledge about knowledge: Belief calculus, modal logics:

</ref> default reasoning (things that humans assume are true until they are told differently and will remain true even when other facts are changing);<ref name="Default reasoning and non-monotonic logic"> Default reasoning, Frame problem, default logic, non-monotonic logics, circumscription, closed world assumption, abduction:

(Poole et al. places abduction under "default reasoning". Luger et al. places this under "uncertain reasoning"). </ref> and many other aspects and domains of knowledge.

Among the most difficult problems in knowledge representation are: the breadth of commonsense knowledge (the set of atomic facts that the average person knows is enormous);<ref name="Breadth of commonsense knowledge"> Breadth of commonsense knowledge:

</ref> and the sub-symbolic form of most commonsense knowledge (much of what people know is not represented as "facts" or "statements" that they could express verbally).<ref name="Psychological evidence of sub-symbolic reasoning"/> There is also the difficulty of knowledge acquisition, the problem of obtaining knowledge for AI applications.<ref group="lower-alpha">It is among the reasons that expert systems proved to be inefficient for capturing knowledge.<ref name="FOOTNOTENewquist1994296">Newquist (1994), p. 296.</ref><ref name="FOOTNOTECrevier1993204–208">Crevier (1993), pp. 204–208.</ref></ref>

Planning and decision making

An "agent" is anything that perceives and takes actions in the world. A rational agent has goals or preferences and takes actions to make them happen.<ref group="lower-alpha"> "Rational agent" is general term used in economics, philosophy and theoretical artificial intelligence. It can refer to anything that directs its behavior to accomplish goals, such as a person, an animal, a corporation, a nation, or, in the case of AI, a computer program. </ref><ref name="FOOTNOTERussellNorvig2021528">Russell & Norvig (2021), p. 528.</ref> In automated planning, the agent has a specific goal.<ref> Automated planning:

</ref> In automated decision making, the agent has preferences – there are some situations it would prefer to be in, and some situations it is trying to avoid. The decision making agent assigns a number to each situation (called the "utility") that measures how much the agent prefers it. For each possible action, it can calculate the "expected utility": the utility of all possible outcomes of the action, weighted by the probability that the outcome will occur. It can then choose the action with the maximum expected utility.<ref> Automated decision making, Decision theory:

</ref>

In classical planning, the agent knows exactly what the effect of any action will be.<ref> Classical planning:

</ref> In most real-world problems, however, the agent may not be certain about the situation they are in (it is "unknown" or "unobservable") and it may not know for certain what will happen after each possible action (it is not "deterministic"). It must choose an action by making a probabilistic guess and then reassess the situation to see if the action worked.<ref> Sensorless or "conformant" planning, contingent planning, replanning (a.k.a online planning):

</ref>

In some problems, the agent's preferences may be uncertain, especially if there are other agents or humans involved. These can be learned (e.g., with inverse reinforcement learning) or the agent can seek information to improve its preferences.<ref> Uncertain preferences:

Inverse reinforcement learning:

</ref> Information value theory can be used to weigh the value of exploratory or experimental actions.<ref> Information value theory:

</ref> The space of possible future actions and situations is typically intractably large, so the agents must take actions and evaluate situations while being uncertain what the outcome will be.

A Markov decision process has a transition model that describes the probability that a particular action will change the state in a particular way, and a reward function that supplies the utility of each state and the cost of each action. A policy associates a decision with each possible state. The policy could be calculated (e.g., by iteration), be heuristic, or it can be learned.<ref> Markov decision process:

</ref>

Game theory describes rational behavior of multiple interacting agents, and is used in AI programs that make decisions that involve other agents.<ref> Game theory and multi-agent decision theory:

</ref>

Learning

Machine learning is the study of programs that can improve their performance on a given task automatically.<ref name ="machine learning"> Learning:

</ref> It has been a part of AI from the beginning.<ref group="lower-alpha">Alan Turing discussed the centrality of learning as early as 1950, in his classic paper "Computing Machinery and Intelligence".<ref name="FOOTNOTETuring1950">Turing (1950).</ref> In 1956, at the original Dartmouth AI summer conference, Ray Solomonoff wrote a report on unsupervised probabilistic machine learning: "An Inductive Inference Machine".<ref name="FOOTNOTESolomonoff1956">Solomonoff (1956).</ref> </ref>

There are several kinds of machine learning. Unsupervised learning analyzes a stream of data and finds patterns and makes predictions without any other guidance.<ref> Unsupervised learning:

</ref> Supervised learning requires a human to label the input data first, and comes in two main varieties: classification (where the program must learn to predict what category the input belongs in) and regression (where the program must deduce a numeric function based on numeric input).<ref name="Supervised learning"> Supervised learning:

</ref>

In reinforcement learning the agent is rewarded for good responses and punished for bad ones. The agent learns to choose responses that are classified as "good".<ref> Reinforcement learning:

</ref> Transfer learning is when the knowledge gained from one problem is applied to a new problem.<ref> Transfer learning:

</ref> Deep learning is a type of machine learning that runs inputs through biologically inspired artificial neural networks for all of these types of learning.<ref>"Artificial Intelligence (AI): What Is AI and How Does It Work? | Built In". builtin.com. Retrieved 30 October 2023.</ref>

Computational learning theory can assess learners by computational complexity, by sample complexity (how much data is required), or by other notions of optimization.<ref> Computational learning theory:

</ref>

Natural language processing

Natural language processing (NLP)<ref> Natural language processing (NLP):

</ref> allows programs to read, write and communicate in human languages such as English. Specific problems include speech recognition, speech synthesis, machine translation, information extraction, information retrieval and question answering.<ref> Subproblems of NLP:

</ref>

Early work, based on Noam Chomsky's generative grammar and semantic networks, had difficulty with word-sense disambiguation<ref group="lower-alpha">See AI winter § Machine translation and the ALPAC report of 1966</ref> unless restricted to small domains called "micro-worlds" (due to the common sense knowledge problem<ref name="Breadth of commonsense knowledge" />). Margaret Masterman believed that it was meaning, and not grammar that was the key to understanding languages, and that thesauri and not dictionaries should be the basis of computational language structure.

Modern deep learning techniques for NLP include word embedding (representing words, typically as vectors encoding their meaning),<ref name="FOOTNOTERussellNorvig2021856–858">Russell & Norvig (2021), p. 856–858.</ref> transformers (a deep learning architecture using an attention mechanism),<ref name="FOOTNOTEDickson2022">Dickson (2022).</ref> and others.<ref>Modern statistical and deep learning approaches to NLP:

</ref> In 2019, generative pre-trained transformer (or "GPT") language models began to generate coherent text,<ref name="FOOTNOTEVincent2019">Vincent (2019).</ref><ref name="FOOTNOTERussellNorvig2021875–878">Russell & Norvig (2021), p. 875–878.</ref> and by 2023 these models were able to get human-level scores on the bar exam, SAT test, GRE test, and many other real-world applications.<ref name="FOOTNOTEBushwick2023">Bushwick (2023).</ref>

Perception

Machine perception is the ability to use input from sensors (such as cameras, microphones, wireless signals, active lidar, sonar, radar, and tactile sensors) to deduce aspects of the world. Computer vision is the ability to analyze visual input.<ref> Computer vision:

</ref>

The field includes speech recognition,<ref name="FOOTNOTERussellNorvig2021849–850">Russell & Norvig (2021), pp. 849–850.</ref> image classification,<ref name="FOOTNOTERussellNorvig2021895–899">Russell & Norvig (2021), pp. 895–899.</ref> facial recognition, object recognition,<ref name="FOOTNOTERussellNorvig2021899–901">Russell & Norvig (2021), pp. 899–901.</ref> and robotic perception.<ref name="FOOTNOTERussellNorvig2021931–938">Russell & Norvig (2021), pp. 931–938.</ref>

Social intelligence

Kismet, a robot head which was made in the 1990s; a machine that can recognize and simulate emotions.<ref name="FOOTNOTEMIT AIL2014">MIT AIL (2014).</ref>

Affective computing is an interdisciplinary umbrella that comprises systems that recognize, interpret, process or simulate human feeling, emotion and mood.<ref> Affective computing:

</ref> For example, some virtual assistants are programmed to speak conversationally or even to banter humorously; it makes them appear more sensitive to the emotional dynamics of human interaction, or to otherwise facilitate human–computer interaction.

However, this tends to give naïve users an unrealistic conception of the intelligence of existing computer agents.<ref name="FOOTNOTEWaddell2018">Waddell (2018).</ref> Moderate successes related to affective computing include textual sentiment analysis and, more recently, multimodal sentiment analysis, wherein AI classifies the affects displayed by a videotaped subject.<ref name="FOOTNOTEPoriaCambriaBajpaiHussain2017">Poria et al. (2017).</ref>

General intelligence

A machine with artificial general intelligence should be able to solve a wide variety of problems with breadth and versatility similar to human intelligence.<ref name = "AGI" > Artificial general intelligence:

Proposal for the modern version:

Warnings of overspecialization in AI from leading researchers:

</ref>

Techniques

AI research uses a wide variety of techniques to accomplish the goals above.<ref name="Tools of AI" group="lower-alpha">This list of tools is based on the topics covered by the major AI textbooks, including: Russell & Norvig (2021), Luger & Stubblefield (2004), Poole, Mackworth & Goebel (1998) and Nilsson (1998)</ref>

Search and optimization

AI can solve many problems by intelligently searching through many possible solutions.<ref> Search algorithms:

</ref> There are two very different kinds of search used in AI: state space search and local search.

State space search

State space search searches through a tree of possible states to try to find a goal state.<ref name="State space search"> State space search:

</ref> For example, planning algorithms search through trees of goals and subgoals, attempting to find a path to a target goal, a process called means-ends analysis.<ref name="FOOTNOTERussellNorvig2021§11.2">Russell & Norvig (2021), §11.2.</ref>

Simple exhaustive searches<ref name="Uninformed search">Uninformed searches (breadth first search, depth-first search and general state space search):

</ref> are rarely sufficient for most real-world problems: the search space (the number of places to search) quickly grows to astronomical numbers. The result is a search that is too slow or never completes.<ref name="Intractability"/> "Heuristics" or "rules of thumb" can help to prioritize choices that are more likely to reach a goal.<ref name="Informed search"> Heuristic or informed searches (e.g., greedy best first and A*):

</ref>

Adversarial search is used for game-playing programs, such as chess or Go. It searches through a tree of possible moves and counter-moves, looking for a winning position.<ref> Adversarial search:

</ref>

Local search

Illustration of gradient descent for 3 different starting points. Two parameters (represented by the plan coordinates) are adjusted in order to minimize the loss function (the height).

Local search uses mathematical optimization to find a solution to a problem. It begins with some form of guess and refines it incrementally.<ref name="Local search2">Local or "optimization" search:

Gradient descent is a type of local search that optimizes a set of numerical parameters by incrementally adjusting them to minimize a loss function. Variants of gradient descent are commonly used to train neural networks.<ref>Singh Chauhan, Nagesh (18 December 2020). "Optimization Algorithms in Neural Networks". KDnuggets. Retrieved 13 January 2024.</ref>

Another type of local search is evolutionary computation, which aims to iteratively improve a set of candidate solutions by "mutating" and "recombining" them, selecting only the fittest to survive each generation.<ref> Evolutionary computation:

</ref>

Distributed search processes can coordinate via swarm intelligence algorithms. Two popular swarm algorithms used in search are particle swarm optimization (inspired by bird flocking) and ant colony optimization (inspired by ant trails).<ref name="FOOTNOTEMerkleMiddendorf2013">Merkle & Middendorf (2013).</ref>

Logic

Formal Logic is used for reasoning and knowledge representation.<ref name="Logic"> Logic:

</ref> Formal logic comes in two main forms: propositional logic (which operates on statements that are true or false and uses logical connectives such as "and", "or", "not" and "implies")<ref name="Propositional logic"> Propositional logic:

and predicate logic (which also operates on objects, predicates and relations and uses quantifiers such as "Every X is a Y" and "There are some Xs that are Ys").<ref name="Predicate logic"> First-order logic and features such as equality:

</ref>

Logical inference (or deduction) is the process of proving a new statement (conclusion) from other statements that are already known to be true (the premises).<ref name="Inference"> Logical inference:

</ref> A logical knowledge base also handles queries and assertions as a special case of inference.<ref name="FOOTNOTERussellNorvig2021§8.3.1">Russell & Norvig (2021), §8.3.1.</ref> An inference rule describes what is a valid step in a proof. The most general inference rule is resolution.<ref name="Resolution"> Resolution and unification:

</ref> Inference can be reduced to performing a search to find a path that leads from premises to conclusions, where each step is the application of an inference rule.<ref name="Logic as search">Forward chaining, backward chaining, Horn clauses, and logical deduction as search:

</ref> Inference performed this way is intractable except for short proofs in restricted domains. No efficient, powerful and general method has been discovered.

Fuzzy logic assigns a "degree of truth" between 0 and 1. It can therefore handle propositions that are vague and partially true.<ref name="Fuzzy logic"> Fuzzy logic:

</ref> Non-monotonic logics are designed to handle default reasoning.<ref name="Default reasoning and non-monotonic logic"/> Other specialized versions of logic have been developed to describe many complex domains (see knowledge representation above).

Probabilistic methods for uncertain reasoning

A simple Bayesian network, with the associated conditional probability tables

Many problems in AI (including in reasoning, planning, learning, perception, and robotics) require the agent to operate with incomplete or uncertain information. AI researchers have devised a number of tools to solve these problems using methods from probability theory and economics.<ref name="Uncertain reasoning"> Stochastic methods for uncertain reasoning:

</ref>

Bayesian networks<ref name="Bayesian networks"> Bayesian networks:

</ref> are a very general tool that can be used for many problems, including reasoning (using the Bayesian inference algorithm),<ref group="lower-alpha"> Compared with symbolic logic, formal Bayesian inference is computationally expensive. For inference to be tractable, most observations must be conditionally independent of one another. AdSense uses a Bayesian network with over 300 million edges to learn which ads to serve.<ref name="FOOTNOTEDomingos2015chapter 6">Domingos (2015), chapter 6.</ref> </ref><ref name="Bayesian inference"> Bayesian inference algorithm:

</ref> learning (using the expectation-maximization algorithm),<ref group="lower-alpha">Expectation-maximization, one of the most popular algorithms in machine learning, allows clustering in the presence of unknown latent variables.<ref name="FOOTNOTEDomingos2015210">Domingos (2015), p. 210.</ref></ref><ref name="Bayesian learning"> Bayesian learning and the expectation-maximization algorithm:

</ref> planning (using decision networks)<ref name="Bayesian decision networks">Bayesian decision theory and Bayesian decision networks:

</ref> and perception (using dynamic Bayesian networks).<ref name="Stochastic temporal models"/>

Probabilistic algorithms can also be used for filtering, prediction, smoothing and finding explanations for streams of data, helping perception systems to analyze processes that occur over time (e.g., hidden Markov models or Kalman filters).<ref name="Stochastic temporal models"> Stochastic temporal models:

Hidden Markov model:

Kalman filters:

Dynamic Bayesian networks:

</ref>

Precise mathematical tools have been developed that analyze how an agent can make choices and plan, using decision theory, decision analysis,<ref name="Decisions theory and analysis"> decision theory and decision analysis:

</ref> and information value theory.<ref name="Information value theory"> Information value theory:

</ref> These tools include models such as Markov decision processes,<ref name="Markov decision process">Markov decision processes and dynamic decision networks:

</ref> dynamic decision networks,<ref name="Stochastic temporal models" /> game theory and mechanism design.<ref name="Game theory and mechanism design">Game theory and mechanism design:

</ref>

Expectation-maximization clustering of Old Faithful eruption data starts from a random guess but then successfully converges on an accurate clustering of the two physically distinct modes of eruption.

Classifiers and statistical learning methods

The simplest AI applications can be divided into two types: classifiers (e.g., "if shiny then diamond"), on one hand, and controllers (e.g., "if diamond then pick up"), on the other hand. Classifiers<ref name="Statistical classifiers"> Statistical learning methods and classifiers:

</ref> are functions that use pattern matching to determine the closest match. They can be fine-tuned based on chosen examples using supervised learning. Each pattern (also called an "observation") is labeled with a certain predefined class. All the observations combined with their class labels are known as a data set. When a new observation is received, that observation is classified based on previous experience.<ref name="Supervised learning"/>

There are many kinds of classifiers in use. The decision tree is the simplest and most widely used symbolic machine learning algorithm.<ref> Decision trees:

</ref> K-nearest neighbor algorithm was the most widely used analogical AI until the mid-1990s, and Kernel methods such as the support vector machine (SVM) displaced k-nearest neighbor in the 1990s.<ref> Non-parameteric learning models such as K-nearest neighbor and support vector machines:

</ref> The naive Bayes classifier is reportedly the "most widely used learner"<ref name="FOOTNOTEDomingos2015152">Domingos (2015), p. 152.</ref> at Google, due in part to its scalability.<ref> Naive Bayes classifier:

</ref> Neural networks are also used as classifiers.<ref name="Neural networks"/>

Artificial neural networks

A neural network is an interconnected group of nodes, akin to the vast network of neurons in the human brain.

An artificial neural network is based on a collection of nodes also known as artificial neurons, which loosely model the neurons in a biological brain. It is trained to recognise patterns; once trained, it can recognise those patterns in fresh data. There is an input, at least one hidden layer of nodes and an output. Each node applies a function and once the weight crosses its specified threshold, the data is transmitted to the next layer. A network is typically called a deep neural network if it has at least 2 hidden layers.<ref name="Neural networks"> Neural networks:

</ref>

Learning algorithms for neural networks use local search to choose the weights that will get the right output for each input during training. The most common training technique is the backpropagation algorithm.<ref name="Backpropagation"> Gradient calculation in computational graphs, backpropagation, automatic differentiation:

</ref> Neural networks learn to model complex relationships between inputs and outputs and find patterns in data. In theory, a neural network can learn any function.<ref> Universal approximation theorem:

The theorem:

</ref>

In feedforward neural networks the signal passes in only one direction.<ref> Feedforward neural networks:

</ref> Recurrent neural networks feed the output signal back into the input, which allows short-term memories of previous input events. Long short term memory is the most successful network architecture for recurrent networks.<ref> Recurrent neural networks:

</ref> Perceptrons<ref> Perceptrons:

</ref> use only a single layer of neurons, deep learning<ref name="Deep learning"/> uses multiple layers. Convolutional neural networks strengthen the connection between neurons that are "close" to each other – this is especially important in image processing, where a local set of neurons must identify an "edge" before the network can identify an object.<ref> Convolutional neural networks:

</ref>

Deep learning

Deep learning<ref name="Deep learning"> Deep learning:

</ref> uses several layers of neurons between the network's inputs and outputs. The multiple layers can progressively extract higher-level features from the raw input. For example, in image processing, lower layers may identify edges, while higher layers may identify the concepts relevant to a human such as digits or letters or faces.<ref name="FOOTNOTEDengYu2014199–200">Deng & Yu (2014), pp. 199–200.</ref>

Deep learning has profoundly improved the performance of programs in many important subfields of artificial intelligence, including computer vision, speech recognition, natural language processing, image classification<ref name="FOOTNOTECiresanMeierSchmidhuber2012">Ciresan, Meier & Schmidhuber (2012).</ref> and others. The reason that deep learning performs so well in so many applications is not known as of 2023.<ref name="FOOTNOTERussellNorvig2021751">Russell & Norvig (2021), p. 751.</ref> The sudden success of deep learning in 2012–2015 did not occur because of some new discovery or theoretical breakthrough (deep neural networks and backpropagation had been described by many people, as far back as the 1950s)<ref group="lower-alpha"> Some form of deep neural networks (without a specific learning algorithm) were described by: Alan Turing (1948);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Frank Rosenblatt(1957);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Karl Steinbuch and Roger David Joseph (1961).<ref name="FOOTNOTESchmidhuber2022§5">Schmidhuber (2022), §5.</ref> Deep or recurrent networks that learned (or used gradient descent) were developed by: Ernst Ising and Wilhelm Lenz (1925);<ref name="FOOTNOTESchmidhuber2022§6">Schmidhuber (2022), §6.</ref> Oliver Selfridge (1959);<ref name="FOOTNOTESchmidhuber2022§5">Schmidhuber (2022), §5.</ref> Alexey Ivakhnenko and Valentin Lapa (1965);<ref name="FOOTNOTESchmidhuber2022§6">Schmidhuber (2022), §6.</ref> Kaoru Nakano (1977);<ref name="FOOTNOTESchmidhuber2022§7">Schmidhuber (2022), §7.</ref> Shun-Ichi Amari (1972);<ref name="FOOTNOTESchmidhuber2022§7">Schmidhuber (2022), §7.</ref> John Joseph Hopfield (1982).<ref name="FOOTNOTESchmidhuber2022§7">Schmidhuber (2022), §7.</ref> Backpropagation was independently discovered by: Henry J. Kelley (1960);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Arthur E. Bryson (1962);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Stuart Dreyfus (1962);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Arthur E. Bryson and Yu-Chi Ho (1969);<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> Seppo Linnainmaa (1970);<ref name="FOOTNOTESchmidhuber2022§8">Schmidhuber (2022), §8.</ref> Paul Werbos (1974).<ref name="FOOTNOTERussellNorvig2021785">Russell & Norvig (2021), p. 785.</ref> In fact, backpropagation and gradient descent are straight forward applications of Gottfried Leibniz' chain rule in calculus (1676),<ref name="FOOTNOTESchmidhuber2022§2">Schmidhuber (2022), §2.</ref> and is essentially identical (for one layer) to the method of least squares, developed independently by Johann Carl Friedrich Gauss (1795) and Adrien-Marie Legendre (1805).<ref name="FOOTNOTESchmidhuber2022§3">Schmidhuber (2022), §3.</ref> There are probably many others, yet to be discovered by historians of science. </ref> but because of two factors: the incredible increase in computer power (including the hundred-fold increase in speed by switching to GPUs) and the availability of vast amounts of training data, especially the giant curated datasets used for benchmark testing, such as ImageNet.<ref group="lower-alpha">Geoffrey Hinton said, of his work on neural networks in the 1990s, "our labeled datasets were thousands of times too small. [And] our computers were millions of times too slow"<ref>Quoted in Christian (2020, p. 22)</ref></ref>

GPT

Generative pre-trained transformers (GPT) are large language models that are based on the semantic relationships between words in sentences (natural language processing). Text-based GPT models are pre-trained on a large corpus of text which can be from the internet. The pre-training consists in predicting the next token (a token being usually a word, subword, or punctuation). Throughout this pre-training, GPT models accumulate knowledge about the world, and can then generate human-like text by repeatedly predicting the next token. Typically, a subsequent training phase makes the model more truthful, useful and harmless, usually with a technique called reinforcement learning from human feedback (RLHF). Current GPT models are still prone to generating falsehoods called "hallucinations", although this can be reduced with RLHF and quality data. They are used in chatbots, which allow you to ask a question or request a task in simple text.<ref name="FOOTNOTESmith2023">Smith (2023).</ref><ref>"Explained: Generative AI". 9 November 2023.</ref>

Current models and services include: Gemini (formerly Bard), ChatGPT, Grok, Claude, Copilot and LLaMA.<ref>"AI Writing and Content Creation Tools". MIT Sloan Teaching & Learning Technologies. Retrieved 25 December 2023.</ref> Multimodal GPT models can process different types of data (modalities) such as images, videos, sound and text.<ref name="FOOTNOTEMarmouyet2023">Marmouyet (2023).</ref>

Specialized hardware and software

In the late 2010s, graphics processing units (GPUs) that were increasingly designed with AI-specific enhancements and used with specialized TensorFlow software, had replaced previously used central processing unit (CPUs) as the dominant means for large-scale (commercial and academic) machine learning models' training.<ref name="FOOTNOTEKobielus2019">Kobielus (2019).</ref> Historically, specialized languages, such as Lisp, Prolog, Python and others, had been used.

Applications

AI and machine learning technology is used in most of the essential applications of the 2020s, including: search engines (such as Google Search), targeting online advertisements, recommendation systems (offered by Netflix, YouTube or Amazon), driving internet traffic, targeted advertising (AdSense, Facebook), virtual assistants (such as Siri or Alexa), autonomous vehicles (including drones, ADAS and self-driving cars), automatic language translation (Microsoft Translator, Google Translate), facial recognition (Apple's Face ID or Microsoft's DeepFace and Google's FaceNet) and image labeling (used by Facebook, Apple's iPhoto and TikTok).

Health and medicine

The application of AI in medicine and medical research has the potential to increase patient care and quality of life.<ref>Davenport, T; Kalakota, R (June 2019). "The potential for artificial intelligence in healthcare". Future Healthc J. 6 (2): 94–98. doi:10.7861/futurehosp.6-2-94. PMC 6616181. PMID 31363513.</ref> Through the lens of the Hippocratic Oath, medical professionals are ethically compelled to use AI, if applications can more accurately diagnose and treat patients.

For medical research, AI is an important tool for processing and integrating Big Data. This is particularly important for organoid and tissue engineering development which use microscopy imaging as a key technique in fabrication.<ref name="The future of personalized cardiova">Bax, Monique; Thorpe, Jordan; Romanov, Valentin (December 2023). "The future of personalized cardiovascular medicine demands 3D and 4D printing, stem cells, and artificial intelligence". Frontiers in Sensors. 4. doi:10.3389/fsens.2023.1294721. ISSN 2673-5067.</ref> It has been suggested that AI can overcome discrepancies in funding allocated to different fields of research.<ref name="The future of personalized cardiova"/> New AI tools can deepen our understanding of biomedically relevant pathways. For example, AlphaFold 2 (2021) demonstrated the ability to approximate, in hours rather than months, the 3D structure of a protein.<ref>Jumper, J; Evans, R; Pritzel, A (2021). "Highly accurate protein structure prediction with AlphaFold". Nature. 596 (7873): 583–589. Bibcode:2021Natur.596..583J. doi:10.1038/s41586-021-03819-2. PMC 8371605. PMID 34265844.</ref> In 2023 it was reported that AI guided drug discovery helped find a class of antibiotics capable of killing two different types of drug-resistant bacteria.<ref>"AI discovers new class of antibiotics to kill drug-resistant bacteria". 20 December 2023.</ref>

Games

Game playing programs have been used since the 1950s to demonstrate and test AI's most advanced techniques.<ref>Grant, Eugene F.; Lardner, Rex (25 July 1952). "The Talk of the Town – It". The New Yorker. ISSN 0028-792X. Retrieved 28 January 2024.</ref> Deep Blue became the first computer chess-playing system to beat a reigning world chess champion, Garry Kasparov, on 11 May 1997.<ref>Anderson, Mark Robert (11 May 2017). "Twenty years on from Deep Blue vs Kasparov: how a chess match started the big data revolution". The Conversation. Retrieved 28 January 2024.</ref> In 2011, in a Jeopardy! quiz show exhibition match, IBM's question answering system, Watson, defeated the two greatest Jeopardy! champions, Brad Rutter and Ken Jennings, by a significant margin.<ref>Markoff, John (16 February 2011). "Computer Wins on 'Jeopardy!': Trivial, It's Not". The New York Times. ISSN 0362-4331. Retrieved 28 January 2024.</ref> In March 2016, AlphaGo won 4 out of 5 games of Go in a match with Go champion Lee Sedol, becoming the first computer Go-playing system to beat a professional Go player without handicaps. Then in 2017 it defeated Ke Jie, who was the best Go player in the world.<ref>Byford, Sam (27 May 2017). "AlphaGo retires from competitive Go after defeating world number one 3-0". The Verge. Retrieved 28 January 2024.</ref> Other programs handle imperfect-information games, such as the poker-playing program Pluribus.<ref>Brown, Noam; Sandholm, Tuomas (30 August 2019). "Superhuman AI for multiplayer poker". Science. 365 (6456): 885–890. doi:10.1126/science.aay2400. ISSN 0036-8075.</ref> DeepMind developed increasingly generalistic reinforcement learning models, such as with MuZero, which could be trained to play chess, Go, or Atari games.<ref>"MuZero: Mastering Go, chess, shogi and Atari without rules". Google DeepMind. 23 December 2020. Retrieved 28 January 2024.</ref> In 2019, DeepMind's AlphaStar achieved grandmaster level in StarCraft II, a particularly challenging real-time strategy game that involves incomplete knowledge of what happens on the map.<ref>Sample, Ian (30 October 2019). "AI becomes grandmaster in 'fiendishly complex' StarCraft II". The Guardian. ISSN 0261-3077. Retrieved 28 January 2024.</ref> In 2021 an AI agent competed in a Playstation Gran Turismo competition, winning against four of the world's best Gran Turismo drivers using deep reinforcement learning.<ref>Wurman, P.R.; Barrett, S.; Kawamoto, K. (2022). "Outracing champion Gran Turismo drivers with deep reinforcement learning". Nature 602. 602 (7896): 223–228. doi:10.1038/s41586-021-04357-7.</ref>

Military

Various countries are deploying AI military applications.<ref name=":22">Congressional Research Service (2019). Artificial Intelligence and National Security (PDF). Washington, DC: Congressional Research Service.PD-notice</ref> The main applications enhance command and control, communications, sensors, integration and interoperability.<ref name="AI">Slyusar, Vadym (2019). "Artificial intelligence as the basis of future control networks". ResearchGate. doi:10.13140/RG.2.2.30247.50087.</ref> Research is targeting intelligence collection and analysis, logistics, cyber operations, information operations, and semiautonomous and autonomous vehicles.<ref name=":22" /> AI technologies enable coordination of sensors and effectors, threat detection and identification, marking of enemy positions, target acquisition, coordination and deconfliction of distributed Joint Fires between networked combat vehicles involving manned and unmanned teams.<ref name="AI" /> AI was incorporated into military operations in Iraq and Syria.<ref name=":22" />

In November 2023, US Vice President Kamala Harris disclosed a declaration signed by 31 nations to set guardrails for the military use of IA. The commitments include using legal reviews to ensure the compliance of military AI with international laws, and being cautious and transparent in the development of this technology.<ref>Knight, Will. "The US and 30 Other Nations Agree to Set Guardrails for Military AI". Wired. ISSN 1059-1028. Retrieved 24 January 2024.</ref>

Generative AI

Vincent van Gogh in watercolour created by generative AI software

In the early 2020s, generative AI gained widespread prominence. In March 2023, 58% of US adults had heard about ChatGPT and 14% had tried it.<ref>Marcelline, Marco (27 May 2023). "ChatGPT: Most Americans Know About It, But Few Actually Use the AI Chatbot". PCMag. Retrieved 28 January 2024.</ref> The increasing realism and ease-of-use of AI-based text-to-image generators such as Midjourney, DALL-E, and Stable Diffusion sparked a trend of viral AI-generated photos. Widespread attention was gained by a fake photo of Pope Francis wearing a white puffer coat, the fictional arrest of Donald Trump, and a hoax of an attack on the Pentagon, as well as the usage in professional creative arts.<ref>Lu, Donna (31 March 2023). "Misinformation, mistakes and the Pope in a puffer: what rapidly evolving AI can – and can't – do". The Guardian. ISSN 0261-3077. Retrieved 28 January 2024.</ref><ref>Hurst, Luke (23 May 2023). "How a fake image of a Pentagon explosion shared on Twitter caused a real dip on Wall Street". euronews. Retrieved 28 January 2024.</ref>

Industry-specific tasks

There are also thousands of successful AI applications used to solve specific problems for specific industries or institutions. In a 2017 survey, one in five companies reported they had incorporated "AI" in some offerings or processes.<ref>Ransbotham, Sam; Kiron, David; Gerbert, Philipp; Reeves, Martin (6 September 2017). "Reshaping Business With Artificial Intelligence". MIT Sloan Management Review. Archived from the original on 13 February 2024.</ref> A few examples are energy storage, medical diagnosis, military logistics, applications that predict the result of judicial decisions, foreign policy, or supply chain management.

In agriculture, AI has helped farmers identify areas that need irrigation, fertilization, pesticide treatments or increasing yield. Agronomists use AI to conduct research and development. AI has been used to predict the ripening time for crops such as tomatoes, monitor soil moisture, operate agricultural robots, conduct predictive analytics, classify livestock pig call emotions, automate greenhouses, detect diseases and pests, and save water.

Artificial intelligence is used in astronomy to analyze increasing amounts of available data and applications, mainly for "classification, regression, clustering, forecasting, generation, discovery, and the development of new scientific insights" for example for discovering exoplanets, forecasting solar activity, and distinguishing between signals and instrumental effects in gravitational wave astronomy. It could also be used for activities in space such as space exploration, including analysis of data from space missions, real-time science decisions of spacecraft, space debris avoidance, and more autonomous operation.

Ethics

AI, like any powerful technology, has potential benefits and potential risks. AI may be able to advance science and find solutions for serious problems: Demis Hassabis of Deep Mind hopes to "solve intelligence, and then use that to solve everything else".<ref name="FOOTNOTESimonite2016">Simonite (2016).</ref> However, as the use of AI has become widespread, several unintended consequences and risks have been identified.<ref name="FOOTNOTERussellNorvig2021987">Russell & Norvig (2021), p. 987.</ref>

Anyone looking to use machine learning as part of real-world, in-production systems needs to factor ethics into their AI training processes and strive to avoid bias. This is especially true when using AI algorithms that are inherently unexplainable in deep learning.<ref name="FOOTNOTELaskowski2023">Laskowski (2023).</ref>

Risks and harm

Privacy and copyright

Machine learning algorithms require large amounts of data. The techniques used to acquire this data have raised concerns about privacy, surveillance and copyright.

Technology companies collect a wide range of data from their users, including online activity, geolocation data, video and audio.<ref name="FOOTNOTEGAO2022">GAO (2022).</ref> For example, in order to build speech recognition algorithms, Amazon have recorded millions of private conversations and allowed temporary workers to listen to and transcribe some of them.<ref name="FOOTNOTEValinsky2019">Valinsky (2019).</ref> Opinions about this widespread surveillance range from those who see it as a necessary evil to those for whom it is clearly unethical and a violation of the right to privacy.<ref name="FOOTNOTERussellNorvig2021991">Russell & Norvig (2021), p. 991.</ref>

AI developers argue that this is the only way to deliver valuable applications. and have developed several techniques that attempt to preserve privacy while still obtaining the data, such as data aggregation, de-identification and differential privacy.<ref name="FOOTNOTERussellNorvig2021991–992">Russell & Norvig (2021), p. 991–992.</ref> Since 2016, some privacy experts, such as Cynthia Dwork, began to view privacy in terms of fairness. Brian Christian wrote that experts have pivoted "from the question of 'what they know' to the question of 'what they're doing with it'.".<ref name="FOOTNOTEChristian202063">Christian (2020), p. 63.</ref>

Generative AI is often trained on unlicensed copyrighted works, including in domains such as images or computer code; the output is then used under a rationale of "fair use". Also website owners who do not wish to have their copyrighted content be AI indexed or 'scraped' can add code to their site, as you would, if you did not want your website to be indexed by a search engine which is currently available to certain services such as OpenAI. Experts disagree about how well, and under what circumstances, this rationale will hold up in courts of law; relevant factors may include "the purpose and character of the use of the copyrighted work" and "the effect upon the potential market for the copyrighted work".<ref name="FOOTNOTEVincent2022">Vincent (2022).</ref> In 2023, leading authors (including John Grisham and Jonathan Franzen) sued AI companies for using their work to train generative AI.<ref name="FOOTNOTEReisner2023">Reisner (2023).</ref><ref name="FOOTNOTEAlterHarris2023">Alter & Harris (2023).</ref>

Misinformation

YouTube, Facebook and others use recommender systems to guide users to more content. These AI programs were given the goal of maximizing user engagement (that is, the only goal was to keep people watching). The AI learned that users tended to choose misinformation, conspiracy theories, and extreme partisan content, and, to keep them watching, the AI recommended more of it. Users also tended to watch more content on the same subject, so the AI led people into filter bubbles where they received multiple versions of the same misinformation.<ref name="FOOTNOTENicas2018">Nicas (2018).</ref> This convinced many users that the misinformation was true, and ultimately undermined trust in institutions, the media and the government.<ref>Rainie, Lee; Keeter, Scott; Perrin, Andrew (22 July 2019). "Trust and Distrust in America". Pew Research Center. Archived from the original on 22 February 2024.</ref> The AI program had correctly learned to maximize its goal, but the result was harmful to society. After the U.S. election in 2016, major technology companies took steps to mitigate the problem.

In 2022, generative AI began to create images, audio, video and text that are indistinguishable from real photographs, recordings, films or human writing. It is possible for bad actors to use this technology to create massive amounts of misinformation or propaganda.<ref name="FOOTNOTEWilliams2023">Williams (2023).</ref> AI pioneer Geoffrey Hinton expressed concern about AI enabling "authoritarian leaders to manipulate their electorates" on a large scale, among other risks.<ref name="FOOTNOTETaylorHern2023">Taylor & Hern (2023).</ref>

Algorithmic bias and fairness

Machine learning applications will be biased if they learn from biased data.<ref name="FOOTNOTERose2023">Rose (2023).</ref> The developers may not be aware that the bias exists.<ref name="FOOTNOTECNA2019">CNA (2019).</ref> Bias can be introduced by the way training data is selected and by the way a model is deployed.<ref name="FOOTNOTEGoffrey200817">Goffrey (2008), p. 17.</ref><ref name="FOOTNOTERose2023">Rose (2023).</ref> If a biased algorithm is used to make decisions that can seriously harm people (as it can in medicine, finance, recruitment, housing or policing) then the algorithm may cause discrimination.<ref>Berdahl et al. (2023); Goffrey (2008, p. 17); Rose (2023); Russell & Norvig (2021, p. 995)</ref> Fairness in machine learning is the study of how to prevent the harm caused by algorithmic bias. It has become serious area of academic study within AI. Researchers have discovered it is not always possible to define "fairness" in a way that satisfies all stakeholders.<ref> Algorithmic bias and Fairness (machine learning):

</ref>

On June 28, 2015, Google Photos's new image labeling feature mistakenly identified Jacky Alcine and a friend as "gorillas" because they were black. The system was trained on a dataset that contained very few images of black people,<ref name="FOOTNOTEChristian202025">Christian (2020), p. 25.</ref> a problem called "sample size disparity".<ref name="FOOTNOTERussellNorvig2021995">Russell & Norvig (2021), p. 995.</ref> Google "fixed" this problem by preventing the system from labelling anything as a "gorilla". Eight years later, in 2023, Google Photos still could not identify a gorilla, and neither could similar products from Apple, Facebook, Microsoft and Amazon.<ref name="FOOTNOTEGrantHill2023">Grant & Hill (2023).</ref>

COMPAS is a commercial program widely used by U.S. courts to assess the likelihood of a defendant becoming a recidivist. In 2016, Julia Angwin at ProPublica discovered that COMPAS exhibited racial bias, despite the fact that the program was not told the races of the defendants. Although the error rate for both whites and blacks was calibrated equal at exactly 61%, the errors for each race were different—the system consistently overestimated the chance that a black person would re-offend and would underestimate the chance that a white person would not re-offend.<ref name="FOOTNOTELarsonAngwin2016">Larson & Angwin (2016).</ref> In 2017, several researchers<ref group="lower-alpha">Including Jon Kleinberg (Cornell), Sendhil Mullainathan (University of Chicago), Cynthia Chouldechova (Carnegie Mellon) and Sam Corbett-Davis (Stanford)<ref name="FOOTNOTEChristian202067–70">Christian (2020), p. 67–70.</ref></ref> showed that it was mathematically impossible for COMPAS to accommodate all possible measures of fairness when the base rates of re-offense were different for whites and blacks in the data.<ref>Christian (2020, pp. 67–70); Russell & Norvig (2021, pp. 993–994)</ref>

A program can make biased decisions even if the data does not explicitly mention a problematic feature (such as "race" or "gender"). The feature will correlate with other features (like "address", "shopping history" or "first name"), and the program will make the same decisions based on these features as it would on "race" or "gender".<ref>Russell & Norvig (2021, p. 995); Lipartito (2011, p. 36); Goodman & Flaxman (2017, p. 6); Christian (2020, pp. 39–40, 65)</ref> Moritz Hardt said "the most robust fact in this research area is that fairness through blindness doesn't work."<ref>Quoted in Christian (2020, p. 65).</ref>

Criticism of COMPAS highlighted a deeper problem with the misuse of AI. Machine learning models are designed to make "predictions" that are only valid if we assume that the future will resemble the past. If they are trained on data that includes the results of racist decisions in the past, machine learning models must predict that racist decisions will be made in the future. Unfortunately, if an application then uses these predictions as recommendations, some of these "recommendations" will likely be racist.<ref>Russell & Norvig (2021, p. 994); Christian (2020, pp. 40, 80–81)</ref> Thus, machine learning is not well suited to help make decisions in areas where there is hope that the future will be better than the past. It is necessarily descriptive and not proscriptive.<ref group="lower-alpha">Moritz Hardt (a director at the Max Planck Institute for Intelligent Systems) argues that machine learning "is fundamentally the wrong tool for a lot of domains, where you're trying to design interventions and mechanisms that change the world."<ref>Quoted in Christian (2020, p. 80)</ref></ref>

Bias and unfairness may go undetected because the developers are overwhelmingly white and male: among AI engineers, about 4% are black and 20% are women.<ref name="FOOTNOTERussellNorvig2021995">Russell & Norvig (2021), p. 995.</ref>

At its 2022 Conference on Fairness, Accountability, and Transparency (ACM FAccT 2022) the Association for Computing Machinery, in Seoul, South Korea, presented and published findings recommending that until AI and robotics systems are demonstrated to be free of bias mistakes, they are unsafe and the use of self-learning neural networks trained on vast, unregulated sources of flawed internet data should be curtailed.<ref name="FOOTNOTEDockrill2022">Dockrill (2022).</ref>

Lack of transparency

Lidar testing vehicle for autonomous driving

Many AI systems are so complex that their designers cannot explain how they reach their decisions.<ref name="FOOTNOTESample2017">Sample (2017).</ref> Particularly with deep neural networks, in which there are a large amount of non-linear relationships between inputs and outputs. But some popular explainability techniques exist.<ref>"Black Box AI". 16 June 2023.</ref>

There have been many cases where a machine learning program passed rigorous tests, but nevertheless learned something different than what the programmers intended. For example, a system that could identify skin diseases better than medical professionals was found to actually have a strong tendency to classify images with a ruler as "cancerous", because pictures of malignancies typically include a ruler to show the scale.<ref name="FOOTNOTEChristian2020110">Christian (2020), p. 110.</ref> Another machine learning system designed to help effectively allocate medical resources was found to classify patients with asthma as being at "low risk" of dying from pneumonia. Having asthma is actually a severe risk factor, but since the patients having asthma would usually get much more medical care, they were relatively unlikely to die according to the training data. The correlation between asthma and low risk of dying from pneumonia was real, but misleading.<ref name="FOOTNOTEChristian202088–91">Christian (2020), pp. 88–91.</ref>

People who have been harmed by an algorithm's decision have a right to an explanation. Doctors, for example, are required to clearly and completely explain the reasoning behind any decision they make.[clarification needed]<ref>Christian (2020, p. 83); Russell & Norvig (2021, p. 997)</ref> Early drafts of the European Union's General Data Protection Regulation in 2016 included an explicit statement that this right exists.<ref group="lower-alpha">When the law was passed in 2018, it still contained a form of this provision.</ref> Industry experts noted that this is an unsolved problem with no solution in sight. Regulators argued that nevertheless the harm is real: if the problem has no solution, the tools should not be used.<ref name="FOOTNOTEChristian202091">Christian (2020), p. 91.</ref>

DARPA established the XAI ("Explainable Artificial Intelligence") program in 2014 to try and solve these problems.<ref name="FOOTNOTEChristian202083">Christian (2020), p. 83.</ref>

There are several potential solutions to the transparency problem. SHAP helps visualise the contribution of each feature to the output.<ref name="FOOTNOTEVerma2021">Verma (2021).</ref> LIME can locally approximate a model with a simpler, interpretable model.<ref name="FOOTNOTERothman2020">Rothman (2020).</ref> Multitask learning provides a large number of outputs in addition to the target classification. These other outputs can help developers deduce what the network has learned.<ref name="FOOTNOTEChristian2020105-108">Christian (2020), p. 105-108.</ref> Deconvolution, DeepDream and other generative methods can allow developers to see what different layers of a deep network have learned and produce output that can suggest what the network is learning.<ref name="FOOTNOTEChristian2020108–112">Christian (2020), pp. 108–112.</ref>

Conflict, surveillance and weaponized AI

A lethal autonomous weapon is a machine that locates, selects and engages human targets without human supervision.<ref group="lower-alpha">This is the United Nations' definition, and includes things like land mines as well.<ref name="FOOTNOTERussellNorvig2021989">Russell & Norvig (2021), p. 989.</ref></ref> By 2015, over fifty countries were reported to be researching battlefield robots.<ref>Robitzski (2018); Sainato (2015)</ref> These weapons are considered especially dangerous for several reasons: if they kill an innocent person it is not clear who should be held accountable, it is unlikely they will reliably choose targets, and, if produced at scale, they are potentially weapons of mass destruction.<ref name="FOOTNOTERussellNorvig2021987-990">Russell & Norvig (2021), p. 987-990.</ref> In 2014, 30 nations (including China) supported a ban on autonomous weapons under the United Nations' Convention on Certain Conventional Weapons, however the United States and others disagreed.<ref name="FOOTNOTERussellNorvig2021988">Russell & Norvig (2021), p. 988.</ref>

AI provides a number of tools that are particularly useful for authoritarian governments: smart spyware, face recognition and voice recognition allow widespread surveillance; such surveillance allows machine learning to classify potential enemies of the state and can prevent them from hiding; recommendation systems can precisely target propaganda and misinformation for maximum effect; deepfakes and generative AI aid in producing misinformation; advanced AI can make authoritarian centralized decision making more competitive with liberal and decentralized systems such as markets.<ref name="FOOTNOTEHarari2018">Harari (2018).</ref>

AI facial recognition systems are used for mass surveillance, notably in China.<ref>Buckley, Chris; Mozur, Paul (22 May 2019). "How China Uses High-Tech Surveillance to Subdue Minorities". The New York Times.</ref><ref>"Security lapse exposed a Chinese smart city surveillance system". 3 May 2019. Archived from the original on 7 March 2021. Retrieved 14 September 2020.</ref> In 2019, Bengaluru, India deployed AI-managed traffic signals. This system uses cameras to monitor traffic density and adjust signal timing based on the interval needed to clear traffic.<ref>"AI traffic signals to be installed in Bengaluru soon". NextBigWhat. 24 September 2019. Retrieved 1 October 2019.</ref> Terrorists, criminals and rogue states can use weaponized AI such as advanced digital warfare and lethal autonomous weapons. Machine-learning AI is also able to design tens of thousands of toxic molecules in a matter of hours.<ref name="FOOTNOTEUrbinaLentzosInvernizziEkins2022">Urbina et al. (2022).</ref>

Technological unemployment

From the early days of the development of artificial intelligence there have been arguments, for example those put forward by Joseph Weizenbaum, about whether tasks that can be done by computers actually should be done by them, given the difference between computers and humans, and between quantitative calculation and qualitative, value-based judgement.<ref>Tarnoff, Ben (4 August 2023). "Lessons from Eliza". The Guardian Weekly. pp. 34–9.</ref>

Economists have frequently highlighted the risks of redundancies from AI, and speculated about unemployment if there is no adequate social policy for full employment.<ref name="auto1">E McGaughey, 'Will Robots Automate Your Job Away? Full Employment, Basic Income, and Economic Democracy' (2022) 51(3) Industrial Law Journal 511–559 Archived 27 May 2023 at the Wayback Machine</ref>

In the past, technology has tended to increase rather than reduce total employment, but economists acknowledge that "we're in uncharted territory" with AI.<ref>Ford & Colvin (2015);McGaughey (2022)</ref> A survey of economists showed disagreement about whether the increasing use of robots and AI will cause a substantial increase in long-term unemployment, but they generally agree that it could be a net benefit if productivity gains are redistributed.<ref name="FOOTNOTEIGM Chicago2017">IGM Chicago (2017).</ref> Risk estimates vary; for example, in the 2010s, Michael Osborne and Carl Benedikt Frey estimated 47% of U.S. jobs are at "high risk" of potential automation, while an OECD report classified only 9% of U.S. jobs as "high risk".<ref group="lower-alpha">See table 4; 9% is both the OECD average and the US average.<ref name="FOOTNOTEArntzGregoryZierahn201633">Arntz, Gregory & Zierahn (2016), p. 33.</ref></ref><ref>Lohr (2017); Frey & Osborne (2017); Arntz, Gregory & Zierahn (2016, p. 33)</ref> The methodology of speculating about future employment levels has been criticised as lacking evidential foundation, and for implying that technology, rather than social policy, creates unemployment, as opposed to redundancies.<ref name="auto1"/>

Unlike previous waves of automation, many middle-class jobs may be eliminated by artificial intelligence; The Economist stated in 2015 that "the worry that AI could do to white-collar jobs what steam power did to blue-collar ones during the Industrial Revolution" is "worth taking seriously".<ref name="FOOTNOTEMorgenstern2015">Morgenstern (2015).</ref> Jobs at extreme risk range from paralegals to fast food cooks, while job demand is likely to increase for care-related professions ranging from personal healthcare to the clergy.<ref>Mahdawi (2017); Thompson (2014)</ref>

In April 2023, it was reported that 70% of the jobs for Chinese video game illustrators had been eliminated by generative artificial intelligence.<ref>Zhou, Viola (11 April 2023). "AI is already taking video game illustrators' jobs in China". Rest of World. Retrieved 17 August 2023.</ref><ref>Carter, Justin (11 April 2023). "China's game art industry reportedly decimated by growing AI use". Game Developer. Retrieved 17 August 2023.</ref>

Existential risk

It has been argued AI will become so powerful that humanity may irreversibly lose control of it. This could, as physicist Stephen Hawking stated, "spell the end of the human race".<ref name="FOOTNOTECellan-Jones2014">Cellan-Jones (2014).</ref> This scenario has been common in science fiction, when a computer or robot suddenly develops a human-like "self-awareness" (or "sentience" or "consciousness") and becomes a malevolent character.<ref group="lower-alpha">Sometimes called a "robopocalypse".<ref name="FOOTNOTERussellNorvig20211001">Russell & Norvig 2021, p. 1001.</ref></ref> These sci-fi scenarios are misleading in several ways.

First, AI does not require human-like "sentience" to be an existential risk. Modern AI programs are given specific goals and use learning and intelligence to achieve them. Philosopher Nick Bostrom argued that if one gives almost any goal to a sufficiently powerful AI, it may choose to destroy humanity to achieve it (he used the example of a paperclip factory manager).<ref name="FOOTNOTEBostrom2014">Bostrom (2014).</ref> Stuart Russell gives the example of household robot that tries to find a way to kill its owner to prevent it from being unplugged, reasoning that "you can't fetch the coffee if you're dead."<ref name="FOOTNOTERussell2019">Russell (2019).</ref> In order to be safe for humanity, a superintelligence would have to be genuinely aligned with humanity's morality and values so that it is "fundamentally on our side".<ref>Bostrom (2014); Müller & Bostrom (2014); Bostrom (2015).</ref>

Second, Yuval Noah Harari argues that AI does not require a robot body or physical control to pose an existential risk. The essential parts of civilization are not physical. Things like ideologies, law, government, money and the economy are made of language; they exist because there are stories that billions of people believe. The current prevalence of misinformation suggests that an AI could use language to convince people to believe anything, even to take actions that are destructive.<ref name="FOOTNOTEHarari2023">Harari (2023).</ref>

The opinions amongst experts and industry insiders are mixed, with sizable fractions both concerned and unconcerned by risk from eventual superintelligent AI.<ref name="FOOTNOTEMüllerBostrom2014">Müller & Bostrom (2014).</ref> Personalities such as Stephen Hawking, Bill Gates, and Elon Musk have expressed concern about existential risk from AI.<ref> Leaders' concerns about the existential risks of AI around 2015:

</ref>

In the early 2010s, experts argued that the risks are too distant in the future to warrant research or that humans will be valuable from the perspective of a superintelligent machine.<ref> Arguments that AI is not an imminent risk:

</ref> However, after 2016, the study of current and future risks and possible solutions became a serious area of research.<ref name="FOOTNOTEChristian202067, 73">Christian (2020), pp. 67, 73.</ref>

AI pioneers including Fei-Fei Li, Geoffrey Hinton, Yoshua Bengio, Cynthia Breazeal, Rana el Kaliouby, Demis Hassabis, Joy Buolamwini, and Sam Altman have expressed concerns about the risks of AI. In 2023, many leading AI experts issued the joint statement that "Mitigating the risk of extinction from AI should be a global priority alongside other societal-scale risks such as pandemics and nuclear war".<ref name="FOOTNOTEValance2023">Valance (2023).</ref>

Other researchers, however, spoke in favor of a less dystopian view. AI pioneer Juergen Schmidhuber did not sign the joint statement, emphasising that in 95% of all cases, AI research is about making "human lives longer and healthier and easier."<ref name="guardian2023">Taylor, Josh (7 May 2023). "Rise of artificial intelligence is inevitable but should not be feared, 'father of AI' says". The Guardian. Retrieved 26 May 2023.</ref> While the tools that are now being used to improve lives can also be used by bad actors, "they can also be used against the bad actors."<ref name="foxnews2023">Colton, Emma (7 May 2023). "'Father of AI' says tech fears misplaced: 'You cannot stop it'". Fox News. Retrieved 26 May 2023.</ref><ref name="forbes2023">Jones, Hessie (23 May 2023). "Juergen Schmidhuber, Renowned 'Father Of Modern AI,' Says His Life's Work Won't Lead To Dystopia". Forbes. Retrieved 26 May 2023.</ref> Andrew Ng also argued that "it's a mistake to fall for the doomsday hype on AI—and that regulators who do will only benefit vested interests."<ref name="andrewng2023">McMorrow, Ryan (19 December 2023). "Andrew Ng: 'Do we think the world is better off with more or less intelligence?'". Financial Times. Retrieved 30 December 2023.</ref> Yann LeCun "scoffs at his peers' dystopian scenarios of supercharged misinformation and even, eventually, human extinction."<ref name="lecun2023">Levy, Steven (22 December 2023). "How Not to Be Stupid About AI, With Yann LeCun". Wired. Retrieved 30 December 2023.</ref>

Limiting AI

Possible options for limiting AI include: using Embedded Ethics or Constitutional AI where companies or governments can add a policy, restricting high levels of compute power in training, restricting the ability to rewrite its own code base, restrict certain AI techniques but not in the training phase, open-source (transparency) vs proprietary (could be more restricted), backup model with redundancy, restricting security, privacy and copyright, restricting or controlling the memory, real-time monitoring, risk analysis, emergency shut-off, rigorous simulation and testing, model certification, assess known vulnerabilities, restrict the training material, restrict access to the internet, issue terms of use.

Ethical machines and alignment

Friendly AI are machines that have been designed from the beginning to minimize risks and to make choices that benefit humans. Eliezer Yudkowsky, who coined the term, argues that developing friendly AI should be a higher research priority: it may require a large investment and it must be completed before AI becomes an existential risk.<ref name="FOOTNOTEYudkowsky2008">Yudkowsky (2008).</ref>

Machines with intelligence have the potential to use their intelligence to make ethical decisions. The field of machine ethics provides machines with ethical principles and procedures for resolving ethical dilemmas.<ref name="FOOTNOTEAndersonAnderson2011">Anderson & Anderson (2011).</ref> The field of machine ethics is also called computational morality,<ref name="FOOTNOTEAndersonAnderson2011">Anderson & Anderson (2011).</ref> and was founded at an AAAI symposium in 2005.<ref name="FOOTNOTEAAAI2014">AAAI (2014).</ref>

Other approaches include Wendell Wallach's "artificial moral agents"<ref name="FOOTNOTEWallach2010">Wallach (2010).</ref> and Stuart J. Russell's three principles for developing provably beneficial machines.<ref name="FOOTNOTERussell2019173">Russell (2019), p. 173.</ref>

Frameworks

Artificial Intelligence projects can have their ethical permissibility tested while designing, developing, and implementing an AI system. An AI framework such as the Care and Act Framework containing the SUM values – developed by the Alan Turing Institute tests projects in four main areas:<ref>Alan Turing Institute (2019). "Understanding artificial intelligence ethics and safety" (PDF).</ref><ref>Alan Turing Institute (2023). "AI Ethics and Governance in Practice" (PDF).</ref>

  • RESPECT the dignity of individual people
  • CONNECT with other people sincerely, openly and inclusively
  • CARE for the wellbeing of everyone
  • PROTECT social values, justice and the public interest

Other developments in ethical frameworks include those decided upon during the Asilomar Conference, the Montreal Declaration for Responsible AI, and the IEEE's Ethics of Autonomous Systems initiative, among others;<ref>Floridi, Luciano; Cowls, Josh (23 June 2019). "A Unified Framework of Five Principles for AI in Society". Harvard Data Science Review. 1 (1). doi:10.1162/99608f92.8cd550d1. S2CID 198775713.</ref> however, these principles do not go without their criticisms, especially regards to the people chosen contributes to these frameworks.<ref>Buruk, Banu; Ekmekci, Perihan Elif; Arda, Berna (1 September 2020). "A critical perspective on guidelines for responsible and trustworthy artificial intelligence". Medicine, Health Care and Philosophy. 23 (3): 387–399. doi:10.1007/s11019-020-09948-1. ISSN 1572-8633. PMID 32236794. S2CID 214766800.</ref>

Promotion of the wellbeing of the people and communities that these technologies affect requires consideration of the social and ethical implications at all stages of AI system design, development and implementation, and collaboration between job roles such as data scientists, product managers, data engineers, domain experts, and delivery managers.<ref>Kamila, Manoj Kumar; Jasrotia, Sahil Singh (1 January 2023). "Ethical issues in the development of artificial intelligence: recognizing the risks". International Journal of Ethics and Systems. ahead-of-print (ahead-of-print). doi:10.1108/IJOES-05-2023-0107. ISSN 2514-9369. S2CID 259614124.</ref>

Regulation

AI Safety Summit
The first global AI Safety Summit was held in 2023 with a declaration calling for international co-operation.

The regulation of artificial intelligence is the development of public sector policies and laws for promoting and regulating artificial intelligence (AI); it is therefore related to the broader regulation of algorithms.<ref> Regulation of AI to mitigate risks:

</ref> The regulatory and policy landscape for AI is an emerging issue in jurisdictions globally.<ref name="FOOTNOTELaw Library of Congress (U.S.). Global Legal Research Directorate2019">Law Library of Congress (U.S.). Global Legal Research Directorate (2019).</ref> According to AI Index at Stanford, the annual number of AI-related laws passed in the 127 survey countries jumped from one passed in 2016 to 37 passed in 2022 alone.<ref name="FOOTNOTEVincent2023">Vincent (2023).</ref><ref name="FOOTNOTEStanford University2023">Stanford University (2023).</ref> Between 2016 and 2020, more than 30 countries adopted dedicated strategies for AI.<ref name="FOOTNOTEUNESCO2021">UNESCO (2021).</ref> Most EU member states had released national AI strategies, as had Canada, China, India, Japan, Mauritius, the Russian Federation, Saudi Arabia, United Arab Emirates, US and Vietnam. Others were in the process of elaborating their own AI strategy, including Bangladesh, Malaysia and Tunisia.<ref name="FOOTNOTEUNESCO2021">UNESCO (2021).</ref> The Global Partnership on Artificial Intelligence was launched in June 2020, stating a need for AI to be developed in accordance with human rights and democratic values, to ensure public confidence and trust in the technology.<ref name="FOOTNOTEUNESCO2021">UNESCO (2021).</ref> Henry Kissinger, Eric Schmidt, and Daniel Huttenlocher published a joint statement in November 2021 calling for a government commission to regulate AI.<ref name="FOOTNOTEKissinger2021">Kissinger (2021).</ref> In 2023, OpenAI leaders published recommendations for the governance of superintelligence, which they believe may happen in less than 10 years.<ref name="FOOTNOTEAltmanBrockmanSutskever2023">Altman, Brockman & Sutskever (2023).</ref> In 2023, the United Nations also launched an advisory body to provide recommendations on AI governance; the body comprises technology company executives, governments officials and academics.<ref>VOA News (25 October 2023). "UN Announces Advisory Body on Artificial Intelligence".</ref>

In a 2022 Ipsos survey, attitudes towards AI varied greatly by country; 78% of Chinese citizens, but only 35% of Americans, agreed that "products and services using AI have more benefits than drawbacks".<ref name="FOOTNOTEVincent2023">Vincent (2023).</ref> A 2023 Reuters/Ipsos poll found that 61% of Americans agree, and 22% disagree, that AI poses risks to humanity.<ref name="FOOTNOTEEdwards2023">Edwards (2023).</ref> In a 2023 Fox News poll, 35% of Americans thought it "very important", and an additional 41% thought it "somewhat important", for the federal government to regulate AI, versus 13% responding "not very important" and 8% responding "not at all important".<ref name="FOOTNOTEKasperowicz2023">Kasperowicz (2023).</ref><ref name="FOOTNOTEFox News2023">Fox News (2023).</ref>

In November 2023, the first global AI Safety Summit was held in Bletchley Park in the UK to discuss the near and far term risks of AI and the possibility of mandatory and voluntary regulatory frameworks.<ref>Milmo, Dan (3 November 2023). "Hope or Horror? The great AI debate dividing its pioneers". The Guardian Weekly. pp. 10–12.</ref> 28 countries including the United States, China, and the European Union issued a declaration at the start of the summit, calling for international co-operation to manage the challenges and risks of artificial intelligence.<ref name="2023-11-01-bletchley-declaration-full">"The Bletchley Declaration by Countries Attending the AI Safety Summit, 1-2 November 2023". GOV.UK. 1 November 2023. Archived from the original on 1 November 2023. Retrieved 2 November 2023.</ref><ref>"Countries agree to safe and responsible development of frontier AI in landmark Bletchley Declaration". GOV.UK (Press release). Archived from the original on 1 November 2023. Retrieved 1 November 2023.</ref>

History

The study of mechanical or "formal" reasoning began with philosophers and mathematicians in antiquity. The study of logic led directly to Alan Turing's theory of computation, which suggested that a machine, by shuffling symbols as simple as "0" and "1", could simulate both mathematical deduction and formal reasoning, which is known as the Church–Turing thesis.<ref name="FOOTNOTEBerlinski2000">Berlinski (2000).</ref> This, along with concurrent discoveries in cybernetics and information theory, led researchers to consider the possibility of building an "electronic brain".<ref group="lower-alpha">"Electronic brain" was the term used by the press around this time.<ref>"Google books ngram".</ref></ref><ref> AI's immediate precursors:

</ref>

Alan Turing was thinking about machine intelligence at least as early as 1941, when he circulated a paper on machine intelligence which could be the earliest paper in the field of AI – though it is now lost.<ref name="turing">Copeland, J., ed. (2004). The Essential Turing: the ideas that gave birth to the computer age. Oxford, England: Clarendon Press. ISBN 0-19-825079-7.</ref> The first available paper generally recognized as "AI" was McCullouch and Pitts design for Turing-complete "artificial neurons" in 1943 – the first mathematical model of a neural network.<ref name="FOOTNOTERussellNorvig202117">Russell & Norvig (2021), p. 17.</ref> The paper was influenced by Turing's earlier paper 'On Computable Numbers' from 1936 using similar two-state boolean 'neurons', but was the first to apply it to neuronal function.<ref name="turing" />

The term 'machine intelligence' was used by Alan Turing during his life which was later often referred to as 'artificial intelligence' after his death in 1954. In 1950 Turing published the best known of his papers 'Computing Machinery and Intelligence', the paper introduced his concept of what is now known as the Turing test to the general public. Then followed three radio broadcasts on AI by Turing, the lectures: 'Intelligent Machinery, A Heretical Theory', 'Can Digital Computers Think'? and the panel discussion 'Can Automatic Calculating Machines be Said to Think'. By 1956 computer intelligence had been actively pursued for more than a decade in Britain; the earliest AI programmes were written there in 1951–1952.<ref name="turing" />

In 1951, using a Ferranti Mark 1 computer of the University of Manchester, checkers and chess programs were written where you could play against the computer.<ref>See "A Brief History of Computing" at AlanTuring.net.</ref> The field of American AI research was founded at a workshop at Dartmouth College in 1956.<ref group="lower-alpha"> Daniel Crevier wrote, "the conference is generally recognized as the official birthdate of the new science."<ref name="FOOTNOTECrevier199347–49">Crevier (1993), pp. 47–49.</ref> Russell and Norvig called the conference "the inception of artificial intelligence."<ref name="FOOTNOTERussellNorvig202117">Russell & Norvig (2021), p. 17.</ref></ref><ref name="Dartmouth workshop"> Dartmouth workshop:

The proposal:

</ref> The attendees became the leaders of AI research in the 1960s.<ref group="lower-alpha"> Russell and Norvig wrote "for the next 20 years the field would be dominated by these people and their students."<ref name="FOOTNOTERussellNorvig200317">Russell & Norvig (2003), p. 17.</ref> </ref> They and their students produced programs that the press described as "astonishing":<ref group="lower-alpha"> Russell and Norvig wrote "it was astonishing whenever a computer did anything kind of smartish".<ref name="FOOTNOTERussellNorvig200318">Russell & Norvig (2003), p. 18.</ref> </ref> computers were learning checkers strategies, solving word problems in algebra, proving logical theorems and speaking English.<ref group="lower-alpha"> The programs described are Arthur Samuel's checkers program for the IBM 701, Daniel Bobrow's STUDENT, Newell and Simon's Logic Theorist and Terry Winograd's SHRDLU. </ref><ref name="AI in the 60s"> Successful programs the 1960s:

</ref> Artificial Intelligence laboratories were set up at a number of British and US Universities in the latter 1950s and early 1960s.<ref name="turing" />

They had, however, underestimated the difficulty of the problem.<ref group="lower-alpha">Russell and Norvig write: "in almost all cases, these early systems failed on more difficult problems"<ref name="FOOTNOTERussellNorvig202121">Russell & Norvig (2021), p. 21.</ref></ref> Both the U.S. and British governments cut off exploratory research in response to the criticism of Sir James Lighthill<ref name="FOOTNOTELighthill1973">Lighthill (1973).</ref> and ongoing pressure from the U.S. Congress to fund more productive projects. Minsky's and Papert's book Perceptrons was understood as proving that artificial neural networks would never be useful for solving real-world tasks, thus discrediting the approach altogether.<ref name="FOOTNOTERussellNorvig202122">Russell & Norvig (2021), p. 22.</ref> The "AI winter", a period when obtaining funding for AI projects was difficult, followed.<ref name="First AI winter"> First AI Winter, Lighthill report, Mansfield Amendment

</ref>

In the early 1980s, AI research was revived by the commercial success of expert systems,<ref> Expert systems:

</ref> a form of AI program that simulated the knowledge and analytical skills of human experts. By 1985, the market for AI had reached over a billion dollars. At the same time, Japan's fifth generation computer project inspired the U.S. and British governments to restore funding for academic research.<ref name="AI in the 80s"> Funding initiatives in the early 1980s: Fifth Generation Project (Japan), Alvey (UK), Microelectronics and Computer Technology Corporation (US), Strategic Computing Initiative (US):

</ref> However, beginning with the collapse of the Lisp Machine market in 1987, AI once again fell into disrepute, and a second, longer-lasting winter began.<ref name="Second AI winter"> Second AI Winter:

</ref>

Many researchers began to doubt that the current practices would be able to imitate all the processes of human cognition, especially perception, robotics, learning and pattern recognition.<ref name="FOOTNOTERussellNorvig202124">Russell & Norvig (2021), p. 24.</ref> A number of researchers began to look into "sub-symbolic" approaches.<ref name="FOOTNOTENilsson19987">Nilsson (1998), p. 7.</ref> Robotics researchers, such as Rodney Brooks, rejected "representation" in general and focussed directly on engineering machines that move and survive.<ref group="lower-alpha"></ref> Judea Pearl, Lofti Zadeh and others developed methods that handled incomplete and uncertain information by making reasonable guesses rather than precise logic.<ref name = "Uncertain reasoning"/><ref name="FOOTNOTERussellNorvig202125">Russell & Norvig (2021), p. 25.</ref> But the most important development was the revival of "connectionism", including neural network research, by Geoffrey Hinton and others.<ref>

</ref> In 1990, Yann LeCun successfully showed that convolutional neural networks can recognize handwritten digits, the first of many successful applications of neural networks.<ref name="FOOTNOTERussellNorvig202126">Russell & Norvig (2021), p. 26.</ref>

AI gradually restored its reputation in the late 1990s and early 21st century by exploiting formal mathematical methods and by finding specific solutions to specific problems. This "narrow" and "formal" focus allowed researchers to produce verifiable results and collaborate with other fields (such as statistics, economics and mathematics).<ref name="AI 1990s"> Formal and narrow methods adopted in the 1990s:

</ref> By 2000, solutions developed by AI researchers were being widely used, although in the 1990s they were rarely described as "artificial intelligence".<ref name="AI widely used 1990s"> AI widely used in the late 1990s:

</ref>

Several academic researchers became concerned that AI was no longer pursuing the original goal of creating versatile, fully intelligent machines. Beginning around 2002, they founded the subfield of artificial general intelligence (or "AGI"), which had several well-funded institutions by the 2010s.<ref name="AGI"/>

Deep learning began to dominate industry benchmarks in 2012 and was adopted throughout the field.<ref name="Deep learning revolution"> Deep learning revolution, AlexNet:

</ref> For many specific tasks, other methods were abandoned.<ref group="lower-alpha">Matteo Wong wrote in The Atlantic: "Whereas for decades, computer-science fields such as natural-language processing, computer vision, and robotics used extremely different methods, now they all use a programming method called "deep learning." As a result, their code and approaches have become more similar, and their models are easier to integrate into one another."<ref name="FOOTNOTEWong2023">Wong (2023).</ref></ref> Deep learning's success was based on both hardware improvements (faster computers,<ref name="Moore's Law"> Moore's Law and AI:

</ref> graphics processing units, cloud computing<ref name="FOOTNOTEClark2015b">Clark (2015b).</ref>) and access to large amounts of data<ref name="Big data"> Big data:

</ref> (including curated datasets,<ref name="FOOTNOTEClark2015b">Clark (2015b).</ref> such as ImageNet).

Deep learning's success led to an enormous increase in interest and funding in AI.<ref group="lower-alpha">Jack Clark wrote in Bloomberg: "After a half-decade of quiet breakthroughs in artificial intelligence, 2015 has been a landmark year. Computers are smarter and learning faster than ever," and noted that the number of software projects that use machine learning at Google increased from a "sporadic usage" in 2012 to more than 2,700 projects in 2015.<ref name="FOOTNOTEClark2015b">Clark (2015b).</ref></ref> The amount of machine learning research (measured by total publications) increased by 50% in the years 2015–2019,<ref name="FOOTNOTEUNESCO2021">UNESCO (2021).</ref> and WIPO reported that AI was the most prolific emerging technology in terms of the number of patent applications and granted patents.<ref>"Intellectual Property and Frontier Technologies". WIPO. Archived from the original on 2 April 2022. Retrieved 30 March 2022.</ref> According to 'AI Impacts', about $50 billion annually was invested in "AI" around 2022 in the US alone and about 20% of new US Computer Science PhD graduates have specialized in "AI";<ref name="FOOTNOTEDiFeliciantonio2023">DiFeliciantonio (2023).</ref> about 800,000 "AI"-related US job openings existed in 2022.<ref name="FOOTNOTEGoswami2023">Goswami (2023).</ref> The large majority of the advances have occurred within the United States, with its companies, universities, and research labs leading artificial intelligence research.<ref name="FOOTNOTEFrank2023">Frank (2023).</ref>

In 2016, issues of fairness and the misuse of technology were catapulted into center stage at machine learning conferences, publications vastly increased, funding became available, and many researchers re-focussed their careers on these issues. The alignment problem became a serious field of academic study.<ref name="FOOTNOTEChristian202067, 73">Christian (2020), pp. 67, 73.</ref>

Philosophy

Defining artificial intelligence

Alan Turing wrote in 1950 "I propose to consider the question 'can machines think'?"<ref name="FOOTNOTETuring19501">Turing (1950), p. 1.</ref> He advised changing the question from whether a machine "thinks", to "whether or not it is possible for machinery to show intelligent behaviour".<ref name="FOOTNOTETuring19501">Turing (1950), p. 1.</ref> He devised the Turing test, which measures the ability of a machine to simulate human conversation.<ref name="Turing test"> Turing's original publication of the Turing test in "Computing machinery and intelligence":

Historical influence and philosophical implications:

</ref> Since we can only observe the behavior of the machine, it does not matter if it is "actually" thinking or literally has a "mind". Turing notes that we can not determine these things about other people<ref group="lower-alpha">See Problem of other minds</ref> but "it is usual to have a polite convention that everyone thinks"<ref name="FOOTNOTETuring1950Under "The Argument from Consciousness"">Turing (1950), Under "The Argument from Consciousness".</ref>

Russell and Norvig agree with Turing that AI must be defined in terms of "acting" and not "thinking".<ref name="FOOTNOTERussellNorvig2021chpt. 2">Russell & Norvig (2021), chpt. 2.</ref> However, they are critical that the test compares machines to people. "Aeronautical engineering texts," they wrote, "do not define the goal of their field as making 'machines that fly so exactly like pigeons that they can fool other pigeons.Template:' "<ref name="FOOTNOTERussellNorvig20213">Russell & Norvig (2021), p. 3.</ref> AI founder John McCarthy agreed, writing that "Artificial intelligence is not, by definition, simulation of human intelligence".<ref name="FOOTNOTEMaker2006">Maker (2006).</ref>

McCarthy defines intelligence as "the computational part of the ability to achieve goals in the world."<ref name="FOOTNOTEMcCarthy1999">McCarthy (1999).</ref> Another AI founder, Marvin Minsky similarly defines it as "the ability to solve hard problems".<ref name="FOOTNOTEMinsky1986">Minsky (1986).</ref> These definitions view intelligence in terms of well-defined problems with well-defined solutions, where both the difficulty of the problem and the performance of the program are direct measures of the "intelligence" of the machine—and no other philosophical discussion is required, or may not even be possible.

Another definition has been adopted by Google,<ref>"What Is Artificial Intelligence (AI)?". Google Cloud Platform. Archived from the original on 31 July 2023. Retrieved 16 October 2023.</ref> a major practitioner in the field of AI. This definition stipulates the ability of systems to synthesize information as the manifestation of intelligence, similar to the way it is defined in biological intelligence.

Evaluating approaches to AI

No established unifying theory or paradigm has guided AI research for most of its history.<ref group="lower-alpha">Nils Nilsson wrote in 1983: "Simply put, there is wide disagreement in the field about what AI is all about."<ref name="FOOTNOTENilsson198310">Nilsson (1983), p. 10.</ref></ref> The unprecedented success of statistical machine learning in the 2010s eclipsed all other approaches (so much so that some sources, especially in the business world, use the term "artificial intelligence" to mean "machine learning with neural networks"). This approach is mostly sub-symbolic, soft and narrow (see below). Critics argue that these questions may have to be revisited by future generations of AI researchers.

Symbolic AI and its limits

Symbolic AI (or "GOFAI")<ref name="FOOTNOTEHaugeland1985112–117">Haugeland (1985), pp. 112–117.</ref> simulated the high-level conscious reasoning that people use when they solve puzzles, express legal reasoning and do mathematics. They were highly successful at "intelligent" tasks such as algebra or IQ tests. In the 1960s, Newell and Simon proposed the physical symbol systems hypothesis: "A physical symbol system has the necessary and sufficient means of general intelligent action."<ref name="Physical symbol system hypothesis"> Physical symbol system hypothesis:

Historical significance:

</ref>

However, the symbolic approach failed on many tasks that humans solve easily, such as learning, recognizing an object or commonsense reasoning. Moravec's paradox is the discovery that high-level "intelligent" tasks were easy for AI, but low level "instinctive" tasks were extremely difficult.<ref> Moravec's paradox:

</ref> Philosopher Hubert Dreyfus had argued since the 1960s that human expertise depends on unconscious instinct rather than conscious symbol manipulation, and on having a "feel" for the situation, rather than explicit symbolic knowledge.<ref name="Dreyfus' critique"> Dreyfus' critique of AI:

Historical significance and philosophical implications:

</ref> Although his arguments had been ridiculed and ignored when they were first presented, eventually, AI research came to agree with him.<ref group="lower-alpha"> Daniel Crevier wrote that "time has proven the accuracy and perceptiveness of some of Dreyfus's comments. Had he formulated them less aggressively, constructive actions they suggested might have been taken much earlier."<ref name="FOOTNOTECrevier1993125">Crevier (1993), p. 125.</ref> </ref><ref name="Psychological evidence of sub-symbolic reasoning"/>

The issue is not resolved: sub-symbolic reasoning can make many of the same inscrutable mistakes that human intuition does, such as algorithmic bias. Critics such as Noam Chomsky argue continuing research into symbolic AI will still be necessary to attain general intelligence,<ref name="FOOTNOTELangley2011">Langley (2011).</ref><ref name="FOOTNOTEKatz2012">Katz (2012).</ref> in part because sub-symbolic AI is a move away from explainable AI: it can be difficult or impossible to understand why a modern statistical AI program made a particular decision. The emerging field of neuro-symbolic artificial intelligence attempts to bridge the two approaches.

Neat vs. scruffy

"Neats" hope that intelligent behavior is described using simple, elegant principles (such as logic, optimization, or neural networks). "Scruffies" expect that it necessarily requires solving a large number of unrelated problems. Neats defend their programs with theoretical rigor, scruffies rely mainly on incremental testing to see if they work. This issue was actively discussed in the 1970s and 1980s,<ref name="Neats vs. scruffies"> Neats vs. scruffies, the historic debate:

A classic example of the "scruffy" approach to intelligence:

A modern example of neat AI and its aspirations in the 21st century:

</ref> but eventually was seen as irrelevant. Modern AI has elements of both.

Soft vs. hard computing

Finding a provably correct or optimal solution is intractable for many important problems.<ref name="Intractability"/> Soft computing is a set of techniques, including genetic algorithms, fuzzy logic and neural networks, that are tolerant of imprecision, uncertainty, partial truth and approximation. Soft computing was introduced in the late 1980s and most successful AI programs in the 21st century are examples of soft computing with neural networks.

Narrow vs. general AI

AI researchers are divided as to whether to pursue the goals of artificial general intelligence and superintelligence directly or to solve as many specific problems as possible (narrow AI) in hopes these solutions will lead indirectly to the field's long-term goals.<ref name="FOOTNOTEPennachinGoertzel2007">Pennachin & Goertzel (2007).</ref><ref name="FOOTNOTERoberts2016">Roberts (2016).</ref> General intelligence is difficult to define and difficult to measure, and modern AI has had more verifiable successes by focusing on specific problems with specific solutions. The experimental sub-field of artificial general intelligence studies this area exclusively.

Machine consciousness, sentience and mind

The philosophy of mind does not know whether a machine can have a mind, consciousness and mental states, in the same sense that human beings do. This issue considers the internal experiences of the machine, rather than its external behavior. Mainstream AI research considers this issue irrelevant because it does not affect the goals of the field: to build machines that can solve problems using intelligence. Russell and Norvig add that "[t]he additional project of making a machine conscious in exactly the way humans are is not one that we are equipped to take on."<ref name="FOOTNOTERussellNorvig2021986">Russell & Norvig (2021), p. 986.</ref> However, the question has become central to the philosophy of mind. It is also typically the central question at issue in artificial intelligence in fiction.

Consciousness

David Chalmers identified two problems in understanding the mind, which he named the "hard" and "easy" problems of consciousness.<ref name="FOOTNOTEChalmers1995">Chalmers (1995).</ref> The easy problem is understanding how the brain processes signals, makes plans and controls behavior. The hard problem is explaining how this feels or why it should feel like anything at all, assuming we are right in thinking that it truly does feel like something (Dennett's consciousness illusionism says this is an illusion). Human information processing is easy to explain, however, human subjective experience is difficult to explain. For example, it is easy to imagine a color-blind person who has learned to identify which objects in their field of view are red, but it is not clear what would be required for the person to know what red looks like.<ref name="FOOTNOTEDennett1991">Dennett (1991).</ref>

Computationalism and functionalism

Computationalism is the position in the philosophy of mind that the human mind is an information processing system and that thinking is a form of computing. Computationalism argues that the relationship between mind and body is similar or identical to the relationship between software and hardware and thus may be a solution to the mind–body problem. This philosophical position was inspired by the work of AI researchers and cognitive scientists in the 1960s and was originally proposed by philosophers Jerry Fodor and Hilary Putnam.<ref name="FOOTNOTEHorst2005">Horst (2005).</ref>

Philosopher John Searle characterized this position as "strong AI": "The appropriately programmed computer with the right inputs and outputs would thereby have a mind in exactly the same sense human beings have minds."<ref name="Searle's strong AI" group="lower-alpha"> Searle presented this definition of "Strong AI" in 1999.<ref name="FOOTNOTESearle1999">Searle (1999).</ref> Searle's original formulation was "The appropriately programmed computer really is a mind, in the sense that computers given the right programs can be literally said to understand and have other cognitive states."<ref name="FOOTNOTESearle19801">Searle (1980), p. 1.</ref> Strong AI is defined similarly by Russell and Norvig: "Stong AI – the assertion that machines that do so are actually thinking (as opposed to simulating thinking)."<ref name="FOOTNOTERussellNorvig20219817">Russell & Norvig (2021), p. 9817.</ref> </ref> Searle counters this assertion with his Chinese room argument, which attempts to show that, even if a machine perfectly simulates human behavior, there is still no reason to suppose it also has a mind.<ref name="Chinese room"> Searle's Chinese room argument:

Discussion:

</ref>

AI welfare and rights

It is difficult or impossible to reliably evaluate whether an advanced AI is sentient (has the ability to feel), and if so, to what degree.<ref>Leith, Sam (7 July 2022). "Nick Bostrom: How can we be certain a machine isn't conscious?". The Spectator. Retrieved 23 February 2024.</ref> But if there is a significant chance that a given machine can feel and suffer, then it may be entitled to certain rights or welfare protection measures, similarly to animals.<ref name=":02">Thomson, Jonny (31 October 2022). "Why don't robots have rights?". Big Think. Retrieved 23 February 2024.</ref><ref name=":12">Kateman, Brian (24 July 2023). "AI Should Be Terrified of Humans". TIME. Retrieved 23 February 2024.</ref> Sapience (a set of capacities related to high intelligence, such as discernment or self-awareness) may provide another moral basis for AI rights.<ref name=":02" /> Robot rights are also sometimes proposed as a practical way to integrate autonomous agents into society.<ref>Wong, Jeff (10 July 2023). "What leaders need to know about robot rights". Fast Company.</ref>

In 2017, the European Union considered granting "electronic personhood" to some of the most capable AI systems. Similarly to the legal status of companies, it would have conferred rights but also responsibilities.<ref>Hern, Alex (12 January 2017). "Give robots 'personhood' status, EU committee argues". The Guardian. ISSN 0261-3077. Retrieved 23 February 2024.</ref> Critics argued in 2018 that granting rights to AI systems would downplay the importance of human rights, and that legislation should focus on user needs rather than speculative futuristic scenarios. They also noted that robots lacked the autonomy to take part to society on their own.<ref>Dovey, Dana (14 April 2018). "Experts Don't Think Robots Should Have Rights". Newsweek. Retrieved 23 February 2024.</ref><ref>Cuddy, Alice (13 April 2018). "Robot rights violate human rights, experts warn EU". euronews. Retrieved 23 February 2024.</ref>

Progress in AI increased interest in the topic. Proponents of AI welfare and rights often argue that AI sentience, if it emerges, would be particularly easy to deny. They warn that this may be a moral blind spot analogous to slavery or factory farming, which could lead to large-scale suffering if sentient AI is created and carelessly exploited.<ref name=":12" /><ref name=":02" />

Future

Superintelligence and the singularity

A superintelligence is a hypothetical agent that would possess intelligence far surpassing that of the brightest and most gifted human mind.<ref name="FOOTNOTERoberts2016">Roberts (2016).</ref>

If research into artificial general intelligence produced sufficiently intelligent software, it might be able to reprogram and improve itself. The improved software would be even better at improving itself, leading to what I. J. Good called an "intelligence explosion" and Vernor Vinge called a "singularity".<ref name = "Singularity"> The Intelligence explosion and technological singularity:

I. J. Good's "intelligence explosion"

Vernor Vinge's "singularity"

</ref>

However, technologies cannot improve exponentially indefinitely, and typically follow an S-shaped curve, slowing when they reach the physical limits of what the technology can do.<ref name="FOOTNOTERussellNorvig20211005">Russell & Norvig (2021), p. 1005.</ref>

Transhumanism

Robot designer Hans Moravec, cyberneticist Kevin Warwick, and inventor Ray Kurzweil have predicted that humans and machines will merge in the future into cyborgs that are more capable and powerful than either. This idea, called transhumanism, has roots in Aldous Huxley and Robert Ettinger.<ref> Transhumanism:

</ref>

Edward Fredkin argues that "artificial intelligence is the next stage in evolution", an idea first proposed by Samuel Butler's "Darwin among the Machines" as far back as 1863, and expanded upon by George Dyson in his book of the same name in 1998.<ref> AI as evolution:

</ref>

In fiction

The word "robot" itself was coined by Karel Čapek in his 1921 play R.U.R., the title standing for "Rossum's Universal Robots".

Thought-capable artificial beings have appeared as storytelling devices since antiquity,<ref name="AI in myth"> AI in myth:

</ref> and have been a persistent theme in science fiction.<ref name="FOOTNOTEMcCorduck2004340–400">McCorduck (2004), pp. 340–400.</ref>

A common trope in these works began with Mary Shelley's Frankenstein, where a human creation becomes a threat to its masters. This includes such works as Arthur C. Clarke's and Stanley Kubrick's 2001: A Space Odyssey (both 1968), with HAL 9000, the murderous computer in charge of the Discovery One spaceship, as well as The Terminator (1984) and The Matrix (1999). In contrast, the rare loyal robots such as Gort from The Day the Earth Stood Still (1951) and Bishop from Aliens (1986) are less prominent in popular culture.<ref name="FOOTNOTEButtazzo2001">Buttazzo (2001).</ref>

Isaac Asimov introduced the Three Laws of Robotics in many books and stories, most notably the "Multivac" series about a super-intelligent computer of the same name. Asimov's laws are often brought up during lay discussions of machine ethics;<ref name="FOOTNOTEAnderson2008">Anderson (2008).</ref> while almost all artificial intelligence researchers are familiar with Asimov's laws through popular culture, they generally consider the laws useless for many reasons, one of which is their ambiguity.<ref name="FOOTNOTEMcCauley2007">McCauley (2007).</ref>

Several works use AI to force us to confront the fundamental question of what makes us human, showing us artificial beings that have the ability to feel, and thus to suffer. This appears in Karel Čapek's R.U.R., the films A.I. Artificial Intelligence and Ex Machina, as well as the novel Do Androids Dream of Electric Sheep?, by Philip K. Dick. Dick considers the idea that our understanding of human subjectivity is altered by technology created with artificial intelligence.<ref name="FOOTNOTEGalvan1997">Galvan (1997).</ref>

See also

Explanatory notes

<references group="lower-alpha" responsive="1"></references>

References

<references group="" responsive="0"></references>

AI textbooks

The two most widely used textbooks in 2023. (See the Open Syllabus).

These were the four of the most widely used AI textbooks in 2008:

Later editions.

History of AI

Other sources

Further reading

  • Ashish Vaswani, Noam Shazeer, Niki Parmar et al. "Attention is all you need." Advances in neural information processing systems 30 (2017). Seminal paper on transformers.
  • Autor, David H., "Why Are There Still So Many Jobs? The History and Future of Workplace Automation" (2015) 29(3) Journal of Economic Perspectives 3.
  • Boden, Margaret, Mind As Machine, Oxford University Press, 2006.
  • Cukier, Kenneth, "Ready for Robots? How to Think about the Future of AI", Foreign Affairs, vol. 98, no. 4 (July/August 2019), pp. 192–98. George Dyson, historian of computing, writes (in what might be called "Dyson's Law") that "Any system simple enough to be understandable will not be complicated enough to behave intelligently, while any system complicated enough to behave intelligently will be too complicated to understand." (p. 197.) Computer scientist Alex Pentland writes: "Current AI machine-learning algorithms are, at their core, dead simple stupid. They work, but they work by brute force." (p. 198.)
  • Domingos, Pedro, "Our Digital Doubles: AI will serve our species, not control it", Scientific American, vol. 319, no. 3 (September 2018), pp. 88–93. "AIs are like autistic savants and will remain so for the foreseeable future.... AIs lack common sense and can easily make errors that a human never would... They are also liable to take our instructions too literally, giving us precisely what we asked for instead of what we actually wanted." (p. 93.)
  • Gertner, Jon. (2023) "Wikipedia's Moment of Truth: Can the online encyclopedia help teach A.I. chatbots to get their facts right — without destroying itself in the process?" New York Times Magazine (July 18, 2023) online
  • Gleick, James, "The Fate of Free Will" (review of Kevin J. Mitchell, Free Agents: How Evolution Gave Us Free Will, Princeton University Press, 2023, 333 pp.), The New York Review of Books, vol. LXXI, no. 1 (18 January 2024), pp. 27–28, 30. "Agency is what distinguishes us from machines. For biological creatures, reason and purpose come from acting in the world and experiencing the consequences. Artificial intelligences – disembodied, strangers to blood, sweat, and tears – have no occasion for that." (p. 30.)
  • Hanna, Alex, and Emily M. Bender, "Theoretical AI Harms Are a Distraction: Fearmongering about artificial intelligence's potential to end humanity shrouds the real harm it already causes", Scientific American, vol 330, no. 2 (February 2024), pp. 69–70. "[H]ype [about "existential risks"] surrounds many AI firms, but their technology already enables myriad harms, including... discrimination in housing, criminal justice, and health care, as well as the spread of hate speech and misinformation... Large language models extrude... fluent... coherent-seeming text but have no understanding of what the text means, let alone the ability to reason.... (p. 69.) [T]hat output... becomes a noxious... insidious pollutant of our information ecosystem.... [T]oo many... publications [about] AI come from corporate labs or... academic groups that receive... industry funding. Many of these publications are based on junk science [that] is nonreproducible... is full of hype, and uses evaluation methods that do not measure what they purport to... Meanwhile 'AI doomers' cite this junk science... to [misdirect] attention [to] the fantasy of all-powerful machines possibly going rogue and destroying humanity." (p. 70.)
  • Hughes-Castleberry, Kenna, "A Murder Mystery Puzzle: The literary puzzle Cain's Jawbone, which has stumped humans for decades, reveals the limitations of natural-language-processing algorithms", Scientific American, vol. 329, no. 4 (November 2023), pp. 81–82. "This murder mystery competition has revealed that although NLP (natural-language processing) models are capable of incredible feats, their abilities are very much limited by the amount of context they receive. This [...] could cause [difficulties] for researchers who hope to use them to do things such as analyze ancient languages. In some cases, there are few historical records on long-gone civilizations to serve as training data for such a purpose." (p. 82.)
  • Immerwahr, Daniel, "Your Lying Eyes: People now use A.I. to generate fake videos indistinguishable from real ones. How much does it matter?", The New Yorker, 20 November 2023, pp. 54–59. "If by 'deepfakes' we mean realistic videos produced using artificial intelligence that actually deceive people, then they barely exist. The fakes aren't deep, and the deeps aren't fake. [...] A.I.-generated videos are not, in general, operating in our media as counterfeited evidence. Their role better resembles that of cartoons, especially smutty ones." (p. 59.)
  • Johnston, John (2008) The Allure of Machinic Life: Cybernetics, Artificial Life, and the New AI, MIT Press.
  • Jumper, John; Evans, Richard; Pritzel, Alexander; et al. (26 August 2021). "Highly accurate protein structure prediction with AlphaFold". Nature. 596 (7873): 583–589. Bibcode:2021Natur.596..583J. doi:10.1038/s41586-021-03819-2. PMC 8371605. PMID 34265844. S2CID 235959867.
  • LeCun, Yann; Bengio, Yoshua; Hinton, Geoffrey (28 May 2015). "Deep learning". Nature. 521 (7553): 436–444. Bibcode:2015Natur.521..436L. doi:10.1038/nature14539. PMID 26017442. S2CID 3074096. Archived from the original on 5 June 2023. Retrieved 19 June 2023.
  • Marcus, Gary, "Am I Human?: Researchers need new ways to distinguish artificial intelligence from the natural kind", Scientific American, vol. 316, no. 3 (March 2017), pp. 61–63. Marcus points out a so far insuperable stumbling block to artificial intelligence: an incapacity for reliable disambiguation. "[V]irtually every sentence [that people generate] is ambiguous, often in multiple ways. Our brain is so good at comprehending language that we do not usually notice." A prominent example is the "pronoun disambiguation problem" ("PDP"): a machine has no way of determining to whom or what a pronoun in a sentence—such as "he", "she" or "it"—refers.
  • Marcus, Gary, "Artificial Confidence: Even the newest, buzziest systems of artificial general intelligence are stymmied by the same old problems", Scientific American, vol. 327, no. 4 (October 2022), pp. 42–45.
  • Mitchell, Melanie (2019). Artificial intelligence: a guide for thinking humans. New York: Farrar, Straus and Giroux. ISBN 9780374257835.
  • Mnih, Volodymyr; Kavukcuoglu, Koray; Silver, David; et al. (26 February 2015). "Human-level control through deep reinforcement learning". Nature. 518 (7540): 529–533. Bibcode:2015Natur.518..529M. doi:10.1038/nature14236. PMID 25719670. S2CID 205242740. Archived from the original on 19 June 2023. Retrieved 19 June 2023. Introduced DQN, which produced human-level performance on some Atari games.
  • Press, Eyal, "In Front of Their Faces: Does facial-recognition technology lead police to ignore contradictory evidence?", The New Yorker, 20 November 2023, pp. 20–26.
  • Roivainen, Eka, "AI's IQ: ChatGPT aced a [standard intelligence] test but showed that intelligence cannot be measured by IQ alone", Scientific American, vol. 329, no. 1 (July/August 2023), p. 7. "Despite its high IQ, ChatGPT fails at tasks that require real humanlike reasoning or an understanding of the physical and social world.... ChatGPT seemed unable to reason logically and tried to rely on its vast database of... facts derived from online texts."
  • Serenko, Alexander; Michael Dohan (2011). "Comparing the expert survey and citation impact journal ranking methods: Example from the field of Artificial Intelligence" (PDF). Journal of Informetrics. 5 (4): 629–49. doi:10.1016/j.joi.2011.06.002. Archived (PDF) from the original on 4 October 2013. Retrieved 12 September 2013.
  • Silver, David; Huang, Aja; Maddison, Chris J.; et al. (28 January 2016). "Mastering the game of Go with deep neural networks and tree search". Nature. 529 (7587): 484–489. Bibcode:2016Natur.529..484S. doi:10.1038/nature16961. PMID 26819042. S2CID 515925. Archived from the original on 18 June 2023. Retrieved 19 June 2023.
  • White Paper: On Artificial Intelligence – A European approach to excellence and trust (PDF). Brussels: European Commission. 2020. Archived (PDF) from the original on 20 February 2020. Retrieved 20 February 2020.

External links

Lua error in Module:Sister_project_links at line 469: attempt to index field 'wikibase' (a nil value).

Lua error in Module:Authority_control at line 181: attempt to index field 'wikibase' (a nil value).