AI Glossary

Appendix F. Glossary

Acronyms

  • BOW: Bag of words - A data structure (usually a vector) that retains the counts (frequencies) of words but not their order (see the counting sketch after this list)
  • DFA: Deterministic finite automaton - A finite state machine that doesn’t make random choices. The re package in Python compiles regular expressions to create a DFA, but the regex package can compile fuzzy regular expressions into an NDFA (nondeterministic finite automaton); a fuzzy-matching sketch appears after this list.
  • GIS: Geographic information system - A database for storing, manipulating, and displaying geographic information, usually involving latitude, longitude, and altitude coordinates and traces.
  • GRU: Gated recurrent unit - A variation of long short-term memory networks with shared parameters to cut computation time
  • HitLRL or HLRL: Human in the Loop Reinforcement Learning - An active learning approach to model training used for conversational LLMs such as InstructGPT and for game-playing deep learning models such as AlphaGo. Conversational LLMs such as InstructGPT use reinforcement learning augmented with human curators in order to keep up with the evolution of language and concepts. The human labelers identify whether generated text falls within the ethical and quality guidelines for the model. Unlike conventional RL, these labels are used to train a quality-scoring supervisor model that is then used to flag future bot responses for labeling.
  • HPC: High performance computing - The study of systems that maximize throughput, usually by parallelizing computation with separate map and reduce computation stages
  • IR: Information retrieval - The study of document and web search engine algorithms. This is what brought NLP to the forefront of important computer science disciplines in the 90s.
  • LLM: Large language model - If you scale up a transformer-based language model to web scale, using millions of dollars in compute resources to train it on a large portion of the natural language text on the Internet, that’s a Large Language Model.
  • LSH: Locality sensitive hash - A hash that works as an efficient but approximate mapping/clustering index on dense, continuous, high-dimensional vectors (see chapter 13). Think of them as ZIP Codes that work for more than just 2D (latitude and longitude).
  • LSTM: Long short-term memory - An enhanced form of a recurrent neural network that maintains a memory of state that itself is trained via backpropagation (see chapter 9)
  • ML: Machine learning - Programming a machine with data rather than hand-coded algorithms
  • MSE: Mean squared error - The mean of the squared differences between the desired outputs of a machine learning model and the model’s actual outputs (see the worked example after this list, which also computes RMSE)
  • NELL: Never Ending Language Learning - A Carnegie Mellon knowledge extraction project that has been running continuously for years, scraping web pages and extracting general knowledge about the world (mostly “IS-A” categorical relationships between terms)
  • NLG: Natural language generation - Composing text automatically, algorithmically; one of the most challenging tasks of natural language processing (NLP)
  • pip: Pip installs pip - The official Python package manager that downloads and installs packages automatically from the “Cheese Shop” (pypi.python.org)
  • PR: Pull request - The right way to request that someone merge your code into theirs. GitHub has some buttons and wizards to make this easy. This is how you can build your reputation as a conscientious contributor to open source.
  • RAG: Retrieval-Augmented Generation - A way to increase the accuracy and reliability of generative language models by using a retrieval model to fetch relevant data from a database or knowledge graph to serve as a base for the generation step.
  • ReLU: Rectified linear unit - A piecewise-linear neural net activation function that clips negative values to zero, so the output of a neuron is always nonnegative. Equivalent to y = np.maximum(x, 0). The most popular and efficient activation function for image processing and NLP, because it allows backpropagation to work efficiently on extremely deep networks without “vanishing the gradients.” (See the sketch after this list.)
  • REPL: Read–evaluate–print loop - The typical workflow of a developer of any scripting language that doesn’t need to be compiled. The ipython, jupyter console, and jupyter notebook REPLs are particularly powerful, with their help, ?, ??, and % magic commands, plus auto-complete, and Ctrl-R history search.[1]
  • RMSE: Root mean square error - The square root of the mean squared error. A common regression error metric. It can also be used for binary and ordinal classification problems. It provides an intuitive estimate of the 1-sigma uncertainty in a model’s predictions.
  • RNN: Recurrent neural network - A neural network architecture that feeds the outputs of one layer into the input of an earlier layer. RNNs are often “unfolded” into equivalent feed forward neural networks for diagramming and analysis.
  • SVD: Singular value decomposition - A matrix factorization that produces a diagonal matrix of singular values and two orthogonal matrices of singular vectors. It’s the math behind LSA and PCA (see chapter 4 and the sketch after this list).
  • UI: User interface - The “affordances” you offer your user through your software, often the graphical web pages or mobile application screens that your user must interact with to use your product or service
  • UX: User experience - The nature of a customer’s interaction with your product or company, from purchase all the way through to their last contact with you. This includes your website or API UI and all the other interactions with your company.
  • VSM: Vector space model - A vector representation of the objects in your problem, such as words or documents in an NLP problem (see chapter 4 and chapter 6)
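
A minimal bag-of-words sketch using Python’s collections.Counter; the example sentence is made up for illustration. In practice the counts are usually packed into a fixed-length vector indexed by a vocabulary.

    from collections import Counter

    text = "the quick brown fox jumps over the lazy brown fox"
    bow = Counter(text.lower().split())             # word counts; word order is discarded
    print(bow["brown"], bow["fox"], bow["jumps"])   # 2 2 1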
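
The DFA entry mentions both the standard-library re package and the third-party regex package. A small sketch, assuming regex is installed (pip install regex); its {e<=1} syntax allows one edit (insertion, deletion, or substitution):

    import re
    import regex   # third-party package with approximate (fuzzy) matching

    exact = re.compile(r"colou?r")              # compiled into a deterministic matcher
    print(bool(exact.match("colour")))          # True

    fuzzy = regex.compile(r"(?:color){e<=1}")   # tolerate up to one edit
    print(bool(fuzzy.fullmatch("colr")))        # True: one deletion away from "color"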
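
A worked MSE and RMSE example with numpy; the target and prediction values are made up:

    import numpy as np

    y_true = np.array([3.0, -0.5, 2.0, 7.0])   # desired model outputs
    y_pred = np.array([2.5,  0.0, 2.0, 8.0])   # actual model outputs

    mse = np.mean((y_true - y_pred) ** 2)      # mean of the squared errors
    rmse = np.sqrt(mse)                        # back in the units of the target
    print(mse, rmse)                           # 0.375 0.6123...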
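
A minimal ReLU sketch; np.maximum takes the element-wise maximum against zero, so negative activations are clipped and positive ones pass through unchanged:

    import numpy as np

    def relu(x):
        # clip negative activations to zero
        return np.maximum(x, 0)

    print(relu(np.array([-2.0, -0.5, 0.0, 1.5])))   # [0.  0.  0.  1.5]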
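
A small SVD sketch with numpy.linalg.svd; the term-document count matrix is made up:

    import numpy as np

    A = np.array([[2., 0., 1.],    # rows = terms, columns = documents
                  [0., 3., 1.],
                  [1., 1., 0.]])

    U, s, Vt = np.linalg.svd(A)                   # A == U @ diag(s) @ Vt (up to rounding)
    print(np.allclose(A, U @ np.diag(s) @ Vt))    # True
    print(s)                                      # singular values, largest first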

Terms

  • Affordance: A way for your user to interact with your product that you intentionally make possible. Ideally that interaction should come naturally to the user and be easily discoverable and self-documenting.
  • Cell: The memory or state part of an LSTM unit that records a single scalar value and outputs it continuously.[2]
  • Dark patterns: Software patterns (usually for a user interface) that are intended to increase revenue but often fail due to “blowback” because they manipulate your customers into using your product in ways that they don’t intend
  • Feed-forward network: A “one-way” neural network that passes all its inputs through to its outputs in a consistent direction, forming a directed acyclic graph (DAG) or tree
  • Grounding: A method to improve the accuracy of large language models and reduce hallucinations by making the model base its answers on data retrieved from a document database.
  • Guardrails: Ways of controlling the output of a large language model, such as enforcing a response format or preventing the model from discussing certain topics.
  • Hallucinations: A common problem with generative language models, where the model generates text that seems plausible but is actually not true or accurate.
  • Intent: A category of user intentions that a conversational system is designed to recognize and respond to.
  • Morpheme: A part of a token or word that contains meaning in and of itself. The morphemes that make up a token are collectively called the token’s morphology. The morphology of a token can be found using algorithms in packages like spaCy that process the token with its context (the words around it).[3]
  • Neuron: A unit in a neural net whose function (such as y = tanh(w.dot(x))) takes multiple inputs and outputs a single scalar value. This value is usually the weights for that neuron (w or w_i) multiplied by all the input signals (x or x_i) and summed with a bias weight (w_0) before applying an activation function like tanh (see the sketch after this list). A neuron always outputs a scalar value, which is sent to the inputs of any additional hidden or output neurons in the network. If a neuron implements a much more complicated activation function than that, like the enhancements that were made to recurrent neurons to create an LSTM, it is usually called a unit, for example, an LSTM unit.
  • Nessvector: An informal term for topic vectors or semantic vectors that capture concepts or qualities, such as femaleness or blueness, into the dimensions of a vector
  • Predicate: In English grammar, the predicate is the main verb of a sentence that’s associated with the subject. Every complete sentence must have a predicate, just like it must also have a subject.
  • Skip-grams: Pairs of tokens used as training examples for a word vector embedding, where any number of intervening words are ignored (see chapter 6 and the sketch after this list).
  • Softmax: Normalized exponential function used to squash the real-valued vector output by a neural network so that its values lie between 0 and 1 and sum to 1, like probabilities (see the sketch after this list).
  • Subject: The main noun of a sentence: every complete sentence must have a subject (and a predicate), even if the subject is implied, like in the sentence “Run!” where the implied subject is “you.”
  • Transformers: A type of artificial neural network that uses a mechanism called attention. Large transformers trained on internet-sized datasets are often called Large Language Models.
  • Unit: Neuron or small collection of neurons that perform some more complicated nonlinear function to compute the output. For example, an LSTM unit has a memory cell that records state, an input gate (neuron) that decides what value to remember, a forget gate (neuron) that decides how long to remember that value, and an output gate neuron that applies the activation function of the unit (usually a sigmoid or tanh()). A unit is a drop-in replacement for a neuron in a neural net that takes a vector input and outputs a scalar value; it just has more complicated behavior.
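
A sketch of the single-neuron computation described in the Neuron entry, assuming a tanh activation; the input, weight, and bias values are made up:

    import numpy as np

    def neuron(x, w, b):
        # weighted sum of the inputs plus a bias, squashed by the activation function
        return np.tanh(w.dot(x) + b)

    x = np.array([0.5, -1.0, 2.0])   # input signals
    w = np.array([0.1,  0.4, 0.3])   # learned weights for this one neuron
    print(neuron(x, w, b=0.05))      # a single scalar output (about 0.29 here)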
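
A sketch that enumerates skip-gram (center, context) training pairs from a token list; the window size and sentence are made up:

    def skip_grams(tokens, window=2):
        # pair each center token with every token up to `window` positions away
        pairs = []
        for i, center in enumerate(tokens):
            for j in range(max(0, i - window), min(len(tokens), i + window + 1)):
                if j != i:
                    pairs.append((center, tokens[j]))
        return pairs

    print(skip_grams("the quick brown fox".split()))
    # [('the', 'quick'), ('the', 'brown'), ('quick', 'the'), ('quick', 'brown'), ('quick', 'fox'), ...]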
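
A minimal softmax sketch; subtracting the maximum before exponentiating is a common trick for numerical stability and does not change the result:

    import numpy as np

    def softmax(z):
        e = np.exp(z - np.max(z))   # shift for numerical stability
        return e / e.sum()          # normalize so the outputs sum to 1

    scores = np.array([2.0, 1.0, 0.1])   # raw network outputs (logits), made up
    print(softmax(scores))               # approx. [0.659 0.242 0.099]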

[1] Python’s REPLs even allow you to execute any shell command (including pip) installed on your OS (such as !git commit -am 'fix 123'). This lets your fingers stay on the keyboard and away from the mouse, minimizing cognitive load from context switches.

[2] See the web page titled “Long short-term memory” (https://en.wikipedia.org/wiki/Long_short-term_memory).

[3] See the web page titled “Linguistic Features: spaCy Usage Documentation” (https://spacy.io/usage/linguistic-features#rule-based-morphology).
