AI Dictionary

AI Glossary

110+ AI terms explained in plain English. No PhD required.

A

AI Agent

An AI system that can take actions autonomously to achieve goals, rather than just responding to prompts.

AI Ethics

The study of moral questions raised by AI development and deployment, including fairness, accountability, transparency, and societal impact.

Ethics

AI Safety

The field focused on ensuring AI systems work as intended without causing unintended harm to humans or society.

Safety

Alignment

The challenge of making AI systems behave in ways that match human values and intentions.

Safety

Anomaly Detection

AI technique that identifies unusual patterns or outliers that don't conform to expected behavior.

Techniques

Anthropic API

Anthropic's interface for accessing Claude models, offering AI assistance with a focus on safety and helpfulness.

Tools

API (Application Programming Interface)

A way for software programs to communicate with each other, allowing developers to integrate AI capabilities into their applications.

Infrastructure

Attention Mechanism

A technique that allows neural networks to focus on the most relevant parts of the input when producing each part of the output.

Technical

AutoGen

Microsoft's framework for building multi-agent systems where AI agents collaborate through conversation.

Frameworks

Autonomous AI

AI systems that can operate independently to achieve goals with minimal human oversight or intervention.

Core Concepts

Azure OpenAI

Microsoft's enterprise service providing access to OpenAI models through Azure with added security and compliance features.

Tools

B

Backpropagation

An algorithm that calculates how much each weight in a neural network contributed to errors, enabling the network to learn from mistakes.

Technical

Bedrock

AWS's managed service for accessing foundation models from multiple providers through a unified API.

Tools

BERT

Bidirectional Encoder Representations from Transformers, Google's model designed for understanding text context by reading in both directions simultaneously.

Model Types

Bias in AI

Systematic patterns in AI outputs that unfairly favor or disadvantage particular groups, often reflecting biases present in training data or design choices.

Ethics

C

Chatbot

A software application that simulates human conversation through text or voice interactions.

Applications

Chinchilla

DeepMind's language model that proved smaller, better-trained models outperform larger undertrained ones, reshaping how the field thinks about scaling.

Models

Claude

Anthropic's family of AI assistants designed with constitutional AI principles, emphasizing helpfulness, harmlessness, and honesty.

Models

Code Completion

An AI feature that predicts and suggests code as developers type, speeding up programming workflows.

Applications

Codex

OpenAI's code-specialized model that powered GitHub Copilot, trained to understand and generate programming code across dozens of languages.

Models

Cohere

An enterprise-focused AI company providing language models, embeddings, and retrieval systems optimized for business applications.

Models

Command R

Cohere's retrieval-augmented generation optimized model designed specifically for enterprise RAG applications and tool use.

Models

Computer Vision

A field of AI that enables computers to interpret and understand visual information from images and videos.

Core Concepts

Constitutional AI

A training approach where AI systems are given explicit principles to follow and learn to critique and revise their own outputs accordingly.

Safety

Context Window

The maximum amount of text an AI model can consider at once, including both your input and its response.

Technical

CrewAI

A framework for orchestrating autonomous AI agents that work together as a crew to accomplish complex tasks.

Frameworks

CUDA

NVIDIA's parallel computing platform that enables developers to use GPUs for general-purpose processing and AI workloads.

Infrastructure

D

DALL-E

OpenAI's text-to-image model that generates original images from natural language descriptions with strong prompt adherence.

Models

Data Augmentation

Techniques for artificially expanding training datasets by creating modified versions of existing data, like rotating images or paraphrasing text.

Techniques

Deep Learning

A subset of machine learning that uses neural networks with many layers to learn complex patterns from large amounts of data.

Core Concepts

DeepSeek

A Chinese AI lab's open-source language models known for strong coding abilities and competitive performance at efficient training costs.

Models

Diffusion Model

A generative AI approach that creates images by learning to gradually remove noise from random static until a coherent image emerges.

Model Types

Distillation

A training technique where a smaller student model learns to mimic the behavior of a larger teacher model.

Infrastructure

Document Processing

AI technology that extracts, classifies, and structures information from documents like PDFs, forms, and images.

Applications

E

Embeddings

Numerical representations of text that capture semantic meaning, allowing AI to find similar content and understand relationships.

Technical

Explainability

The ability to understand and communicate why an AI system made a particular decision or produced a specific output.

Ethics

F

Falcon

The Technology Innovation Institute's open-source language models from Abu Dhabi, known for high-quality pretraining data and competitive benchmarks.

Models

Few-Shot Learning

The ability of an AI model to learn new tasks or concepts from just a handful of examples, rather than requiring thousands of training samples.

Techniques

Fine-tuning

The process of training an existing AI model on specific data to customize its behavior for particular tasks.

Technical

G

Gemini

Google DeepMind's multimodal AI model family designed to natively understand and generate text, images, audio, and video.

Models

Generative AI

AI systems that create new content like text, images, audio, or video based on patterns learned from training data.

Model Types

GPT

Generative Pre-trained Transformer, OpenAI's family of large language models that generate human-like text through autoregressive prediction.

Models

GPU

A Graphics Processing Unit that accelerates AI model training and inference through parallel computation.

Infrastructure

Gradient Descent

An optimization algorithm that iteratively adjusts model parameters to minimize errors by moving in the direction of steepest improvement.

Technical

Grounding

Connecting AI responses to authoritative sources or real-world data to reduce hallucinations and improve accuracy.

Techniques

H

Hallucination

When an AI generates information that sounds plausible but is factually incorrect or made up.

Core Concepts

Hugging Face

A platform and company providing open-source ML libraries, pre-trained models, and infrastructure for AI development.

Tools

I

Image Classification

AI technique that assigns labels or categories to images based on their visual content.

Techniques

Inference

The process of using a trained model to make predictions or generate outputs on new, previously unseen data.

Technical

Inference Endpoint

A deployed API endpoint that serves a trained AI model for real-time predictions.

Infrastructure

Interpretability

The degree to which humans can understand the internal workings and reasoning processes of an AI model.

Ethics

J

Jailbreak

A prompt or technique designed to bypass an AI system's safety guidelines and get it to produce normally restricted content.

Safety

L

LangChain

A framework for building applications powered by language models, with tools for chaining prompts, memory, and external data.

Frameworks

Latent Space

A compressed, abstract representation of data where similar items are positioned near each other and meaningful operations become possible.

Technical

LLaMA

Large Language Model Meta AI, Meta's open-weight language model family that enabled widespread research and commercial fine-tuning of powerful AI systems.

Models

LlamaIndex

A data framework for connecting custom data sources to large language models through indexing and retrieval.

Frameworks

LLM (Large Language Model)

A type of AI trained on massive amounts of text that can understand and generate human language.

Core Concepts

M

Machine Learning

A branch of artificial intelligence where computers learn patterns from data to make predictions or decisions without being explicitly programmed.

Core Concepts

Machine Translation

AI technology that automatically translates text or speech from one language to another.

Applications

MCP Server

A server that implements the Model Context Protocol, allowing AI models to interact with external tools and data sources in a standardized way.

Infrastructure

Midjourney

A proprietary text-to-image AI service known for producing highly aesthetic, artistic images through a Discord-based interface.

Models

Mistral

A French AI company's efficient open-weight language models known for punching above their weight class in performance per parameter.

Models

Mixtral

Mistral AI's Mixture of Experts model that achieves GPT-3.5 level performance while using a fraction of the compute per inference.

Model Types

Model Serving

The process of deploying and managing machine learning models to handle prediction requests in production.

Infrastructure

Multimodal

AI systems that can understand and generate multiple types of data, such as text, images, audio, and video, within a single model.

Model Types

N

Named Entity Recognition

AI technique that identifies and classifies named entities like people, organizations, and locations in text.

Techniques

Natural Language Processing

A field of AI focused on enabling computers to understand, interpret, and generate human language.

Core Concepts

Neural Network

A computing system inspired by the human brain that learns patterns from data through interconnected nodes organized in layers.

Core Concepts

O

Object Detection

AI technique that identifies and locates multiple objects within an image or video, drawing bounding boxes around each.

Techniques

OCR

Optical Character Recognition technology that converts images of text into machine-readable characters.

Applications

ONNX

Open Neural Network Exchange, an open format for representing machine learning models across different frameworks.

Tools

OpenAI API

OpenAI's interface for accessing GPT models, DALL-E, Whisper, and other AI capabilities programmatically.

Tools

Overfitting

When a model learns training data too perfectly, including noise and quirks, causing poor performance on new unseen data.

Technical

P

PaLM

Pathways Language Model, Google's large-scale language model that preceded Gemini and demonstrated strong reasoning and multilingual capabilities.

Models

Phi

Microsoft's family of small language models that achieve surprisingly strong performance through high-quality synthetic training data.

Models

Predictive Analytics

Using statistical models and machine learning to forecast future outcomes based on historical data.

Applications

Prompt Engineering

The practice of crafting inputs to AI models to get better, more consistent, and more useful outputs.

Techniques

Prompt Injection

An attack where malicious instructions are hidden in input data, tricking an AI system into ignoring its original instructions.

Safety

Pruning

An optimization technique that removes unnecessary weights or neurons from a neural network to reduce size and computation.

Infrastructure

Q

Quantization

A technique that reduces model size and speeds up inference by using lower-precision numbers for weights and activations.

Infrastructure

Question Answering

AI system that automatically answers questions posed in natural language based on given context or knowledge.

Applications

Qwen

Alibaba Cloud's family of multilingual language models with strong performance in Chinese and English, released as open weights.

Models

R

RAG (Retrieval-Augmented Generation)

A technique that enhances AI responses by retrieving relevant information from a knowledge base before generating answers.

Technical

Recommendation System

AI system that predicts and suggests items or content a user might like based on their behavior and preferences.

Applications

Red Teaming

The practice of deliberately trying to find flaws, vulnerabilities, and harmful behaviors in AI systems before they're deployed.

Safety

Reinforcement Learning

A machine learning approach where agents learn optimal behavior through trial and error, receiving rewards or penalties for their actions.

Techniques

RLHF

Reinforcement Learning from Human Feedback, a training technique where AI learns to improve its responses based on human ratings and preferences.

Techniques

S

Semantic Kernel

Microsoft's open-source SDK for integrating large language models into applications with plugins and planners.

Frameworks

Semantic Segmentation

AI technique that classifies every pixel in an image into a category, creating detailed visual maps.

Techniques

Sentiment Analysis

AI technique that identifies and categorizes emotions and opinions expressed in text.

Techniques

Speech-to-Text

Technology that converts spoken audio into written text, also known as automatic speech recognition.

Applications

Stable Diffusion

An open-source text-to-image diffusion model that generates detailed images from text descriptions and runs on consumer hardware.

Models

Summarization

AI technique that condenses long documents or text into shorter versions while preserving key information.

Applications

Supervised Learning

A machine learning approach where models learn from labeled examples that include both inputs and their correct outputs.

Techniques

Synthetic Data

Artificially generated data that mimics the statistical properties of real data, used to train AI models when actual data is scarce or sensitive.

Techniques

T

Text-to-Image

AI technology that generates images from written text descriptions.

Applications

Text-to-Speech

Technology that converts written text into spoken audio using synthesized or cloned voices.

Applications

Time-Series Forecasting

AI technique that predicts future values based on patterns observed in sequential historical data.

Techniques

Tokens

The basic units that AI models use to process text, roughly corresponding to word fragments or characters.

Technical

Tool Use

The ability of an AI model to call external functions, APIs, or services to accomplish tasks beyond text generation.

Core Concepts

TPU

Google's custom Tensor Processing Unit, an ASIC designed specifically for accelerating machine learning workloads.

Infrastructure

Training Data

The dataset used to teach a machine learning model patterns and relationships, forming the foundation of what the model learns.

Infrastructure

Transformer

A neural network architecture that processes sequences using self-attention, enabling models to weigh the importance of different parts of the input.

Technical

U

Underfitting

When a model is too simple to capture the underlying patterns in the data, resulting in poor performance on both training and new data.

Technical

Unsupervised Learning

A machine learning approach where models discover hidden patterns and structures in data without labeled examples.

Techniques

V

Vector Database

A database optimized for storing and searching embeddings, enabling fast similarity search across millions of items.

Infrastructure

Vertex AI

Google Cloud's unified machine learning platform for building, deploying, and scaling AI models.

Tools

Virtual Assistant

An AI-powered software agent that performs tasks and services based on voice or text commands.

Applications

W

Whisper

OpenAI's open-source automatic speech recognition model that transcribes and translates audio with near-human accuracy across multiple languages.

Models

Z

Zero-Shot Learning

The ability of an AI model to perform tasks it was never explicitly trained on, using only instructions or descriptions without any examples.

Techniques

110 terms across 11 categories