LibGuides: Generative Artificial Intelligence: Defining Generative AI

Where to start? Defining "Generative AI"

A key part of understanding generative AI, and specific tools like ChatGPT, is contextualizing it within the broader field of artificial intelligence. The following terms are ordered from conceptually broad to specific, and additional resources are included in each section:

The ability of machines and other technologies to "make predictions or decisions" based on "large amounts of data". Artificial intelligence is also an academic sub-discipline of computer and data science, with a history of scholarly research that originated in the 1950s.
Additional Resources:
- Web Article - "AI is confusing - here's your cheat sheet" (The Verge)
- Web Article - "Artificial Intelligence" (Britannica)
- UR Ebook - Artificial Intelligence: A Very Short Introduction by Margaret Boden
  - Limited to five simultaneous users

A specific branch of artificial intelligence focused on a machine model's ability to create, or "generate", new content. This content could be text-based, images, music, or even human voices. This is possible thanks to two key concepts:
- Training data: the set of examples or input data used to train a generative AI model. It consists of a collection of representative samples the model learns from to generate new content or make predictions. For example, companies like OpenAI use web data collected by Common Crawl, a non-profit organization that has collected over 9.5 petabytes of web data since 2007.
- Machine learning: an algorithmic process that identifies patterns in training data sets and makes predictions based on those patterns.
Additional Resources:
- Web Article - "Explained: Generative AI" (MIT News; Adam Zewe)
- Video - "Introduction to Generative AI" (Google Cloud Tech)
- Web Article - "How AI chatbots like ChatGPT or Bard work – visual explainer" (The Guardian)

A specific type of generative AI that creates written works or text-based responses to user-generated prompts. Large-language models (LLMs) are trained on huge amounts of data which it uses to predict the most likely series of words needed to address a user's prompt, essentially act as "an extremely powerful form of auto-complete".
It is important to understand the difference between LLMs and the tools that utilize their training data. For example, OpenAI's LLM is used as the foundation for a wide range of generative AI tools, including ChatGPT and Jenni.ai. A person who using ChatGPT relies on OpenAI's LLM as the basis for its responses.
Additional Resources:
- Web Article - "Generative AI exists because of the transformer. This is how it works." (Financial Times)
- Web Article - "A jargon-free explanation of how AI large language models work" (ArsTechnica; Timothy B. Lee and Sean Trott)
- Video - "Introduction to large language models" (Google Cloud Tech)

A specific chatbot LLM, created by the company OpenAI. ChatGPT can take a wide range of user text inputs and respond in a human-like manner.
There have been multiple versions of ChatGPT made available to the public. The most well-known version, ChatGPT 3.5, was released in November 2022, though it has since been discontinued in favor of updated ChatGPT versions.
Additional Resources:
- Web Article - "ChatGPT: Everything you need to know about OpenAI's GPT-4 tool" (BBC Science Focus; Alex Hughes)
- Web Article - "What Is ChatGPT Doing...and Why Does It Work?" (Stephen Wolfram Writings)

Source: AI Guide – The AI Pedagogy Project. (n.d.). Retrieved December 4, 2023, from https://aipedagogy.org/guide/

Per a definition coined by Long & Magerko (2020), AI literacy refers to "a set of competencies that enables individuals to critically evaluate AI technologies; communicate and collaborate effectively with AI; and use AI as a tool online, at home, and in the workplace" (p. 2).

While this is not necessarily an all-encompassing definition, and one that will continue to evolve alongside new generative AI tools, it speaks to key skills the UR Library will foster through this guide, and in our research assistance with students and faculty alike. In particular, we envision users will be able to:

Explain the basic functionality and applications of generative AI;
Utilize generative AI tools in an effective, responsible, and ethical manner;
Assess various generative AI tools, as well as their outputs, for accuracy and bias;
Analyze the various implications of increased generative AI usage, from privacy to environmental sustainability

For more information on AI literacy, check out the following pieces below:

Long, D., & Magerko, B. (2020). What is AI literacy? Competencies and design considerations. Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems, 1–16. https://doi.org/10.1145/3313831.3376727

Miao, F. & Shiohira, K. (2024). AI competency framework for students. UNESCO. https://doi.org/10.54675/JKJB9835

Ng, D. T. K., Leung, J. K. L., Chu, S. K. W., & Qiao, M. S. (2021). Conceptualizing AI literacy: An exploratory review. Computers and Education. Artificial Intelligence, 2, Article 100041. https://doi.org/10.1016/j.caeai.2021.100041

LinkedIn Learning Courses

Find more information about generative AI through the following LinkedIn Learning courses (all UR students, staff, and faculty have access to LinkedIn Learning):

The LinkedIn Learning platform (formerly Lynda.com) is available to all UR students, staff, and faculty.

LinkedIn Learning
Access online library of instructional videos on the latest software tools and skills. (LinkedIn)

Key Terminology

Here are terms commonly used in discussions of generative AI and their definitions:

Autonomous AI: Artificial intelligence systems capable of operating and performing tasks without constant human control. These systems can analyze data, learn from it, and execute actions based on their programming and algorithms.

Bias: The presence of systematic and undesired preferences or imbalances in the output generated by an AI model. Bias can emerge in various forms, such as in the content, language, or perspectives generated by the AI system.

Burstiness: The abrupt shifts in quality, coherence, or relevance often observed in AI-generated content, particularly in writing. It refers to the inconsistencies in style, tone, or factual accuracy that can occur within a short span. Identifying burstiness helps distinguish AI-generated content from human-created content.

Generative AI: AI systems that can generate new content, such as text, images, or music. It involves developing algorithms and models that can understand patterns in existing data and use that understanding to generate novel output.

Generative Model: An AI model designed to generate new data that resembles the patterns and characteristics of the training data it has been exposed to.

Hallucinations: Misinformation or made-up information based on a pattern that the AI model has learned as part of its training. For example, the model could create references that do not actually exist.

Heat Map: A visual representation that highlights important elements in the output generated by an AI model. It helps understand where the model focuses and assists in evaluating and improving the generated content.

Large-Language Models: Components of artificial intelligence developed based on the training of vast datasets of documents from various sources. The computer program analyzes data input and maps out words in the dataset. It next tries to predict which words are positioned before or after other words using predictive patterns of most likely combinations.

Output: The generated content produced by a generative AI system. It can be text, images, audio, music, video, or other data the model is designed to produce.

Perplexity: A measure used to assess the coherence and consistency of AI-generated text. Higher perplexity values suggest the content is more likely to be AI-generated due to unusual patterns or inconsistencies. Content identification systems use perplexity to identify AI-generated content.

Positional Encoding: A technique that assigns a number to each word during training that is used to show the position (or order) of words in a sequence.

Probabilistic: In generative AI, probabilistic means that the models incorporate probability, which is used to estimate the likelihood of different outcomes and generate outputs that align with the learned probabilities.

Prompt: The initial input text or instructions given to a model to generate new content based on that starting point. It provides context and guides the model's output. The prompt can be a few words or sentences that set the tone or specify the desired content.

Sentient: The capability to possess consciousness, self-awareness, and subjective experiences. Achieving true sentience in AI systems is a topic of scientific exploration and philosophical debate.

Tokens: Discrete units used to represent meaningful components of text, such as words or phrases. Breaking down text into these units allows AI models to process and analyze language at a granular level, enabling tasks like language generation.

Training Data: Training data refers to the set of examples or input data used to train a generative AI model. It consists of a collection of representative samples the model learns from to generate new content or make predictions.

Transformer: A type of model (or robot) that can simultaneously work on several tasks and sequentially build output. The transformer gives AI models the ability to process and learn from data so they can interpret context and place words together to form a cohesive sentence structure

Adapted in part from: Walden University (n.d.) Key AI terms glossary. https://academics.waldenu.edu/artificial-intelligence/glossary

Additional Resources

AI Guide
Introductory guide to generative AI through an educational/pedagogical lens, including a collection of assignments for students on beginning artificial intelligence concepts. Created by the metaLAB at Harvard University.
Elements of AI
Created by MinnaLearn and the University of Helsinki, this is a self-paced, online tutorial designed to introduce to people "what AI is, what can (and can’t) be done with AI, and how to start creating AI methods".
Large language models, explained with a minimum of math and jargon
In-depth, but approachable, explanation of Large Language Models and their role in how generative AI tools function (July 2023)
Watch an A.I. Learn to Write by Reading Nothing but...
NYT piece that breaks down and visualizes the steps generative AI tools use to produce responses in the style of famous works like Jane Austen, Shakespeare, the Federalist Papers, and even Star Trek (April 2023)
Citing AI Guide
Page from UR Library on how to cite generative AI discussions in MLA, APA, and Chicago formats.

Boatwright Memorial Library

Where to start? Defining "Generative AI"

LinkedIn Learning Courses

Key Terminology

Additional Resources