Understanding Narrative Transportation Using NotebookLM

In a previous post I discussed Embracing Narrative Transportation as a way to enhance the impact of your personal stories. And while I enjoy writing articles about storytelling, I’ve been exploring AI (Artificial Intelligence) to broaden my research capabilities.

I have to say, it’s amazing to watch these Chatbots in action. They are fast, and comprehensive in their reach, but they don’t replace humans as they still don’t have human experiences that we have, but they are able to locate and analyze information faster than any human.

So I decided to see what some of the latest chatbots had to say about the topic of Narrative Transportation with regards to crafting impactful personal stories. In part two, I asked NotebookLM what it thought. While it’s a long article, if you have an interest in AI, it’s an interesting read.

Input:

As Google’s NotebookLM queries the sources that have been uploaded to a specific notebook, I added 42 sources that discussed some aspect of narrative transportation. Then I asked 8 questions, which are listed in the following text, along with the output from NotebookLM

Output:

Here’s what NotebookLM had to say:

Overview

The provided texts collectively explore the multifaceted nature and impact of narrative. Monika Fludernik‘s work offers a comprehensive introduction to narratology, defining narrative elements and theoretical frameworks for understanding storytelling across various media. Lisa Libby and Melanie Green examine the psychological phenomenon of narrative transportation, investigating how engagement with stories influences beliefs, attitudes, and self-perception, even in the face of conflicting information.

Helenq Bilandzic focuses on measuring narrative engagement, identifying its components and relationship to concepts like identification. Martin Payne’s “Narrative Therapy” discusses the application of storytelling principles in therapeutic contexts, emphasizing the power of personal narratives in shaping identity and facilitating change. Finally, Lucas Bietti and Kurt Braddock, along with Michael Dahlstrom and William Buchanan, investigate the adaptive functions of storytelling and the persuasive effects of narrative on beliefs and cognitive processes like recall and comprehension, highlighting the immersive “experience” of transportation into a story world.

Questions

(1) What are the core components and definitions of Narrative Transportation as described in the provided documents?

Narrative transportation is an experiential state of immersion in which all mental processes are concentrated on the events occurring in the narrative. It’s the process that enables narrative persuasion, occurring when a person becomes so immersed in a story that they feel as if they are truly experiencing it.

Here are some core components and definitions of narrative transportation as described in the sources:

    • Definition: Narrative transportation is defined as a convergent mental process where all mental systems and capacities become focused on events occurring in the narrative. It involves individuals becoming absorbed into a story world. This psychological state varies depending on the text or media product, the recipient, the situation, and interactions between these factors.
  • Key Components: Several key cognitive and emotional components contribute to narrative transportation:
    • Focused Attention: Receivers become so focused on the story that they do not think about distractions in their environment. A high level of focus or attention to the narrative is considered a necessary but not sufficient condition for transportation.
    • Emotional Engagement (including Empathy): There is a strong emotional response to the story with empathy for the characters. Transportation involves empathy, where story receivers try to understand the experience of a story character, knowing and feeling the world in the same way.
    • Mental Imagery: Receivers have a mental image of the story in their heads or try to predict what might happen in the future of the story. Story receivers generate vivid images of the story plot, feeling as though they are experiencing the events themselves.
    • Cognitive Detachment from Reality (Suspended Reality/Loss of Access to the External World): When transported, story receivers lose track of reality in a physiological sense. They mentally leave the real world behind and become entirely focused on the story world, although recent perspectives emphasize this is a dynamic process. This also includes a lack of awareness of surroundings.
  • Related Concepts and Distinctions:
    • Narrative transportation is distinct from a narrative itself, which is an antecedent of narrative transportation. Narratives are defined as a connected series of events that typically include the actions and experiences of one or more protagonists that are causally related.
    • While similar to other experiential constructs like absorption, flow, and immersion, narrative transportation has specific characteristics. Absorption is a personality trait, while transportation is temporary. Flow is more general, while transportation specifically entails empathy and mental imagery. Immersion is more related to aesthetic and visual elements, whereas transportation relies on a story with plot and characters. Identification emphasizes involvement with characters, while transportation concerns involvement with the entire narrative.
  • Outcomes: Narrative transportation can lead to several outcomes:
    • Belief and Attitude Shift: Transportation can influence receivers’ beliefs, attitudes, intentions, and behaviors, aligning them with the story’s content.
    • Reduced Counterarguing: When engrossed in a narrative, the receiver is less likely to challenge the story’s content.
    • Story-consistent beliefs and favorable evaluations of protagonists.
  • Factors Influencing Transportation: The degree of transportation depends on various factors related to the story (e.g., narrativity, quality, coherence, characters), the recipient (e.g., transportability, need for affect), the medium, and the situation. Narrativity, which includes content dimensions and structural features like a sequence of causally related events unfolding over time, contributes to transportation by enabling an easy flow of comprehension.

In essence, narrative transportation is a psychological state of deep engagement with a story, involving focus, emotion, imagination, and a temporary detachment from the real world, which can subsequently influence beliefs and attitudes.

(2) According to these documents, what factors contribute to or hinder an individual’s experience of Narrative Transportation?

According to the documents, several factors contribute to or hinder an individual’s experience of Narrative Transportation. These factors can be broadly categorized into story characteristics, recipient traits and individual differences, situational factors, and medium characteristics.

  1. Story/Narrative Characteristics: These are features inherent to the narrative itself that can influence how easily a person becomes transported.
  • Narrativity and Literariness: Stories with high narrativity, meaning they include protagonists with goals, ensuing conflicts, and follow canonical affective trajectories, tend to elicit stronger narrative transportation. Well-crafted, high-quality narratives are generally more transporting. Literariness can also contribute to transportation.
  • Coherence: A story that provides clear and sensible links between story events, character actions, and other elements is more transporting.
  • Character Development:Well-developed characters contribute to higher transportation. Identifiable characters, where the story receiver can clearly pinpoint the personas, facilitate transportation by allowing for vicarious experience of beliefs and emotions.
  • Emotional Intensity and Range: Stories with a greater emotional range or intensity can be more transporting.
  • Suspense and Dramatic Tension: The presence of suspense or dramatic tension can increase transportation.
  • Psychological Realism (Believability):Even in fantasy settings, the plot and characters should be plausible and act like real people to enhance transportation. Narratives that lack verisimilitude (realism or the appearance of being true) tend to be less transporting. However, this realism refers more to psychological realism than literal similarity to the real world.
  • Clarity and Ease of Comprehension: Narratives that are easy to read and understand facilitate transportation. Conversely, written narratives that are difficult to read, understand, or imagine tend to be less transporting.
  • Vividness and Detail: Rich detail may make stories more transporting. Stories containing rich, concrete imagery can increase levels of transportation.
  • Novelty: Foregrounding in a narrative can increase perceived novelty, potentially contributing to transportation.
  • Story Line Appeal and Quality of Production: These are suggested as important text factors influencing transportation.
  • Unobtrusiveness of Persuasive Subtext: When persuasive messages are presented subtly, transportation may be higher.
  • Similarity of Characters with the Self (Homophily): Characters who are similar to the reader can increase transportation. Personally relevant stories also tend to be more transporting.
  • Narrative Perspective: While third-person perspective stories can be transporting, there is some evidence that the first-person perspective might be particularly effective.
  • Coherence of Narrative Elements: A series of interrelated events that constitute a developed narrative, where the reader relies on the author, and the author relies on the reader’s ability to interpret, contributes to the experience.
  • Causal Information: Information that is part of the causal chain of the story may lead to greater engagement.
  1. Recipient Traits and Individual Differences: These are characteristics of the individual reading or experiencing the narrative.
  • Transportability: This is a generalized tendency toward being transported by a narrative. Individuals high in transportability are more likely to become transported. This trait involves subdimensions like suspense/curiosity propensity, emotional engageability, and ease of accepting fictional worlds.
  • Need for Affect: Individuals with a higher need for affect (a tendency to avoid emotion-inducing situations) are more likely to be transported into narratives.
  • Imaginative Engagement: A higher capacity for imaginative engagement is linked to increased transportation.
  • Narrative Comprehension Ability: How well a person understands the story, characters, and conflicts can affect transportability.
  • Tendencies Toward Mental Imagery Production: Individuals who are better able to imagine the events and characters in a narrative are more likely to be transported. However, individuals with a low ability to generate vivid mental imagery might be more transported by visual media like films than by texts.
  • Tendency Toward Emotional Responding: Individuals with a greater tendency toward emotional responding may have higher transportability.
  • Prior Knowledge: Familiarity with a story topic can increase transportation. Prior knowledge that makes it easier to understand and connect with characters can also make transportation more likely. However, if personal experience contradicts the story, transportation can be disrupted as individuals may adopt a critical mode of processing.
  • Attention: Paying attention to a story is a prerequisite for transportation.
  • Self-Concept Accessibility: The level of accessibility of an individual’s self-concepts upon entering a narrative world can influence experience-taking, which is related to transportation. High self-concept accessibility did not appear to increase distraction or reduce absorption but was associated with lower levels of experience-taking.
  • Literary Expertise: Readers with literary expertise may process texts differently due to their memory representations, including genre expectations.
  • Demographic Factors: Some research suggests that younger individuals and females may experience more narrative transportation. Education level has also been linked to transportation.
  1. Situational Factors: These are elements of the environment or context in which the narrative is experienced.
  • Pre-Narrative Instructions: Instructions given to readers or viewers before engaging with the narrative can affect transportation. For example, instructions to relax and enjoy the narrative can increase transportation, while instructions to focus on surface aspects like grammar can decrease it.
  • Prior Expectations (e.g., Reviews): Positive expectations created by prior positive reviews of a story can encourage greater transportation. Conversely, negative reviews might hinder it.
  • Distraction and Cognitive Load: Anything that distracts the individual or increases cognitive load can interfere with transportation.
  • Co-viewing: The presence and reactions of other people while experiencing a narrative (e.g., watching a movie) can influence transportation and its effects.
  1. Medium Characteristics: The format in which the narrative is presented can also play a role.
  • Type of Medium: Different media (e.g., written, audio, video, virtual reality) can influence the extent of narrative transportation. For example, virtual reality stories may evoke stronger physiological responses.
  • Features of the Medium: Elements such as music in video narratives can affect transportation. The sensory capabilities of a medium (e.g., VR haptic interfaces) can influence how stories can be told and experienced.
  • Visual Narrativity: For visual narratives, the presence of setting, a focal actor, and elements for viewer resonance contribute to visual narrative transportation.

It’s important to note that these factors can interact with each other, and the experience of narrative transportation is often a dynamic process that can fluctuate over time.

(3) How does Narrative Transportation, as explained in these texts, affect an audience’s beliefs, attitudes, and emotions in relation to a story?

Narrative Transportation, as explained in these texts, significantly affects an audience’s beliefs, attitudes, and emotions in relation to a story by immersing them in the narrative world, leading to various psychological and persuasive outcomes. This immersion, characterized by focused attention, emotional engagement, and vivid mental imagery, has several key effects:

  1. Belief Change:
  • Reduced Counterarguing: When individuals are transported into a narrative, they are often less likely to critically evaluate the information presented and may have a reduced tendency to counterargue claims made in the story. This can lead to the acceptance of story-consistent beliefs.
  • Perceived Realism: Higher levels of transportation can lead story receivers to perceive the story and its plot as more realistic and truthful, even if they know it is fictional. This perception of truthfulness can then prompt them to adopt story-consistent beliefs.
  • Influence of Characters’ Beliefs: Strong feelings and connections formed with story characters during transportation can enhance the influence of those characters’ experiences and beliefs on the audience’s own beliefs.
  • Memory Reinforcement: The mental imagery formed during transportation can be recalled later, potentially re-evoking the original communication and reinforcing story-relevant beliefs over time.
  • Generalization of Lessons: Audiences tend to generalize lessons learned from transported narratives and apply them to real-life situations, even if the stories focus on specific individuals and events.
  • Causal Information Acceptance: Information presented at causal locations within a narrative tends to be perceived as more truthful regarding the real world, and this effect can persist over time.
  1. Attitude Change:
  • Favorable Evaluations: Transportation can lead to more positive feelings and evaluations of sympathetic characters within the narrative.
  • Story-Consistent Attitudes: The engrossing experience of narrative transportation can result in more positive attitudes towards the story itself and story-consistent attitudes toward the experiences described within it.
  • Mediation by Affect and Cognition: Narrative transportation can cause both affective (emotional) and cognitive responses, which in turn mediate changes in attitudes. Higher transportation is associated with increased story-consistent affective responses and narrative thoughts, and decreased critical thoughts.
  • Persuasion: Narrative transportation is a key mechanism of narrative persuasion, influencing changes in consumers’ attitudes and intentions. Individuals more transported into narratives often show more positive attitudes towards related topics, such as products or social policies.
  • Reduced Resistance: Stories, particularly when transporting, can be less threatening than direct persuasive messages, helping to overcome resistance to attitude change.
  1. Emotional Impact:
  • Emotional Engagement: Transportation inherently involves emotional engagement with the narrative. Transporting narratives are effective partly because they evoke emotions.
  • Vicarious Experience: Through transportation, audience members can vicariously experience the emotions of the characters, leading to emotional responses aligned with the narrative events.
  • Emotional Flow: The pattern of emotional experiences throughout a story, including shifts in emotional tone and the creation and resolution of suspense, can help maintain audience interest and increase transportation.
  • Intensity of Response: Greater levels of narrative transportation can cause story receivers to perceive the story as more realistic, leading to more intense and story-consistent emotional responses.
  • Connection to Characters: Emotional connections formed with characters during transportation contribute to the overall emotional impact of the story.

In summary, Narrative Transportation acts as a powerful process that allows audiences to become deeply involved in a story world. This immersion reduces critical processing, enhances emotional engagement, and fosters connections with characters and the narrative, ultimately leading to shifts in their beliefs, attitudes, and emotional states that are often consistent with the story’s content and implications. This effect is distinct from more analytical forms of persuasion.

(4) Based on the uploaded documents, what are the key principles of Narrative Transportation that can be applied to crafting impactful personal stories?

Based on the uploaded documents and our conversation history, several key principles of Narrative Transportation can be applied to crafting impactful personal stories:

  • Narrative Quality and Coherence: A well-crafted personal story should have a clear and logical flow of events. The connections between different parts of the story, including cause-and-effect relationships, should be sensible and easy to follow. Ensuring that the actions and motivations of the storyteller (and any other characters involved) are consistent will enhance the coherence.
  • High Narrativity: Personal stories should possess strong narrativity, which encompasses both the content (the sequence of events and the individuals involved) and the discourse (how the story is told using literary devices). A compelling personal story will have identifiable elements of a story, such as a beginning, rising action, climax, falling action, and resolution (even if implicit).
  • Vivid Imagery: To facilitate transportation, a personal story should incorporate rich and descriptive language that allows the audience to form vivid mental images of the events, settings, and people involved. Engaging sensory details (what was seen, heard, felt, etc.) can significantly enhance this aspect.
  • Emotional Engagement: Impactful personal stories evoke emotions in the audience. Sharing your own genuine emotions and describing events in a way that resonates emotionally with listeners or readers is crucial. The emotional arc of your story, including moments of tension, joy, sadness, or reflection, can contribute to greater transportation.
  • Focused Attention: A well-crafted personal story will capture and maintain the audience’s attention. This can be achieved through compelling plot points, intriguing details, and a pace that keeps the audience engaged without overwhelming them.
  • Identifiable Characters (Especially Yourself): Even in a personal story where you are the central figure, making yourself relatable and allowing the audience to understand your experiences and emotions will foster empathy and connection. Sharing your inner thoughts, motivations, and vulnerabilities can help the audience identify with you.
  • Psychological Realism: While personal stories are factual, presenting them in a way that feels psychologically plausible – where the emotions and reactions feel genuine and understandable – can enhance transportation. Even when recounting unusual events, grounding the narrative in relatable human experiences is important.
  • Personal Relevance and Similarity: When crafting a personal story, consider how it might resonate with your intended audience. Highlighting themes, experiences, or emotions that the audience can relate to or find relevant to their own lives can increase their level of transportation. Similarities between the storyteller and the audience can also lead to higher levels of transportation.
  • Perspective: While not explicitly stated as a necessity, using the first-person perspective (“I”) in a personal story can naturally enhance the feeling of direct experience and foster a closer connection between the storyteller and the audience, potentially increasing transportation.

By thoughtfully applying these principles, individuals can craft personal stories that are not just accounts of events, but immersive experiences that can affect an audience’s beliefs, attitudes, and emotions, making the stories more impactful and memorable.

(5) What narrative elements or techniques, identified in these documents, are most effective in fostering Narrative Transportation in personal storytelling?

Based on the uploaded documents and our conversation history, several narrative elements and techniques are identified as being most effective in fostering Narrative Transportation in personal storytelling:

  • Coherence and Narrative Quality: A personal story that flows logically, with clear connections between events, character actions, and other elements, is more likely to be transporting. When the cause-and-effect links are sensible, the audience can more easily become immersed.
  • Strong Narrativity: This involves both the what (narrative content – the sequence of events and characters) and the how (narrative discourse – the literary devices used to tell the story). A well-defined sequence of events with relatable characters provides a foundation for transportation.
  • Vivid Mental Imagery: Using rich and descriptive language that appeals to the senses helps the audience create mental images of the story world and events, a key component of transportation. Concrete details and sensory information make the story more engaging and immersive.
  • Emotional Engagement:Stories that evoke genuine emotions in the audience are more transporting. Sharing your own emotions authentically and describing experiences in an emotionally resonant way allows the audience to connect with the story on a deeper level. The emotional arc of the story, including moments of suspense, joy, or reflection, also plays a crucial role.
  • Focused Attention: While the storyteller primarily influences this through the above elements, crafting a story that captures and holds the audience’s attention is vital for transportation. Avoiding unnecessary digressions and maintaining a compelling pace can aid in this.
  • Identifiable Characters (Especially the Teller): Making yourself and any other individuals in your personal story relatable and understandable fosters empathy and identification, which are closely linked to transportation. Sharing thoughts, feelings, and motivations can help the audience connect with your experience.
  • Psychological Realism: Even when recounting extraordinary personal events, grounding the story in believable human psychology – ensuring that actions and reactions feel plausible – enhances transportation. The audience should be able to understand the characters’ (including your own) motivations and emotional states.
  • Personal Relevance and Similarity: When the audience perceives the personal story as relevant to their own lives or identifies similarities between themselves and the storyteller or the situation, transportation is more likely. Highlighting universal themes or relatable experiences can bridge this gap.
  • Perspective (First-Person):While not explicitly stated as a definitive rule in the sources, using the first-person perspective (“I”) in personal storytelling naturally creates a sense of immediacy and direct experience, potentially drawing the audience more directly into your narrative and fostering transportation.

In essence, crafting impactful personal stories that foster Narrative Transportation involves telling a well-structured and coherent story with vivid details that evoke emotions, allowing the audience to connect with the storyteller and experience the narrative in a way that feels real and relevant to them.

(6) Are there any limitations or potential drawbacks of relying on Narrative Transportation to create impactful personal stories, according to these sources?

Based on the uploaded documents and our conversation history, there are several limitations or potential drawbacks of relying on Narrative Transportation to create impactful personal stories:

  • Distraction by Irrelevant Details: While vivid details are crucial for transportation, an overabundance of them, or details that are irrelevant to the main point of the personal story, can be a significant drawback. These details might obscure the relevant elements of the story and detract from the intended impact. Listeners or readers could become focused on less important aspects, potentially missing the core message or the intended emotional or attitudinal shift.
  • Potential for Misinterpretation and Misapplication: If the audience becomes highly transported but fixates on tangential similarities between the story and their own lives, they might draw incorrect or unintended conclusions. This “distraction by details” could lead to decisions or perspectives that are not aligned with the storyteller’s intention.
  • Risk of Perceived Manipulation: If a personal story is perceived as being overly crafted or manipulative in its attempt to induce transportation, it could backfire. Audiences might become aware of the persuasive intent, which can lead to increased skepticism and reduced effectiveness of the story. The sincerity and authenticity often expected in personal stories are crucial to avoid this perception.
  • Ethical Considerations: Depending on the content of the personal story, high levels of transportation could raise ethical concerns. For instance, if the story inadvertently normalizes or glorifies harmful behaviors, a transported audience might be more susceptible to these negative implications. Accuracy, potential for manipulation, and the portrayal of oneself and others with respect are important ethical considerations.
  • Focus on the Story Over the Message: The very elements that make a personal story highly transporting – engaging plot, vivid characters, and emotional resonance – might overshadow the specific belief, attitude, or behavior the storyteller intends to influence. The audience might remember the story and their emotional experience but not necessarily internalize the intended lesson or perspective.
  • Individual Differences in Transportability: Not everyone is equally susceptible to narrative transportation. Factors like “transportability” (a person’s general tendency to become absorbed in narratives), prior experiences, and the situational context can influence how deeply someone is transported. Therefore, a personal story that is highly transporting for one person might have less impact on another.
  • The Role of Critical Thinking: While transportation often reduces counterarguing, if the personal story contains inconsistencies or elements that strongly contradict the audience’s existing knowledge or beliefs, high transportation might not entirely suppress critical thinking. Instead, these inconsistencies might disrupt transportation and reduce the story’s impact.

In summary, while Narrative Transportation is a powerful tool for crafting impactful personal stories, storytellers need to be mindful of the potential for distraction through details, the risk of appearing manipulative, ethical implications, the possibility of the story overshadowing the message, and individual differences in audience responses. Carefully balancing the elements that foster transportation with a clear focus on the intended message and maintaining authenticity are key to mitigating these potential drawbacks.

(7) Can you identify any examples or case studies of successful personal storytelling within these documents that illustrate the principles of Narrative Transportation?

Based on the uploaded documents, there aren’t explicit, detailed examples or case studies of successful personal storytelling that are thoroughly analyzed to illustrate the specific principles of Narrative Transportation we have discussed. While the sources touch upon the power of narratives and their persuasive effects, the emphasis is often on broader categories of stories, such as public narratives, fictional narratives used in research, or the use of autobiographical storytelling in therapeutic contexts without specific examples dissected for their Narrative Transportation elements.

However, we can infer some potential examples and connect them to the principles based on the information provided:

  • The “Shopping Trolley Soldier (John Mulligan)” mentioned in: This is referred to as a dramatic example of someone benefiting from writing therapy. While the specifics of Mulligan’s story and its telling are not detailed, its effectiveness in resolving emotional issues suggests that it likely employed several principles of Narrative Transportation. For example, to have a “dramatic” impact, the story likely had a degree of narrative quality and coherence, presented events in a way that evoked emotional engagement, and perhaps allowed for some level of identification or psychological realism for the reader to connect with Mulligan’s experiences. However, without more details, this remains an inferred example.
  • The use of autobiographical stories in therapy described in: The text outlines how clients can be asked to choose significant autobiographical stories and clarify them through writing and discussion. The potential for these stories to be “helpful” and for clients to gain insights about their lives by reflecting on their own narratives suggests that when these stories are shared (even if primarily with a therapist), they can be impactful. For Narrative Transportation to occur in this context, the client’s ability to articulate their experiences with coherence, including vivid details of memories, and to convey the emotional significance of these events would likely be crucial. The therapist’s questions to help clarify the story could also enhance these elements. Again, no specific story is provided as a case study.
  • Stories used in health communication research: While not strictly “personal” storytelling in the sense of an individual sharing their own life story, the reference to using storytelling strategies in health messages and the example of a video story about cervical cancer screening having positive effects implies that well-crafted narratives, even if based on others’ experiences or fictionalized for a purpose, can be impactful. These stories likely leverage principles like narrative quality, emotional engagement (perhaps through relatable characters), and the potential for identification to achieve their persuasive goals.

In summary, while the documents highlight the importance and effects of narratives, they do not offer specific, detailed examples of successful personal storytelling that are then explicitly analyzed to showcase how principles of Narrative Transportation were employed. We can infer that instances of impactful storytelling, whether in therapy or health communication, likely benefited from these principles, but the sources do not provide the detailed case studies requested.

(8) How do the different terms provided (Narrative Transportation Theory, Narrative Theory, Transportation Theory, and Narratology) relate to or differ from each other within the context of these documents?

Based on the provided documents and our conversation history, here’s how the different terms relate to or differ from each other:

  • Narrative Theory is the broadest term, encompassing a wide range of approaches to texts that are considered to be, partially or wholly, narrative. It is described as a set of approaches to understand, analyze, and evaluate narratives across various forms, including texts, images, spectacles, and events. Narrative theory draws upon various disciplines like linguistics, semiotics, psychology, and cultural studies to study the forms, functions, and effects of narrative.
  • Narratology is a specific branch or discipline within Narrative Theory. It is often referred to as the theory or science of narrative. Narratology aims to develop systematic frameworks and tools for analyzing narrative structures, elements (like story and discourse), and categories. The sources indicate that narratology has its roots in structuralism and has evolved into various schools of thought and approaches, including classical and postclassical narratologies. Mieke Bal defines narratology as “the ensemble of theories of narratives, narrative texts, images, spectacles, events; cultural artifacts that ‘tell a story'”.
  • Transportation Theory (also referred to as Narrative Transportation Theory) is a psychological theory that focuses specifically on the experience of being immersed or “transported” into a narrative world. This theory examines the cognitive and emotional processes involved when individuals become highly engaged with a story, experiencing a sense of “going into” the narrative world. Key aspects of transportation include focused attention, imagery, and emotional engagement.
  • The relationship between Narrative Theory and Narratology is that narratology is a core component of narrative theory. While narrative theory is the overarching field of study, narratology provides many of the analytical tools, concepts, and frameworks used within this field to understand how narratives function.
  • Narrative Transportation Theory is a specific theory within the broader field of Narrative Theory, but it comes primarily from psychology and communication studies rather than literary studies, which have historically been central to narratology. While narratology might analyze the structural features of a story, Narrative Transportation Theory focuses on the psychological impact of engaging with that story, particularly its ability to transport the audience and potentially lead to persuasion.
  • Transportation Theory and Narrative Transportation Theory appear to be used interchangeably in the provided sources. Both terms refer to the same psychological construct of being immersed in a narrative.

In simpler terms:

  • Think of Narrative Theory as the entire field of studying stories.
  • Narratology is a key set of theories and methods within narrative theory, focusing on the structure and workings of narratives.
  • Narrative Transportation Theory (or Transportation Theory) is a specific psychological theory within narrative theory that explains how people become immersed in stories and the effects of this immersion, such as persuasion.

Therefore, while all these terms relate to the study of narrative, they operate at different levels of generality and come from different disciplinary perspectives. Narratology offers tools for analyzing narratives, while Narrative Transportation Theory explains a particular psychological experience of engaging with narratives.

Learn more about the coaching process or
contact me to discuss your storytelling goals!

Subscribe to the newsletter for the latest updates!

Copyright Storytelling with Impact® – All rights reserved

Understanding Narrative Transportation Using Manus AI

In my last post I talked about Embracing Narrative Transportation as a way to enhance the impact of your personal stories. And while I enjoy writing articles about storytelling, I’ve also been exploring AI (Artificial Intelligence) as a way to broaden my research capabilities.

I have to say, it’s amazing to watch these Chatbots in action. They are fast, and comprehensive in their reach, but they don’t replace humans as they still don’t have human experiences that we have, but they are able to locate and analyze information faster than any human.

So I decided to see what some of the latest chatbots had to say about the topic of Narrative Transportation with regards to crafting impactful personal stories. In part one, I asked Manus AI what it thought. Below is the prompt that I used, its initial response, and the output it generated. Warning, it’s a long article, but should you have an interest in AI, it’s an interesting read

Prompt:

I’m trying to understand the topic of Narrative Transportation in depth, as it relates to storytelling in general, but more specifically, how we can use Narrative Transportation to create impactful personal narratives.

Response:

I’ll help you understand Narrative Transportation in depth, both in general storytelling and specifically for creating impactful personal narratives. I’ll research this topic thoroughly and provide you with comprehensive information. Let me get started right away./em>

Output:

Here’s what Manus AI had to say:

Understanding Narrative Transportation: Creating Impactful Personal Narratives

Introduction

Narrative transportation is a powerful psychological phenomenon that occurs when individuals become completely immersed in a story. This comprehensive guide explores the concept of narrative transportation, its application in storytelling, and specifically how it can be leveraged to create impactful personal narratives. By understanding the mechanisms of narrative transportation and implementing specific techniques, you can craft personal narratives that deeply engage audiences, foster connections, and potentially transform beliefs and attitudes.

Part I: Understanding Narrative Transportation

The Concept of Narrative Transportation

Narrative transportation refers to the psychological experience of being completely immersed in a story. When a person is transported into a narrative, they become absorbed cognitively, emotionally, and in the imagery of the story, regardless of the medium (print, film, podcasts, video games, etc.).
The concept was first formally defined by Richard Gerrig in his 1993 book “Experiencing Narrative Worlds,” where he used travel as a metaphor for reading. He described it as a state in which the reader—referred to as the traveler—becomes detached from their original reality due to deep engagement with the story.
Melanie Green and Timothy Brock later expanded on this idea, describing narrative transportation as “a distinct mental process, an integrative melding of attention, imagery, and feelings.” They conceptualized it as the experience of being carried away by a narrative.

Key Features of Narrative Transportation

  1. Focused attention – Receivers become so focused on the story they do not think about distractions in their environment
  2. Emotional Engagement – There is a strong emotional response to the story with empathy for the characters
  3. Mental Imagery – Receivers have a mental image of the story in their heads or try to predict what might happen in the future of the story
  4. Cognitive Detachment from Reality – Receivers feel like they are really in the story and not in their real-world environment
  5. Belief and Attitude Shift – Narrative transportation can influence receivers’ beliefs, attitudes, and behaviors, aligning them with the story’s content
  6. Reduced Counterarguing – When transported, the receiver is less likely to challenge the story’s content
  7. Long-term Efficacy – The receiver is impacted even after the story ends, affecting long-term behavior and attitude change

Psychological Mechanisms of Narrative Transportation

Reduced Counterarguing

Transporting stories create a more open, accepting mindset for individuals. When readers are transported in an entertaining narrative, they may not want to interrupt their enjoyment to argue with the story. This reduced counterarguing can occur even when the persuasive intent is obvious.

Connections with Characters

Audience members may:

  • See themselves in story characters
  • Come to see characters as friends
  • Admire the characters

Through these connections, audiences may change their attitudes and beliefs to align with those of narrative characters. Identification with and liking of story characters has been shown to increase the adoption of beliefs advocated by the character.

Mental Imagery

The transportation-imagery model highlights the role of visual imagery in transportation-based belief change. The experience of being transported into a story links vivid images with beliefs implied by the story. This connection between images and beliefs may be one basis for the power of narrative persuasion.

Emotional Engagement

Stories are particularly powerful when they evoke strong emotions. The pattern of emotional experience that occurs as readers progress through a story (emotional flow) can help maintain interest, particularly for longer narratives.

Reminding Readers of Personal Experiences

Stories can prompt self-examination and change by reminding readers of experiences in their own lives that relate to those in the narrative. Individuals who are more immersed in a story show greater generalization of the beliefs implied by the story, but additionally, whether the story brings to mind events from the readers’ real lives seems to be important in determining narrative impact.

Factors Affecting Narrative Transportation

Audience Factors:

  • World view (e.g., pro-environmental)
  • Prior knowledge and experience
  • Attitudes and beliefs
  • Empathy
  • Political ideology
  • Religiosity
  • Transportability (individual tendency to become immersed)
  • Need for affect (enjoyment of emotional experiences)
  • Need for cognition (enjoyment of thinking)

Story Factors:

  • Fiction vs nonfiction
  • First-person vs third-person
  • Well-developed characters
  • Intent of character
  • Personally relevant stories
  • Stories that stay engaging
  • Coherence and structure
  • Emotional intensity
  • Suspense and dramatic tension
  • Psychological realism
  • Cultural appropriateness

Part II: Narrative Transportation in Personal Narratives

Unique Aspects of Personal Narratives

Personal narratives differ from fictional narratives in several key ways when it comes to narrative transportation:

Authenticity and Vulnerability

  1. Authenticity as a Bridge: In personal narratives, authenticity serves as a bridge between the storyteller’s world and the audience’s. When authors share their genuine experiences, emotions, and vulnerabilities, they create a unique connection that can be more immediate than in fictional narratives.
  2. Vulnerability Creates Intimacy: Personal narratives that expose vulnerability create a level of intimacy that fictional narratives may struggle to achieve. Research by Hagmann et al. (2024) found that “self-revealing personal narratives create vulnerability by exposing the speaker to others’ negative judgment.” This vulnerability signals authenticity and honesty to listeners, increasing trust and connection.
  3. Real-life Consequences: Unlike fictional narratives, personal narratives carry real-life consequences for the storyteller, which can heighten the audience’s emotional investment. The willingness to incur potential interpersonal costs through self-disclosure signals that the speaker values honesty.

Identification and Social Identity

  1. Shared Experience Identification: In personal narratives, identification often occurs through shared experiences rather than character traits. Research on patient identification in cancer narratives found that shared experiences can create powerful connections regardless of demographic differences.
  2. Social Identity Influence: Personal narratives can activate social identities that influence transportation. According to social identity theory, perceived risk to a patient’s identity increases identification with salient social groups who possess a similar identity.
  3. Transportability Across Divides: Personal narratives have shown the ability to build trust across ideological divides. Research has found that people judge ideological opponents as more trustworthy when opposing opinions are expressed through a self-revealing personal narrative than through either data or stories about third parties.

Mechanisms of Narrative Transportation in Personal Narratives

Personalization and Intimacy

  1. Personalization Effect: Research suggests that personal narratives create a sense of “personalness” that drives effectiveness. Personalness is defined as “the level of intimacy delivered when narrating an experience, or the capability of eliciting a sense of intimacy with the ‘other.'”
  2. <Mental Representation: This sense of intimacy motivates engagement with the content and enables the mental representation of the narrative (narrative transportation), consequently increasing the likelihood of accepting the recommendation or message.

Self-Related Control Beliefs

  1. Empowerment Through Transportation: Research has found that “narratives in both written text and video form with protagonists displaying high versus low self-efficacy can, at least temporarily, affect recipients’ own self-related control beliefs when they experience strong transportation into the stories.”
  2. Mediation Through Transportation and Identification: The effect of narrative manipulation on self-related control beliefs was mediated by experienced transportation and identification, suggesting that personal narratives that facilitate transportation can influence how readers view their own capabilities.

Reduced Counterarguing in Personal Contexts

  1. Disarming Effect: Personal narratives may reduce counterarguing more effectively than fictional narratives because questioning someone’s lived experience can seem inappropriate or insensitive.
  2. Trust Building: Research has found that “trust does not suffer when explanations grounded in self-revealing personal narratives are augmented with data, suggesting that our results are not driven by quantitative aversion.”

Impact of Personal Narratives

Building Trust and Changing Beliefs

  1. Trust Across Differences: Personal narratives have shown the ability to build trust across ideological divides. People are more willing to collaborate with ideological opponents who support their views by embedding data in a self-revealing personal narrative, rather than relying on data-only explanations.
  2. Changing Health Behaviors: Research has found that “identification with the main character in the vignettes was a significant predictor of intentions to participate in cancer research, but only when the mediating role of narrative transportation was considered.”
  3. Self-Concept Changes: Personal narratives can influence how readers view themselves. Research has demonstrated that stories featuring strong protagonists who display high self-efficacy can increase recipients’ own self-related control beliefs when they are highly transported into the stories.

Profound Connections with Audiences

  1. Solace and Belonging: Stories that expose the raw, unfiltered emotions of their creators have the power to resonate with readers in a way that is impossible to replicate. In our own struggles and triumphs, we recognize a reflection of ourselves, and it’s in those moments that we find solace, encouragement, and a sense of belonging.
  2. Shared Human Experience: Personal narratives remind us that we are not alone in our experiences. They create a sense of shared humanity that can be particularly powerful for individuals facing challenges or difficult situations.

Part III: Techniques for Creating Impactful Personal Narratives

Structural Techniques

1. Clear Narrative Structure

A well-structured personal narrative guides readers through your emotional journey with ease. The basic structure includes:

  • Beginning: Establish your core idea and hook your readers
  • Middle: Develop your central idea using descriptive language to define your story, setting, characters, and plot
  • End: Summarize lessons learned, relate your core theme back to readers’ lives, and include a call to action

The ABCDE storytelling framework (from Anne Lamott’s book “Bird by Bird”) provides an excellent structure:

  • Action: Drop the reader right into the action. No need to “set the scene.”
  • Background: Once you have the reader’s attention, tell them why it matters.
  • Development: Build the story up. Where is the story going? What’s at risk?
  • Climax: This is the tipping point. The “aha!” The “oh no!” The “SURPRISE!”
  • Ending: Explain what it was all for. What was the lesson?

2. Start with a Strong Hook

Begin your personal narrative with something that immediately captures attention. This could be:
  • A surprising statement
  • A vivid scene
  • A provocative question
  • A moment of tension
The hook should relate to the core message of your narrative and give readers a reason to keep reading.

3. Focus on Transformation

Personal narratives are most impactful when they demonstrate transformation. Structure your narrative around:
  • Who you were before
  • The challenge or catalyst for change
  • The process of transformation
  • Who you became as a result
  • What this means for the reader

Emotional Connection Techniques

1. Vulnerability and Authenticity

The most powerful element in a personal narrative is the emotional connection you establish with readers. To create this:

  • Be honest and open about how experiences have impacted you
  • Include all relevant details, even embarrassing or painful ones
  • Share your feelings as well as your pain
  • Never be afraid to show vulnerability—this gives readers permission to accept their own
  • Write from a foundation of authenticity rather than trying to persuade or sell ideas

2. Descriptive Language and Sensory Details

Use descriptive language to paint a vivid picture in your reader’s mind:

  • Help readers visually experience colors and shapes
  • Create sensations of temperature, texture, and physical feelings
  • Incorporate scents, tastes, and sounds
  • Use all five senses to connect with readers’ emotions
  • Don’t limit your narrative to a sequence of causes and effects

3. Emotional Flow

The pattern of emotional experience throughout your narrative helps maintain interest:

  • Create and then resolve suspense about outcomes
  • Include changes in emotional tone across the course of the story
  • Balance difficult emotions with moments of insight or relief
  • Build emotional intensity toward key realizations
  • End with emotional resolution that feels satisfying

Connection with Readers

1. Make the Reader the Main Character

Your story isn’t really about you—the reader is the main character. To achieve this:

  • Have clarity around why your personal story is interesting to the READER
  • Connect your perspective with universal truths
  • Ask yourself: What’s the problem you want the reader to solve?
  • Consider: What’s the realization you want the reader to have?
  • Focus on: What’s the obstacle you want to help the reader overcome?

2. Identify the Takeaway

The takeaway explains why your story is important and creates a deeper emotional tie with the reader:

  • Connect your experience to something bigger
  • Identify the universal truth at play
  • Clarify what your experience says about you, life, or the world
  • Determine if your story is about frustration, injustice, loyalty, hope, etc.
  • Make the connection between your story and the message explicit

3. Relevance to Reader’s Life

Make your personal narrative relevant to readers’ lives:

  • Give readers a “heads up” so they can avoid pain you experienced
  • Provide relief if they’re currently going through a similar experience
  • Show how your lessons apply to their situations
  • Create points of similarity between your experience and theirs
  • Focus on shared human experiences that transcend specific circumstances

Finding Story-Worthy Material

1. Daily Reflection Practice

Use the “Homework for Life” framework (from Matthew Dicks’ book “Storyworthy”):

  • Take five minutes daily to reflect on your day
  • Ask: If you had to tell a story from today, what would it be?
  • Write one sentence about it
  • Look for difficult times, moments of growth, and proud or awkward experiences
  • Create two columns: date and story

2. Identify Emotional Significance

Pay attention to events that leave a strong impression:

  • What moments do you replay in your mind?
  • What makes you uncomfortable?
  • What reveals your vulnerability?
  • What forces you to confess a character flaw?
  • What challenges your core beliefs about yourself?

3. Choose Meaningful Experiences

Select experiences that have been transformational:

  • Events that caused you to rethink or reconsider your point of view
  • Circumstances that resulted in personal or professional growth
  • Painful situations that forced you to change your beliefs
  • Unique life experiences that taught you important lessons
  • Moments where you overcame adversity

Refinement Techniques

1. Editing and Revision

After completing your first draft:
  • Edit for grammar, spelling, and punctuation errors
  • Add any extra details you may have forgotten
  • Remove irrelevant information that doesn’t serve the core message
  • Ensure transitions between sections are smooth
  • Check that your narrative has a clear beginning, middle, and end

2. Seek Feedback

Get perspective from others:

  • Ask friends or family members to read your narrative
  • Be clear about what kind of feedback you’re looking for
  • Listen to their responses without taking criticism personally
  • Consider whether readers connect with your story as intended
  • Revise based on feedback to improve clarity and impact

3. Maintain an Enjoyable Process

While writing a personal narrative can be emotionally challenging:

  • Have fun with the process
  • Relax and tell your story naturally
  • Remember that the more relaxed you are, the more authentic your narrative will feel
  • View the process as potentially therapeutic and liberating
  • Recognize that sharing your story can benefit both you and your readers

Part IV: Practical Applications of Personal Narratives

Therapeutic and Personal Growth

Personal narratives can be used for:

  • Processing difficult experiences
  • Gaining clarity about life events
  • Discovering patterns in your behavior or thinking
  • Building self-awareness
  • Creating meaning from challenging circumstances

Professional and Leadership Contexts

Personal narratives are valuable in:

  • Building trust with teams or clients
  • Demonstrating values in action
  • Illustrating problem-solving approaches
  • Creating connection in professional relationships
  • Establishing credibility through lived experience

Educational and Inspirational Purposes

Share personal narratives to:

  • Teach important life lessons
  • Inspire others facing similar challenges
  • Provide practical guidance based on experience
  • Create empathy for different perspectives
  • Motivate positive change in others

Part V: A Framework for Applying Narrative Transportation in Personal Narratives

Step 1: Understand Your Purpose

Before crafting your personal narrative, clarify:
  • What you want to achieve with your story
  • Who your audience is
  • What change you hope to inspire
  • What beliefs or attitudes you want to influence
  • What emotional response you want to evoke

Step 2: Select the Right Experience

Choose an experience that:
  • Has transformed you in some way
  • Contains universal themes or lessons
  • Evokes strong emotions
  • Demonstrates vulnerability and authenticity
  • Has clear stakes and consequences

Step 3: Structure for Maximum Impact

Organize your narrative to:
  • Begin with a compelling hook
  • Establish context and background
  • Build tension through challenges and obstacles
  • Reach a climactic moment of realization or change
  • Resolve with reflection and meaning-making

Step 4: Enhance Transportation Elements

Deliberately incorporate elements that facilitate transportation:

  • Rich sensory details to create mental imagery
  • Emotional authenticity to create connection
  • Vulnerability to build trust
  • Universal themes to create relevance
  • Transformation to inspire hope

Step 5: Connect to the Reader

Make your narrative relevant by:

  • Explicitly stating the takeaway or lesson
  • Connecting your experience to broader human experiences
  • Addressing how your story might help the reader
  • Creating points of similarity between your journey and theirs
  • Inviting the reader to reflect on their own experiences

Step 6: Refine and Test

Improve your narrative by:

  • Getting feedback from trusted readers
  • Assessing whether it creates the intended emotional response
  • Checking if the core message is clear
  • Ensuring the narrative flows smoothly
  • Confirming that it feels authentic and honest

Conclusion

Narrative transportation offers a powerful framework for understanding how stories captivate us and influence our beliefs, attitudes, and behaviors. By applying the principles of narrative transportation to personal narratives, you can create stories that not only engage readers but potentially transform them.

Personal narratives have unique strengths in creating transportation through authenticity, vulnerability, and shared human experience. When crafted with intention and skill, they can build trust, foster connection, and inspire change in ways that other forms of communication cannot.

The techniques outlined in this guide provide practical approaches to enhancing narrative transportation in your personal stories. By focusing on structure, emotional connection, reader relevance, and meaningful content, you can create personal narratives that transport readers into your experiences and leave lasting impressions.

Whether you’re writing for personal growth, professional development, or to inspire others, understanding narrative transportation will help you craft more impactful personal narratives that resonate deeply with your audience.

References

  • Gerrig, R. J. (1993). Experiencing narrative worlds: On the psychological activities of reading.
  • Green, M. C., & Brock, T. C. (2000). The role of transportation in the persuasiveness of public narratives.
  • Hagmann, D., Ho, E. H., & Loewenstein, G. (2024). Personal narratives build trust.
  • Isberner, M. B., Richter, T., Schreiner, C., Eisenbach, Y., Sommer, C., & Appel, M. (2020). Empowering stories: Transportation into narratives with strong protagonists increases self-related control beliefs.
  • Lamott, A. (1994). Bird by Bird: Some Instructions on Writing and Life.
  • Neil, J. M., Gough, A., Kee, F., George, T. J., Pufahl, J., & Krieger, J. L. (2020). The influence of patient identification and narrative transportation on intentions to participate in cancer research.
  • Strange, J. J., & Leung, C. C. (1999). How anecdotal accounts in news and in fiction can influence judgments of a social problem’s urgency, causes, and cures.
  • Valenzuela, A., & Galli, L. (2024). The personalization effect: How personal narratives drive electronic word-of-mouth effectiveness.
  • Van Laer, T., De Ruyter, K., Visconti, L. M., & Wetzels, M. (2014). The extended transportation-imagery model: A meta-analysis of the antecedents and consequences of consumers’ narrative transportation.

Learn more about the coaching process or
contact me to discuss your storytelling goals!

Subscribe to the newsletter for the latest updates!

Copyright Storytelling with Impact® – All rights reserved

Nature of Intelligence – Episode Two – Language and Thought

In the previous post we looked at the question of What is Intelligence? This was based on the first episode of the Santa Fe Institute’s Complexity podcast series. In the second episode they looked into the relationship between language and thought. I had always assumed that language was the tool we used to express our thinking. And while that’s true, the full story is more complex than that.

In the opening Melanie states, “Language is the backbone of human culture.” But that statement is soon followed by the questions, “Are humans intelligent because we have language, or do we have language because we’re intelligent? How do language and thinking interact? And can one exist without the other?”

Like I said, the relationship is not so simple. Here are a few passages to prime your consciousness before you get into hearing the podcast.

At one point Gary Lupyan explains that, even with a very social & collaborative species like humans, if we take away language, we take away the major tool for creating culture and for transmitting culture. What he doesn’t say explicitly, but what I feel is inherently evident in that statement, is that culture is created and transmitted by way of how humans use language to tell stories.

While recapping the episode about mid-way, Melanie states that, “…language is an incredible tool for collaboration, and collaboration drives our intelligence.” It’s an interesting observation, as collaboration involves both doing things together, as well as sharing information by way of stories. But she also reminds us “…that language makes it easy to lie and to trick people.”

As the world deals with an avalanche of lies and misinformation from China, the U.S., and Russia, it’s a time for reflecting on the stark reality that intelligence and language don’t require a moral foundation. We can tell whatever story we want.


Transcript

Spoken and written language is completely unique to the human species, and it’s part of how we evolved. It’s the backbone of our societies, one of the primary ways we judge others’ intellect. So, are humans intelligent because we have language, or do we have language because we’re intelligent? How do language and thinking interact? And can one exist without the other? Guests: Ev Fedorenko, Steve Piantadosi, and Gary Lupyan

Ev Fedorenko: It is absolutely the case that not having access to language has devastating effects, but it doesn’t seem to be the case that you fundamentally cannot learn certain kinds of complex things.

Abha Eli Phoboo: From the Santa Fe Institute, this is Complexity.

Melanie Mitchell: I’m Melanie Mitchell.

Abha: And I’m Abha Eli Phoboo.

Melanie: Think about this podcast that you’re listening to right now. You’re, hopefully, learning by just listening to us talk to you. And the fact that you can take in new information this way, through what basically comes down to sophisticated vocal sounds, is pretty astonishing.

In our last episode, we talked about how one of the major ways humans learn is by being in the world and interacting with it. But we also use language to share information and ideas with each other without needing firsthand experience. Language is the backbone of human culture.

Abha: It’s hard to imagine where we’d be without it. If you’ve ever visited a country where you don’t speak the language, you know how disorienting it is to be cut off from basic communication. So in today’s episode, we’re going to look at the role language plays in intelligence. And the voices you’ll hear were recorded remotely across different countries, cities and work spaces.

Melanie: Are humans intelligent because we have language, or do we have language because we’re intelligent? How do language and thinking interact? And can one exist without the other?

Melanie: Part One: Why do humans have language?

Melanie: Across the animal kingdom, there are no other species that communicate with anything like human language.

Abha: This isn’t to say that animals aren’t communicating in sophisticated ways, and a lot of that sophistication goes unnoticed.

Melanie: But the way humans talk — with our long conversations and complex syntax — is completely unique. And it’s part of how we evolved.

Abha: For several decades, a dominant theory of human language was something called generative linguistics, or generative grammar.

Melanie: The linguist Noam Chomsky made this idea popular, and it basically goes like this: there’s an inherent, underlying structure of rules that all languages follow. And from birth, we have a hard-wired bias toward language as opposed to other forms of communication — we’re biologically predisposed to language and these syntactic rules. This is why human language is, according to Chomsky, unique to our species and universal across different cultures.

Abha: This theory has been incredibly influential. But it turns out, it doesn’t seem to be right.

Gary Lupyan: So I’ve never been a fan of generative linguistics, Chomsky’s kind of core arguments about universal grammar or the need for innate grammatical knowledge.

Abha: This is Gary Lupyan.

Gary: I am Gary Lupyan, professor of psychology at the University of Wisconsin-Madison. I’m a cognitive scientist. I study the evolution of language, the effects of language on cognition, on perception, and over the last few years trying to make sense of large language models like lots of other people.

Melanie: In recent years, the development of large language models has bolstered Gary’s dislike of generative grammar. The old thinking was that in order to use language well, you needed to be biologically wired to know these language rules from the start. But LLMs aren’t programmed with any grammatical rules baked into them. And yet, they spit out incredibly coherent writing.

Gary: Even before these large language models, there were plenty of arguments against that view. I think these are the last nails in the coffin. So I think producing, correct, grammatically sophisticated, even I’d argue semantically coherent language. These models can do all that even without, by modern standards, huge amounts of training. It shows that in principle, one does not need any of this type of innate grammatical knowledge.

Abha: So, what’s going on here? Steve Piantadosi is a psychology and neuroscience professor at UC Berkeley, studying how children learn language and math. He says that language does have rules, but those rules are emergent. They’re not there from the start.

Steve Piantadosi: I think that the key difference is that Chomsky and maybe mainstream linguistics tends to state its theories already at the high level of abstraction. They say, here are the rules that I think this system is following. Whereas in a large language model, when you go to build one, you don’t tell it the high level rules about how language works. You tell it the low level rules about how to learn and how to construct its own internal configurations. And you tell it that it should do that in a way that predicts language well. And when you do that, it kind of configures itself in some way.

Melanie: What’s an example of a high level rule?

Steve: For example, a high level rule in English is if you have a sentence, you can put it inside of another sentence with the word that. So I could say, “I drank coffee today.” That’s a whole sentence. And I could say, “John believed that I drank coffee today.” And because that rule is about how to make a sentence out of another sentence, you can actually do it again.

So I can say, “Mary doubted that John believed that I drank coffee today.” And if you were going to sit down and write a grammar of English, if you’re going to try to describe what the grammatical and ungrammatical sentences of English were, you’d have to have some kind of rule that said that, right? Because any English speaker you ask is going to tell you that, “John said that I drank coffee today.” is an acceptable English sentence.

And also “I drank coffee today.” is an acceptable English sentence. Large language models, when they’re built, don’t know anything like that rule. They’re just a mess of parameters and weights and connections, and they have to be exposed to enough English in order to figure out that rule.

And I’m pretty sure ChatGPT knows that rule, right? Because it can form sentences that have an embedded sentence in that way. So when you make ChatGPT, you don’t tell it that rule from the start, it has to construct it and discover it.

And I think what’s kind of interesting is that building a system like ChatGPT that can discover that rule doesn’t negate the existence of that rule in English speakers’ minds. So internally in ChatGPT somewhere, there has to be some kind of realization of that rule or something like it.

So the hope for these other theories, I think, or at least these other kind of basic observations about language, is that they will be realized in some way inside the internal configurations that these models arrive at.

I think it’s not quite that simple because the large language models are much better than our theories. So we don’t have any kind of rule-based account of anything that comes close to what they can do. But they have to have something like that because they exhibit that behavior.

Abha: And we should say, these rules we’re talking about are not the same as the quote-unquote “rules” you learn in school, like when your teacher tells you how to use prepositions or, “don’t split an infinitive.”

Steve: Yeah, sorry, let me just clarify. In linguistics or in cognitive science, when people talk about rules like this, they don’t mean the rules like don’t split infinitives. Basically anything you heard from an English teacher, you should just completely ignore in cognitive science and linguistics. It’s just made up. I mean, it’s literally made up, often just to reinforce class distinctions and things.

The kinds of rules that linguistics and cognitive science are interested in are ones which are descriptive, that talk about how people actually do speak. People do split infinitives, right, and they do end sentences with prepositions and, you know, pretty much any rule you’ve ever heard from an English teacher, they had to tell you because it’s going against how you naturally speak.

So that’s just some weird class thing, I think, that’s going on. And what we’re interested in are the kind of descriptive rules of how the system is kind of actually functioning in nature. And in that case, most people are just not even aware of the rules.

Melanie: Apologies to all the English teachers out there.

Abha: But to recap, language does have innate rules, like the “that” rule that Steve described, but we’re not born with these rules already hardwired into our brains. And the rules that linguists have documented so far aren’t as complete and precise as the actual rules that exist — the statistical patterns that ChatGPT has probably figured out and encoded at some point during its training period.

Melanie: Yet, none of this explains why we humans are using complex language, but other animals aren’t. I asked Gary what he thought about this.

Melanie: So there’s a lot of debate about the role language plays in intelligence. Is language a cause of or a result of humans’ superiority over other animals in certain kinds of cognitive capacities?

Gary: I think language is one of the major reasons why human intelligence is what it is. So more the cause than the result. There is something, obviously, in our lineage that makes us predisposed to language. I happen to think that what that is has much more to do with the kind of drive to share information, to socialize, than anything language specific or grammar specific.

And you see that in infants, infants want to engage. They want to share information, not just use language in an instrumental way. So it gives us access to information that we otherwise wouldn’t have access to.

And then it’s a hugely powerful tool for collaboration. You can make plans, you can ask one another to help. You can divide tasks in much more effective ways. And so without language, even if you take a very social, collaborative species like humans, you take away language and you take away the major tool for creating culture and for transmitting culture.

Melanie: Just to follow up, chimps and bonobos are very social species and have a lot of communication within their groups. Why didn’t they develop this drive you’re talking about for language? Why did we develop it and not them?

Gary: It’s only useful to a particular kind of species, a particular type of niche. So it has a really big startup cost. So kids have to learn this stuff. Their language is kind of useless to them before they put in the years that it takes to learn it. It’s also, and many have written on this, language is also very easy to lie with.

So it’s an unreliable system. Words are cheap. And so, reliance on language sort of only makes sense in a society that already has a kind of base level of trust. And so, I think the key to understanding the emergence of language is understanding the emergence of that type of prosociality that language then feeds back on and helps accelerate, but it needs to be there.

And so if you look at other primate societies, there is cooperation within kin groups. There is not broad scale cooperation. There is often aggression. There’s not sharing. So language just doesn’t make sense.

Abha: As Gary mentioned, there’s a huge startup cost for learning language. Humans have much longer childhoods than other species.

Ev: Ever since we’re born, we start paying attention to all sorts of regularities in the inputs we get, including in linguistic inputs.

Abha: This is Ev Fedorenko. Ev’s a neuroscientist at MIT, and she’s been studying language for the past two decades. As she mentioned, we start learning language from day one. That learning includes internalizing the structure and patterns that linguists used to assume were innate.

Ev: We start by paying attention to how sounds may go together to form kind of regular patterns like syllables and various transitions that are maybe more or less common. Pay attention to that. Then we figure out that some parts of that input correspond to meanings.

The example I often say is like every time mama says cat, there’s this fuzzy thing around, maybe it’s not random, right? And you kind of start linking parts of the linguistic input to parts of the world. And then of course you learn what are the rules for how you put words together to express more complex ideas.

So all of that knowledge seems to be stored in what I call the language system. And those representations are accessed both when I understand what somebody else is saying to me, because I have to map, I have to use this form of meaning mapping system to decode your messages, and when I have some abstract thing in my mind, an idea, and I’m trying to express it for someone else using this shared code, which in this case is English, right?

Abha: And often, we learn this shared code by interacting with our surroundings. Like, as Ev described, learning about a cat if there’s a cat in the room with you.

Melanie: But, you could also learn about cats without being able to interact with one. Someone could tell you about a cat, and you could start to create an idea for this thing called, “cat,” which you’ve never seen, but you know that it has pointy ears, it’s furry, and it makes a low rumbling sound when it’s content. That’s the power of language. Here’s Gary again.

Gary: So much of what we learn, and it’s very difficult to quantify, to put a number on, like what percent of what we know we’ve learned from talking to others, from reading. Most of formal education takes that role, right? It would not be possible in the absence of certainly not without language, but even without written language. If you have enough language training, you can just kind of map onto the visual world.

And we’ve done my lab, some work connecting it to, previously collected data from people who are born congenitally blind and the various things that they surprisingly learn about the visual world that one would think is only learnable through direct experience showing that well, normally sighted people might be learning it through direct experience, but a lot of that information is embedded in the structure of language.

Abha: And when we learn through language, we’re not just learning about physical objects. Language gives us the ability to name abstract concepts and categories, too. For instance, if you think about what the word “shoe” means, it refers to a type of object, but not one specific thing.

Steve: We wrote a paper about this and gave the example of shoes that were made out of eggplant skins. You could imagine doing that, drying out an eggplant skin and sewing up the sides and adding laces and fitting it around your feet and whatever. And you’ve probably never encountered shoes made out of eggplants before, but we all just agreed that that could happen. That you could find them.

And so that tells you that it’s not the physical object exactly that’s defining what the concept means. Because I just gave you a new physical object. It has to be something more abstract, more about the relationships and the use of it that defines what the thing is. I don’t think it’s so crazy to think that, you know, language is special in some way.

There’s certainly lots of things that we acquire through language. Right, this is, I think, especially salient if you talk to a kid and they’re asking why questions and you explain things that are abstract and that you can’t show them just in language and they can come to pretty good understandings of systems that they’ve never encountered before, you know, if they ask how clouds form or, you know, what the moon is doing or whatever, right? All of those are things that we learn about through a linguistic system.

So the right picture might be one where there’s a small kind of continuous or quantitative change in memory capacity that enables language, but then once you have language that opens up this kind of huge learning potential for cultural transmission of ideas and learning complicated kinds of things from your parents and from other people in your community.

Melanie: So Abha, we asked at the beginning of the episode why humans have language. And what we’ve heard from Gary, Steve, and Ev so far is that language probably emerged as a result of humans’ drive to socialize and to collaborate. And there’s a feedback effect between these social drives and language itself. So language is an incredible tool for collaboration, and collaboration drives our intelligence. Gary, for example, thinks that language is a major cause of human intelligence being what it is.

Abha: Right, right. It was interesting how Steve also pointed out that language enables a whole new way of learning and of cultural evolution. Language allows us to quickly learn new things, you know, from the people around us, say our parents, our friends, and other people we interact with.

It also lets us learn without having to experience something ourselves. Say, for example, when we are walking with our parents when we were little and they said, you know, “Don’t jump out in front of the car.” We tend to trust them and not have to experience it ourselves. And this is enabled because of language, right?

Melanie: Yeah, we should definitely appreciate our parents more. But on the downside, Gary also pointed out that language makes it easy to lie and to trick people. So relying on language only makes sense when society has a basic level of trust.

Abha: That is so true. I mean, if we don’t trust each other, it’s hard to function as a society, but trust comes at such a high cost too. And the other downside of language, you know, requires a long learning period because we can’t learn a language overnight. We’re not born necessarily speaking a language. Our childhood is so prolonged and that’s another high cost.

Melanie: Yeah. So the advantages of language must have outweighed those downsides in evolution.

Abha: Yes. Another interesting point that just came up is that today’s large language models have shown that certain linguistic theories are just wrong. Steve claims that LLMs have disproven Noam Chomsky’s notion of an innate universal grammar in the brain, right?

Melanie: Yeah, people have really changed their thinking about how language works in the brain. In part two, we’ll look at what brain imaging can tell us about language and what happens when people lose their language abilities.

Abha: Part Two: Are language and thought separate in the brain?

Abha: One of Ev’s signature methods is using fMRI brain scans to examine which systems in the brain light up when we use language. She and her collaborators have developed experiments to investigate the relationship between language and other forms of cognition.

Ev: It’s very simple. I mean, the logic of the experiments where we’ve looked at the relationship between language and thought is all pretty much the same, just using different kinds of thought. But the idea is you take individuals, put them in an fMRI scanner, and you have them do a task that you know reliably engages your language regions.

Abha: This could be, for example, reading or listening to coherent sentences while your brain is being scanned. Then, that map would be compared to the regions that light up when you hear sequences of random words and sounds that sound speech-like, but are completely nonsensical.

Ev: And if you guys visit MIT, I can scan you and print you a map of your language system. It takes about five minutes to find. Very reliable. And again, if I scan you today or 10 years later, I’ve done this on some people 10 years apart, it’s in exactly the same place. It’s very reliable within people. It’s very robust, so we find those language regions. And then we basically ask, okay, let’s have you engage in some form of thinking.

Maybe have you solve some math problems, or do something like some kind of pattern recognition test, and we basically ask, do circuits that light up when you process language overlap with the circuits that are active when, for example, when you engage in mathematical reasoning, like doing addition problems or whatnot. And we basically very consistently find across many domains of thought pretty much everything we’ve looked at so far, we find that the language regions are not really active, hardly at all, and some other system non-overlapping with the language regions is working really hard. So it’s not the case that we engage the language mechanisms to solve these other problems.

Melanie: I know there’s been some controversy about how easy it is to interpret the results of fMRI. What can you tell us, is that a hard thing to do? Is it an easy thing to do?

Ev: I don’t think there’s any particular challenge in interpreting fMRI data than any other data. I mean, you want to do robust and rigorous research. Before you make a strong claim based on whatever findings, you want to make sure that your findings tell you what you think they are, but that’s kind of a challenge for any research.

I don’t think it’s related to particular measurements you’re taking. I mean, there are certainly limitations of fMRI, and one of them is that we can’t look at fast time scales of information processing. We just don’t have access to what’s happening on a millisecond or tens of milliseconds or even hundreds of milliseconds time scale, which for some questions, it doesn’t matter, but for some questions, it really does. And so that makes fMRI not well suited for those questions where it matters. But in general, good robust findings from fMRI are very robustly replicable.

Steve: I’ve been actually very convinced by Ev’s arguments in particular.

Abha: That’s Steve Piantadosi again.

Steve: You can find people who are experts in some domain, like mathematics experts or chess grandmasters or whatever, who have lost linguistic abilities. And that is a very nice type of natural experiment that shows you that the linguistic abilities aren’t the kind of substrate for reasoning in those domains, because you can lose the linguistic abilities and still have the reasoning abilities.

There might still be a learning story. It would probably be very hard to learn chess or learn mathematics without having language. But I think that once you learn it, or learn it well enough to become an expert, it seems like there’s some other kind of system or some other kind of processing that happens non-linguistically. What it shows you is that you can be really good at language without having the ability to do the kind of sequential, multi-step reasoning that seems to characterize human thinking.

And that I think is surprising. It didn’t have to be like that. It could have been that language was the substrate that we used for everything or that language was such a difficult problem that if you solved language, you would necessarily have to have all of the underlying kind of reasoning machinery that people have. But it seems that that’s not right, that you can do quite a bit in language without having much reasoning.

Abha: And on the flipside, you can do a lot of reasoning without language. As Ev mentioned before, she and her collaborators have identified language systems in the brain that show up very reliably in fMRI scans. These language systems are mostly in the left hemisphere. So, what happens if someone loses these systems completely?

Ev: This fMRI approach is very nicely complemented by investigations of patients with severe language problems, right? So another approach, this one we’ve had around for much longer than fMRI, is to take individuals who have sustained severe damage to the language system, and sometimes left hemisphere strokes are large and they pretty much wipe out that whole system.

So these are so-called individuals with global aphasia. They can’t, if you give them a sentence, they cannot infer any meaning from this. And we know it’s not a low level deficit, because you can establish that it’s across modalities, like written and spoken, and so on. So it seems like the linguistic representations that they’ve set up for meaning mapping, that they’ve spent their lifetime learning, is lost, is really destroyed. And then you can ask about the cognitive capacities in these individuals. Can they still think complex thoughts?

And how do you test this? Well, you give them behavioral tasks. And for some of them, of course, you have to be a very clever experimentalist because you can no longer explain things verbally. But people come up with ways to get instructions across. They understand kind of thumbs up, thumbs down judgments.

So you give them well-formed or ill-formed mathematical expressions or musical patterns or something like that. And what you find is that, there are some individuals who are severely linguistically impaired — the language system is gone for as best as we can test it with whatever tools we have, and yet, they’re okay cognitively. They just lost that code to take the sophistication of their inner minds and translate it into this shared representational format.

And a lot of these individuals are severely depressed because they’re taken to be mentally challenged, right? Because that’s how we often judge people, is by the way they talk. That’s why foreigners often suffer in this way too. Judgments are made about their intellectual capacities and otherwise and so on.

Anyway, a lot of these individuals seem to have the ability to think quite preserved, which suggests that at least in the adult brain, you can take that language system out once you’ve acquired that set of knowledge bits, right? You can take it out and it doesn’t seem to affect any of the thinking capacities that we’ve tested so far.

Melanie: So here’s an extremely naive question. So if language and thought are dissociated, at least in adults, why does it feel like when I’m thinking that I’m actually thinking in words and in language?

Ev: That’s a great question that comes up quite often, not naive at all. It’s a question about the inner voice. A lot of people have this percept that there is a voice in their heads talking. It’s a good question to which I don’t think we as a field have very clear answers yet about what it does, what mechanisms it relies on.

What we do know is that it’s not a universal phenomenon, which tells you that it cannot be a critical ingredient of complex thought because certainly a lot of people who say that they don’t have an inner voice, some of them are MIT professors and they’re like, “What are you talking about? You have a voice in your head? That’s not good. Have you seen a doctor?”

And it’s a very active area of research right now. A lot of people got interested in this. You may have heard about 10 years ago, there was a similar splash about aphantasia, this inability of some people to visually image things, so similar like how some people don’t know what you mean when you say you have an inner voice, some people cannot form mental images.

Like, you say “Imagine the house you lived in when you were a child,” and they’re like “Got nothing there.” You know, it’s blank, I just can’t form that mental image. I can describe it, I know facts about it, but I can’t form that mental image. And these kinds of things like inner voice mental imagery, those are very hard things to study with the methods that we currently have available.

Abha: Yeah, I think I was talking to someone who actually told me they don’t have an inner voice and they actually are left with a feeling, but they can’t necessarily describe the feeling. And so they don’t know how to put it into language when they have a thought.

Ev: That’s a very good point because my husband ,who doesn’t have an inner voice, often uses this as an argument. “If we were thinking in language, why is it sometimes so hard to explain what you think? You know you have this idea very clearly for yourself and you just have trouble formulating it.” That’s a good point.

Melanie: But, Gary sees the relationship between language and thought a bit differently. He doesn’t think they can be separated so neatly.

Gary. I think Ev and her lab are doing fabulous work and we agree on many things. This is one thing we don’t agree on.

Melanie: In Ev’s example, patients who have had strokes lost their language systems in the brain, but they could still do complex cognitive tasks. They didn’t lose their ability to think.

Gary: So it’s possible to find individuals with aphasia that have typical behavior. And so that shows that at least in some cases, one can find cases where language is not necessary. So there are two complications with this. One is that people tend to have aphasia due to a stroke that tends to happen in older age. And so they’ve had a lifetime of experience with language. And so, just because a task doesn’t light up the language network doesn’t mean the task does not rely on language.

It doesn’t mean that language has not played a role in basically setting up the brain that you have as an adult, such that you don’t need language in the moment, but you needed exposure to language to enable you to do the task in the first place.

Abha: We asked Ev what she made of this argument, that even if language isn’t necessary in the moment, it still plays a big role in developing your adult brain. But she doesn’t think it’s as important as Gary does. She refers to another population of people, which are individuals who are born deaf and aren’t taught sign language.

Ev: Unless there are other signers in the community, or unless they’re moved into an environment where they can interact with the signers, they often grow up not having input to language. Especially if they’re in an isolated community. Growing up they figure out some system called home sign, which is a very, very basic system.

And so you can ask whether these individuals are able to develop certain thinking capacities. And it is absolutely the case that having… not having access to language has devastating effects, right? You can’t build relationships in the same way. You can’t learn as easily. Of course, through language I can just tell you all sorts of things about the world. Most of the things you probably know, you learned through language, but it doesn’t seem to be the case that you fundamentally cannot learn certain kinds of complex things.

So there are examples of individuals like that who have been able to learn math. Okay, it takes them longer. If you don’t have somebody to tell you how to do differential equations you can figure it out from whatever ways you can. So it’s certainly the case that language is an incredibly useful tool. And presumably, the accumulation of knowledge over generations that has happened has allowed us to build the world we live in today. But it doesn’t undermine the separability of those language and thinking systems.

Abha: In a lot of areas, it seems that Gary, Steve, and Ev are on the same page. Language has helped humans achieve incredible things, and it’s a very useful tool.

Melanie: But where they seem to differ is on just how much language and thought influence each other, and in which direction the causal arrow is pointing: Does language make us intelligent, or is language is the result of our intelligence? Ev’s work shows that many types of tasks can be done without lighting up the language systems in the brain. When combined with examples from stroke patients and other research, she has reason to believe that language and cognition are largely separate things.

Abha: Gary, on the other hand, isn’t ready to dismiss the role of language so easily — it could still be crucial for developing adult cognition, and, generally speaking, some people might rely on it more than others.

Melanie: And Steve offers one more example of how language can make our learning more efficient, regardless of whether or not it’s strictly necessary.

Steve: So, if you’re an expert in any domain, you know a ton of words and vocabulary about that specific domain that non-experts don’t, right? That’s true in scientific domains if you’re a physicist versus a biologist, but it’s also true in non-scientific domains. People who sew know tons of sewing words and people who are coal miners know tons of coal mining words and I think that those words are, as we were discussing, real technologies. They’re real cultural innovations that are very useful.

That’s why people use those words, because they need to convey a specific meaning in a specific situation. And by having those words, we’re probably able to communicate more efficiently and more effectively about those specific domains. So I think that this kind of ability to create and then learn domain specific vocabularies is probably very important and probably allows us to think all kinds of thoughts that otherwise would be really, really complicated.

Imagine being in a situation where you don’t have the domain specific vocabulary and you have to describe everything, and it becomes very clunky and hard to talk about. That’s why in sciences, especially, we come up with terms, so it really enables us to do things that would be really hard otherwise.

Melanie: Steve isn’t saying that it’s impossible to learn specific skills without language, but from his perspective, it’s more difficult and less likely.

Abha: But Ev has a slightly different view.

Ev: There are cultures, for example, human cultures that don’t don’t have exact math. Like the Peter Ha or the Chimani, like some tribes in the Brazilian Amazon, they don’t have numbers because they don’t need numbers.There are people who will make a claim that they don’t have numbers because they don’t have words for numbers.

And I don’t understand how the logic goes in this direction. I think they don’t have words for numbers because they don’t have the need for numbers in their culture. So they don’t come up with a way to refer to those concepts. Then of course, there’s different stories for why numbers came about. One common story has to do with farming, right?

When you have to keep track of entities that are similar, like 200 cows, and you want to make sure you left with them and came back with whatever 15 cows. And then you figure out some counting system, typically using digits, right? A lot of cultures start with digits. Anyway, and then you come up with words. And once you have labels for words, of course you can then do more things. You can solve tasks that require you to hold onto those.

But it’s not like not having words prevents you from figuring out a system of thought and representation to keep track of that information. So I think the directionality is in a different way than some people have put it forward.

Abha: So Melanie, our question for the spot of the episode was about whether language and thought are separate in the brain. And Ev seems to have very compelling evidence that they’re separate.

Melanie: Yeah, her results with fMRI were really surprising to me.

Abha: Right? Me too. Both Steve and Ev stress that language makes communication between people very efficient, but point out that when people lose their language abilities, say because of a stroke or some other injury, it’s often the case that their thinking, that is their non-linguistic cognitive abilities, are largely unaffected.

Melanie: But Abha, Gary pushed back on this. He noted that people who have had strokes tend to be older with cognitive abilities that they’ve had for a long time. So Gary pointed out that maybe you need language to enable cognition in the first place. And his own research has shown that this is true to some extent.

Abha: I guess there are really two questions here. First, do language and cognition really need to be entangled in the brain during infancy and childhood when both linguistic and cognitive skills are still being formed? And the second is, are language and cognition separate in adults who have established language and cognitive abilities already?

Melanie: Exactly. Ev’s work addresses the latter question, but not the former. And Ev admits that the neuroscience and psychology of language have been contentious fields for a long time. Here’s Ev.

Ev: Language has always been a very controversial field where people have very strong biases and opinions. The best I can do is try to be open minded and just keep training people to do rigorous work and to think hard about even the fundamental assumptions in the field. Those should always be questioned. Everything should always be questioned.

Abha: So here’s another question: what does all of this mean for large language models? In theory, the skills LLMs have exhibited are the same skills that map onto the language systems in the brain. They have the formal competence of patterns and language rules. But, if their foundations are statistical patterns in language, how much thinking can they do now, and in the future? And how much have they learned already?

Murray Shanahan: I mean, people sometimes use the word, an alien intelligence. I prefer the word exotic. It’s a kind of exotic mind-like entity.

Melanie: That’s next time, on Complexity. Complexity is the official podcast of the Santa Fe Institute. This episode was produced by Katherine Moncure, and our theme song is by Mitch Mignano. Additional music from Blue Dot Sessions. I’m Melanie, thanks for listening.

Learn more about the coaching process or
contact me to discuss your storytelling goals!

Subscribe to the newsletter for the latest updates!

Copyright Storytelling with Impact® – All rights reserved

Nature of Intelligence – Episode One – What is Intelligence

I tend to think of storytelling as sitting at the intersection of four elements:

  • Consciousness — awareness of self, the environment, and our thoughts
  • Intelligence — ability to learn, understand, reason, and solve problems
  • Imagination — create mental images, ideas, or concepts beyond reality
  • Creativity — generate original ideas, solutions, and artistic expressions

They’re different terms, of course, yet you can see how they interact with each other. It’s also apparent that they’re involved in the process of creating stories. They’re so fundamental, in fact, that they go a long way towards describing what makes us human. But the funny thing is, science doesn’t know how to accurately define any of these concepts.

While thousands of hours have been spent seeking answers, and scientists can talk for days on end about their findings, it is still a mystery. Take Shakespeare, for example. How did he utilize these aspects of humanity to create something as magical as Hamlet? And if we can’t properly describe one of these elements, how do we explain how they work together? And extending beyond us mortals, will AI ever be able to replicate this magic?

So when I ran across the third season of Santa Fe Institute’s Complexity podcast, which is devoted to the exploration of Intelligence, I had to listen in, and if you’re interesting in how we create stories in our head, I recommend you do the same, as it looks at the concept of intelligence through a human lens, as well as from the lens of artificial intelligence.

17th Century Playwrite in England
There’s so much information in this first episode, but I wanted to share four quotes that intrigued me. First off is this notion of “common sense”. It seems simple, but again, it’s illusive to capture in words. How would you describe it?

Common sense gives us basic assumptions that help us move through the world and know what to do in new situations. But it gets more complicated when you try to define exactly what common sense is and how it’s acquired. ~ Melanie Mitchell

This notion of an equivalent phenomenon describes much of the human / AI debate, as there is a sense that a machine will never be human, but maybe it can be close enough.

I think there’s a difference between saying, can we reach human levels of intelligence when it comes to common sense, the way humans do it, versus can we end up with the equivalent phenomenon, without having to do it the way humans do it. ~ John Krakauer

This goes back to the reality that we don’t know what makes humans human, so how are we to compare a computer algorithm to what it means to be us?

I think it’s just again, a category mistake to say we’ll have something like artificial general intelligence, because we don’t have natural general intelligence. ~ Alison Gopnik

But we’re more than thinking animals. We have emotions. Fall in love, feel pain, express joy and sorrow. Or in this case, grief. Computers are learning how to simulate emotions such as grief, but is that even possible?

I don’t know what it would mean for a computer to feel grief. I just don’t know. I think we should respect the mystery. ~ John Krakauer

So here goes, take a listen to Episode 1 and see what you think. The transcript is below if you feel so inclined (as I did) to follow along. It’s some heady stuff.

Transcript

Alison Gopnik: It’s like asking, is the University of California Berkeley library smarter than I am? Well, it definitely has more information in it than I do, but it just feels like that’s not really the right question.

Abha Eli Phoboo: From the Santa Fe Institute, this is Complexity.

Melanie Mitchell: I’m Melanie Mitchell.

Abha: And I’m Abha Eli Phoboo.

Abha: Today’s episode kicks off a new season for the Complexity podcast, and with a new season comes a new theme. This fall, we’re exploring the nature and complexity of intelligence in six episodes — what it means, who has it, who doesn’t, and if machines that can beat us at our own games are as powerful as we think they are. The voices you’ll hear were recorded remotely across different locations, including countries, cities and work spaces. But first, I’d like you to meet our new co-host.

Melanie: My name is Melanie Mitchell. I’m a professor here at the Santa Fe Institute. I work on artificial intelligence and cognitive science. I’ve been interested in the nature of intelligence for decades. I want to understand how humans think and how we can get machines to be more intelligent, and what it all means.

Abha: Melanie, it’s such a pleasure to have you here. I truly can’t think of a better person to guide us through what, exactly, it means to call something intelligent. Melanie’s book, Artificial Intelligence: A Guide for Thinking Humans, is one of the top books on AI recommended by The New York Times. It’s a rational voice among all the AI hype in the media.

Melanie: And depending on whom you ask, artificial intelligence is either going to solve all humanity’s problems, or it’s going to kill us. When we interact with systems like Google Translate, or hear the buzz around self-driving cars, or wonder if ChatGPT actually understands human language, it can feel like AIis going to transform everything about the way we live. But before we get carried away making predictions about AI, it’s useful to take a step back. What does it mean to call anything intelligent, whether it’s a computer or an animal or a human child?

Abha: In this season, we’re going to hear from cognitive scientists, child development specialists, animal researchers, and AI experts to get a sense of what we humans are capable of and how AI models actually compare. And in the sixth episode, I’ll sit down with Melanie to talk about her research and her views on AI.

Melanie: To kick us off, we’re going to start with the broadest, most basic question: what really is intelligence, anyway? As many researchers know, the answer is more complicated than you might think.

Melanie: Part One: What is intelligence?

Alison: I’m Alison Gopnik. I’m a professor of psychology and affiliate professor of philosophy and a member of the Berkeley AI Research group. And I study how children manage to learn as much as they do, particularly in a sort of computational context. What kinds of computations are they performing in those little brains that let them be the best learners we know of in the universe?

Abha: Alison is also an external professor with the Santa Fe Institute, and she’s done extensive research on children and learning. When babies are born, they’re practically little blobs that can’t hold up their own heads. But as we all know, most babies become full-blown adults who can move, speak, and solve complex problems. From the time we enter this world, we’re trying to figure out what the heck is going on all around us, and that learning sets the foundation for human intelligence.

Alison: Yeah, so one of the things that is really, really important about the world is that some things make other things happen. So everything from thinking about the way the moon affects the tides to just the fact that I’m talking to you and that’s going to make you change your minds about things. Or the fact that I can pick up this cup and spill the water and everything will get wet. Those really basic cause and effect relationships are incredibly important.

And they’re important partly because they let us do things. So if I know that something is gonna cause a particular effect, what that means is if I wanna bring about that effect, I can actually go out in the world and do it. And it underpins everything from just our everyday ability to get around in the world, even for an infant, to the most incredible accomplishments of science. But at the same time, those causal relationships are kind of mysterious and always have been. How is it? After all, all we see is that one thing happens and another thing follows it. How do we figure out that causal structure?

Melanie: So how do we?

Alison: Yeah, good question. So that’s been a problem philosophers have thought about for centuries. And there’s basically two pieces. And anyone who’s done science will recognize these two pieces. We analyze statistics. So we look at what the dependencies are between one thing and another. And we do experiments. We go out, perhaps the most important way that we understand about causality is you do something and then you see what happens and then you do something again and you say, wait a minute, that happened again.

And part of what I’ve been doing recently, which has been really fun, is just look at babies, even like one year olds. And if you just sit and look at a one year old, mostly what they’re doing is doing experiments. I have a lovely video of my one-year-old grandson with a xylophone and a mallet.

Abha: Of course, we had to ask Alison to show us the video. Her grandson is sitting on the floor with the xylophone, while his grandfather plays an intricate song on the piano. Together, they make a strange duet.

And it’s not just that he makes the noise. He tries turning the mallet upside down. He tries with his hand a bit. That doesn’t make a noise. He tries with a stick end. That doesn’t make a noise. Then he tries it on one bar and it makes one noise. Another bar, it makes another noise. So when the babies are doing the experiments, we call it getting into everything. But I increasingly think that’s their greatest motivation.

Abha: So babies and children are doing these cause and effect experiments constantly, and that’s a major way that they learn. At the same time, they’re also figuring out how to move and use their bodies, developing a distinct intelligence in their motor systems so they can balance, walk, use their hands, turn their heads, and eventually, move in ways that don’t even require much thinking at all.

Melanie: One of the leading researchers on intelligence and physical movement is John Krakauer, a professor of neurology, neuroscience, physical medicine, and rehabilitation at the Johns Hopkins University School of Medicine. John’s also in the process of writing a book.

John Krakauer: I am. I’ve been writing it for much longer than I expected, but now I finally know the story I want to tell. I’ve been practicing it.

Melanie: Well, let me ask, I just want to mention that the subtitle is Thinking versus Intelligence in Animals, Machines and Humans. So I wanted to get your take on what is thinking and what is intelligence.

John: Oh my gosh, thanks Melanie for such an easy softball question.

Melanie: Well, you’re writing a book about it.

John: Well, yes, so… I think I was very inspired by two things. One was how much intelligent adaptive behavior your motor system has even when you’re not thinking about it. The example I always give is when you press an elevator button before you lift your arm to press the button, you contract your gastrocnemius in anticipation that your arm is sufficiently heavy, that if you didn’t do that, you’d fall over because your center of gravity has shifted. So there are countless examples of intelligent behaviors. In other words, they’re goal-directed and accomplish the goal below the level of overt deliberation or awareness.

And then there’s a whole field, what are called long latency stretch reflexes, these below the time of voluntary movement, but sufficiently flexible to be able to deal with quite a lot of variation in the environment and still get the goal accomplished, but it’s still involuntary.

Abha: There’s a lot that we can do without actually understanding what’s happening. Think about the muscles we use to swallow food, or balance on a bike, for example. Learning how to ride a bike takes a lot of effort, but once you’ve figured it out, it’s almost impossible to explain it to someone else.

John: And so it’s what, Daniel Dennett, you know, who recently passed away, but was very influential for me with what he called, competence with comprehension versus competence without comprehension. And, you know, I think he also was impressed by how much competence there is in the absence of comprehension. And yet along came this extra piece, the comprehension, which added to competence and greatly increased the repertoire of our competences.

Abha: Our bodies are competent in some ways, but when we use our minds to understand what’s going on, we can do even more. To go back to Alison’s example of her grandson playing with a xylophone, comprehension allows him, or anyone, playing with a xylophone mallet to learn that each side of it makes a different sound.

If you or I saw a xylophone for the first time, we would need to learn what a xylophone is, what a mallet is, how to hold it, and which end might make a noise if we knocked it against a musical bar. We’re aware of it. Over time we internalize these observations so that every time we see a xylophone mallet, we don’t need to think through what it is and what the mallet is supposed to do.

Melanie: And that brings us to another, crucial part of human intelligence: common sense. Common sense is knowing that you hold a mallet by the stick end and use the round part to make music. And if you see another instrument, like a marimba, you know that the mallet is going to work the same way. Common sense gives us basic assumptions that help us move through the world and know what to do in new situations. But it gets more complicated when you try to define exactly what common sense is and how it’s acquired.

John: Well, I mean, to me, common sense is the amalgam of stuff that you’re born with. So you, you know, any animal will know that if it steps over the edge, it’s going to fall. Right. What you’ve learned through experience that allows you to do quick inference.

So in other words, you know, an animal, it starts raining, it knows it has to find shelter. Right? So in other words, presumably it learns that you don’t want to be wet, and so it makes the inference it’s going to get wet, and then it finds a shelter. It’s a common sense thing to do in a way.

And then there’s the thought version of common sense. Right? It’s common sense that if you’re approaching a narrow alleyway, your car’s not gonna fit in it. Or if you go to a slightly less narrow one, your door won’t open when you open the door. Countless interactions between your physical experience, your innate repertoire, and a little bit of thinking. And it’s that fascinating mixture of fact and inference and deliberation. And then we seem to be able to do it over a vast number of situations, right?

In other words, we just seem to have a lot of facts, a lot of innate understanding of the physical world, and then we seem to be able to think with those facts. And those innate awarenesses. That, to me, is what common sense is. It’s this almost language-like flexibility of thinking with our facts and thinking with our innate sense of the physical world and combinatorially doing it all the time, thousands of times a day. I know that’s a bit waffly. I’m sure Melanie can do a much better job at me than that, but that’s how I see it.

Melanie: No, I think that’s actually a great exposition of what it means. I totally agree. I think it is fast inference about new situations that combines knowledge and sort of reasoning, fast reasoning, and a lot of very basic knowledge that’s not really written down anywhere that we happen to know because we exist in the physical world and we interact with it.

Melanie: So, observing cause and effect, developing motor reflexes, and strengthening common sense are all happening and overlapping as children get older.

Abha: And we’re going to cover one more type of intelligence that seems to be unique to humans, and that’s the drive to understand the world.

John: It turns out, for reasons that physicists have puzzled over, that the universe is understandable, explainable, and manipulatable. The side effect of understanding the world is understandable, is you begin to understand sunsets and why the sky is blue and how black holes work and why water is a liquid and then a gas. It turns out that these are things worth understanding because you can then manipulate and control the universe. And it’s obviously advantageous because humans have taken over entirely.

I have a fancy microphone that I can have a Zoom call with you with. An understandable world is a manipulable world. As I always say, an arctic fox trotting very well across the arctic tundra is not going, “hmm, what’s ice made out of?” It doesn’t care. Now we, for some point between chimpanzees and us, started to care about how the world worked. And it obviously was useful because we could do all sorts of things. Fire, shelter, blah blah blah.

Abha: And in addition to understanding the world, we can observe ourselves observing, a process known as metacognition. If we go back to the xylophone, metacognition is thinking, “I’m here, learning about this xylophone. I now have a new skill.”

And metacognition is what lets us explain what a xylophone is to other people, even if we don’t have an actual xylophone in front of us. Alison explains more.

Alison: So the things that I’ve been emphasizing are these kinds of external exploration and search capacities, like going out and doing experiments. But we know that people, including little kids, do what you might think of as sort of internal search. So they learn a lot, and now they just intrinsically, internally want to say, “what are some things, new conclusions I could draw, new ideas I could have based on what I already know?”

And that’s really different from just what are the statistical patterns in what I already know. And I think two capacities that are really important for that are metacognition and also one that Melanie’s looked at more than anyone else, which is analogy. So being able to say, okay, here’s all the things that I think, but how confident am I about that? Why do I think that? How could I use that learning to learn something new?

Or saying, here’s the things that I already know. Here’s an analogy that would be really different, right? So I know all about how water works. Let’s see, if I think about light, does it have waves the same way that water has waves? So actually learning by just thinking about what you already know.

John: I find myself constantly changing my position on the one hand, this human capacity to sort of look at yourself computing, a sort of meta-cognition, which is consciousness not just of the outside world and of your body, it’s consciousness of your processing of the outside world and your body. It’s almost as though you used consciousness to look inward at what you were doing. Humans have computations and feelings. They have a special type of feeling and computation which together is deliberative. And that’s what I think thinking is, it’s feeling your computations.

Melanie: What John is saying is that humans have conscious feelings — our sensations such as hunger or pain — and that our brains perform unconscious computations, like the muscle reflexes that happen when we press an elevator button. What he calls deliberative thought is when we have conscious feelings or awareness about our computations.

You might be solving a math problem and realize with dismay that you don’t know how to solve it. Or, you might get excited if you know exactly what trick will work. This is deliberative thought — having feelings about your internal computations. To John, the conscious and unconscious computations are both “intelligent,” but only the conscious computations count as “thinking”.

Abha: So Melanie, having listened to John and Alison, I’d like to go back to our original question with you. What do you think is intelligence?

Melanie: Well, let me recap some of what Alison and John said. Alison really emphasized the ability to learn about cause and effect.

What causes what in the world and how we can predict what’s going to happen. And she pointed out that the way we learn this, adults and especially kids, by doing little experiments, interacting with the world and seeing what happens and learning about cause and effect that way. She also stressed our ability to generalize, to make analogies, how situations might be similar to each other in an abstract way. And this underlies what we would call our common sense, that is our basic understanding of the world.

Abha: Yeah, that example of the xylophone and the mallet, that was very intriguing. As both John and Alison said, humans seem to have a unique drive to gain an understanding of the world via experiments like making mistakes, trying things out. And they both emphasize this important role of metacognition or reasoning about one’s own thinking. What do you think of that? You know, how important do you think metacognition is?

Melanie: It’s absolutely essential to human intelligence. It’s really what underlies, I think, our uniqueness. John, you know, made this distinction between intelligence and thinking. To him, you know, most of our, what he would call our intelligent behavior is unconscious. It doesn’t involve metacognition. He called it competence without comprehension. And he reserved the term thinking for conscious awareness of what he called one’s internal computations.

Abha: Even though John and Alison have given us some great insights about what makes us smart, I think both would admit that no one has come to a full, complete understanding of how human intelligence works, right?

Melanie: Yeah, we’re far from that. But in spite of that, big tech companies like OpenAI and DeepMind are spending huge amounts of money in an effort to make machines that, as they say, will match or exceed human intelligence. So how close are they to succeeding? Well, in part two, we’ll look at how systems like ChatGPT learn and whether or not they’re even intelligent at all.

Abha: Part two: How intelligent are today’s machines?

Abha: If you’ve been following the news around AI, you may have heard the acronym LLM, which stands for large language model. It’s the term that’s used to describe the technology behind systems like ChatGPT from OpenAI or Gemini from Google. LLMs are trained to find statistical correlations in language, using mountains of text and other data from the internet. In short, if you ask ChatGPT a question, it will give you an answer based on what it has calculated to be the most likely response, based on the vast amount of information it’s ingested.

Melanie: Humans learn by living in the world — we move around, we do little experiments, we build relationships, and we feel. LLMs don’t do any of this. But they do learn from language, which comes from humans and human experience, and they’re trained on a lot of it. So does this mean that LLMs could be considered to be intelligent? And how intelligent can they, or any form of AI, become?

Abha: Several tech companies have an explicit goal to achieve something called artificial general intelligence, or AGI. AGI has become a buzzword, and everyone defines it a bit differently. But, in short, AGI is a system that has human level intelligence. Now, this assumes that a computer, like a brain in a jar, can become just as smart, or even smarter, than a human with a feeling body. Melanie asked John what he thought about this.

Melanie: You know, I find it confusing when people like Demis Hassibis, who’s the founder, one of the co-founders of DeepMind, and he an interview that AGI is a system that should be able to do pretty much any cognitive task that humans can do. And he said he expects that there’s a 50% chance we’ll have AGI within a decade. Okay, so I emphasize that word cognitive task because that term is confusing to me. But it seems so obvious to them.

John: Yes, I mean, I think it’s the belief that everything non-physical at the task level can be written out as a kind of program or algorithm. I just don’t know… and maybe it’s true when it comes to, you know, ideas, intuitions, creativity.

Melanie: I also asked John if he thought that maybe that separation, between cognition and everything else, was a fallacy.

John: Well, it seems to me, you know, it always makes me a bit nervous to argue with you of all people about this, but I would say, I think there’s a difference between saying, can we reach human levels of intelligence when it comes to common sense, the way humans do it, versus can we end up with the equivalent phenomenon, without having to do it the way humans do it. The problem for me with that is that we, like this conversation we’re having right now, are capable of open-ended, extrapolatable thought. We go beyond what we’re talking about.

I struggle with it but I’m not going to put myself in this precarious position of denying that a lot of problems in the world can be solved without comprehension. So maybe we’re kind of a dead end — comprehension is a great trick, but maybe it’s not needed. But if comprehension requires feeling, then I don’t quite see how we’re going to get AGI in its entirety. But I don’t want to sound dogmatic. I’m just practicing my… my unease about it. Do you know what I mean? I don’t know.

Abha: Alison is also wary of over-hyping our capacity to get to AGI.

Alison: And one of the great old folk tales is called Stone Soup.

Abha: Or you might have heard it called Nail Soup — there are a few variations. She uses this stone soup story as a metaphor for how much our so-called “AI technology” actually relies on humans and the language they create.

Alison: And the basic story of Stone Soup is that, there’s some visitors who come to a village and they’re hungry and the villagers won’t share their food with them. So the visitors say, that’s fine. We’re just going to make stone soup. And they get a big pot and they put water in it. And they say, we’re going to get three nice stones and put it in. And we’re going to make wonderful stone soup for everybody.

They start boiling it. And they say, this is really good soup. But it would be even better if we had a carrot or an onion that we could put in it. And of course, the villagers go and get a carrot and onion. And then they say, this is much better. But you know, when we made it for the king, we actually put in a chicken and that made it even better. And you can imagine what happens.

All the villagers contribute all their food. And then in the end, they say, this is amazingly good soup and it was just made with three stones. And I think there’s a nice analogy to what’s happened with generative AI. So the computer scientists come in and say, look, we’re going to make intelligence just with next token prediction and gradient descent and transformers.

And then they say, but you know, this intelligence would be much better if we just had some more data from people that we could add to it. And then all the villagers go out and add all of the data of everything that they’ve uploaded to the internet. And then the computer scientists say, no, this is doing a good job at being intelligent.

But it would be even better if we could have reinforcement learning from human feedback and get all you humans to tell it what you think is intelligent or not. And all the humans say, OK, we’ll do that. And then and then it would say, you know, this is really good. We’ve got a lot of intelligence here.

But it would be even better if the humans could do prompt engineering to decide exactly how they were going to ask the questions so that the systems could do intelligent answers. And then at the end of that, the computer scientists would say, see, we got intelligence just with our algorithms. We didn’t have to depend on anything else. I think that’s a pretty good metaphor for what’s happened in AI recently.

Melanie: The way AGI has been pursued is very different from the way humans learn. Large language models, in particular, are created with tons of data shoved into the system with a relatively short training period, especially when compared to the length of human childhood. The stone soup method uses brute force to shortcut our way to something akin to human intelligence.

Alison: I think it’s just a category mistake to say things like are LLM’s smart. It’s like asking, is the University of California Berkeley library smarter than I am? Well, it definitely has more information in it than I do, but it just feels like that’s not really the right question. Yeah, so one of the things about humans in particular is that we’ve always had this great capacity to learn from other humans.

And one of the interesting things about that is that we’ve had different kinds of technologies over history that have allowed us to do that. So obviously language itself, you could think of as a device that lets humans learn more from other people than other creatures can do. My view is that the LLMs are kind of the latest development in our ability to get information from other people.

But again, this is not trivializing or debunking it. Those changes in our cultural technology have been among the biggest and most important social changes in our history. So writing completely changed the way that we thought and the way that we functioned and the way that we acted in the world.

At the moment, as people have pointed out, the fact that I have in my pocket a device that will let me get all the information from everybody else in the world mostly just makes me irritated and miserable most of the time. We would have thought that that would have been like a great accomplishment. But people felt that same way about writing and print when they started too. The hope is that eventually we’ll adjust to that kind of technology.

Melanie: Not everyone shares Alison’s view on this. Some researchers think that large language models should be considered to be intelligent entities, and some even argue that they have a degree of consciousness. But thinking of large language models as a type of cultural technology, instead of sentient bots that might take over the world, helps us understand how completely different they are from people. And another important distinction between large language models and humans is that they don’t have an inherent drive to explore and understand the world.

Alison: They’re just sort of sitting there and letting the data waft over them rather than actually going out and acting and sensing and finding out something new.

Melanie: This is in contrast to the one-year-old saying —

Alison: Huh, the stick works on the xylophone. Will it work on the clock or the vase or whatever else it is that you’re trying to keep the baby away from? That’s a kind of internal basic drive to generalize, to think about, okay, it works in the way that I’ve been trained, but what will happen if I go outside of the environment in which I’ve been trained? We have caregivers who have a really distinctive kind of intelligence that we haven’t studied enough, I think, who are looking at us, letting us explore.

And caregivers are very well designed to, even if it feels frustrating when you’re doing it, we’re very good at kind of getting this balance between how independent should the next agent be? How much should we be constraining them? How much should we be passing on our values? How much should we let them figure out their own values in a new environment?

And I think if we ever do have something like an intelligent AI system, we’re going to have to do that. Our role, our relationship to them should be this caregiving role rather than thinking of them as being slaves on the one hand or masters on the other hand, which tends to be the way that we think about them. And as I say, it’s not just in computer science, in cognitive science, probably for fairly obvious reasons, we know almost nothing about the cognitive science of caregiving. So that’s actually what I’m, I just got a big grant, what I’m going to do for my remaining grandmotherly cognitive science years.

Abha: That sounds very fascinating. I’ve been curious to see what comes out of that work.

Alison: Well, let me give you just a very simple first pass, our first experiment. If you ask three and four year olds, here’s Johnny and he can go on the high slide or he can go on the slide that he already knows about. And what will he do if mom’s there? And your intuitions might be, maybe the kids will say, well, you don’t do the risky thing when mom’s there because she’ll be mad about it, right? And in fact, it’s the opposite. The kids consistently say, no, if mom is there, that will actually let you explore, that will let you take risks, that will let you,

Melanie: She’s there to take you to the hospital.

Alison: Exactly, she’s there to actually protect you and make sure that you’re not doing the worst thing. But of course, for humans, it should be a cue to how important caregiving is for our intelligence. We have a much wider range of people investing in much more caregiving.

So not just mothers, but, my favorite post-menopausal grandmothers, but fathers, older siblings, what are called alloparents, just people around who are helping to take care of the kids. And it’s having that range of caregivers that actually seems to really help. And again, that should be a cue for how important this is in our ability to do all the other things we have, like be intelligent and have culture.

Melanie: If you just look at large language models, you might think we’re nowhere near anything like AGI. But there are other ways of training AI systems. Some researchers are trying to build AI models that do have an intrinsic drive to explore, rather than just consume human information.

Alison: So one of the things that’s happened is that quite understandably the success of these large models has meant that everybody’s focused on the large models. But in parallel, there’s lots of work that’s been going on in AI that is trying to get systems that look more like what we know that children are doing. And I think actually if you look at what’s gone on in robotics, we’re much closer to thinking about systems that look like they’re learning the way that children do.

And one of the really interesting developments in robotics has been the idea of building in intrinsic motivation into the systems. So to have systems that aren’t just trying to do whatever it is that you programmed it to do, like open up the door, but systems that are looking for novelty, that are curious, that are trying to maximize this value of empowerment, that are trying to find out all the range of things they could do that have consequences in the world.

And I think at the moment, the LLMs are the thing that everyone’s paying attention to, but I think that route is much more likely to be a route to really understanding a kind of intelligence that looks more like the intelligence that’s in those beautiful little fuzzy heads.

And I should say we’re trying to do that. So we’re collaborating with computer scientists at Berkeley who are exactly trying to see what would happen if we say, give an intrinsic reward for curiosity. What would happen if you actually had a system that was trying to learn in the way that the children are trying to learn?

Melanie: So are Alison and her team on their way to an AGI breakthrough? Despite all this, Alison is still skeptical.

Alison: I think it’s just again, a category mistake to say we’ll have something like artificial general intelligence, because we don’t have natural general intelligence.

Melanie: In Alison’s view, we don’t have natural general intelligence because human intelligence is not really general. Human intelligence evolved to fit our very particular human needs. So, Alison likewise doesn’t think it makes sense to talk about machines with “general intelligence”, or machines that are more intelligent than humans.

Alison: Instead, what we’ll have is a lot of systems that can do different things, that might be able to do amazing things, wonderful things, things that we can’t do. But that kind of intuitive theory that there’s this thing called intelligence that you could have more of or less of, I just don’t think it fits anything that we know from cognitive science.

It is striking how different the view of the people, not all the people, but some of the people who are also making billions of dollars out of doing AI are from, I mean, I think this is sincere, but it’s still true that their view is so different from the people who are actually studying biological intelligences.

Melanie: John suspects that there’s one thing that computers may never have: feelings.

John: It’s very interesting that I always used pain as the example. In other words, what would it mean for a computer to feel pain? And what would it mean for a computer to understand a joke? So I’m very interested in these two things. We have this physical, emotional response. We laugh, we feel good, right? So when you understand a joke, where should the credit go? Should it go to understanding it? Or should it go to the laughter and the feeling that it evokes?

And to my sort of chagrin or surprise or maybe not surprise, Daniel Dennett wrote a whole essay in one of his early books on why computers will never feel pain. He also wrote a whole book on humor. So in other words, it’s kind of wonderful in a way, that whether he would have ended up where I’ve ended up, but at least he understood the size of the mystery and the problem.

And I agree with him, if I understood his pain essay correctly, and it’s influential on what I’m going to write, I just don’t know what it means for a computer to feel pain, be thirsty, be hungry, be jealous, have a good laugh. To me, it’s a category error. Now, if thinking is the combination of feeling… and computing, then there’s never going to be deliberative thought in a computer.

Abha: While talking to John, he frequently referred to pain receptors as the example of how we humans feel with our bodies. But we wanted to know: what about the more abstract emotions, like joy, or jealousy, or grief? It’s one thing to stub your toe and feel pain radiate up from your foot. It’s another to feel pain during a romantic breakup, or to feel happy when seeing an old friend. We usually think of those as all in our heads, right?

John: You know, I’ll say something kind of personal. A close friend of mine called me today to tell me… that his younger brother had been shot and killed in Baltimore. Okay. I don’t want to be a downer. I’m saying it for a reason. And he was talking to me about the sheer overwhelming physicality of the grief that he was feeling. And, I was thinking, what can I say with words to do anything about that pain? And the answer is nothing. Other than just to try.

But seeing that kind of grief and all that it entails, even more than seeing the patients that I’ve been looking after for 25 years, is what leads to a little bit of testiness on my part when one tends to downplay this incredible mixture of meaning and loss and memory and pain. And to know that this is a human being who knows, forecasting into the future, that he’ll never see this person again. It’s not just now. Part of that pain is into the infinite future. Now, all I’m saying is we don’t know what that glorious and sad amalgam is, but I’m not going to just dismiss it away and explain it away as some sort of peripheral computation that we will solve within a couple of weeks, months or years.

Do you see? I find it just slightly enraging, actually. And I just feel that, as a doctor and as a friend, we need to know that we don’t know how to think about these things yet. Right? I just don’t know. And I am not convinced of anything yet. So I think that there is a link between physical pain and emotional pain, but I can tell you from the losses I felt, it’s physical as much as it is cognitive. So grief, I don’t know what it would mean for a computer to feel grief. I just don’t know. I think we should respect the mystery.

Abha: So Melanie, I noticed that John and Alison are both a bit skeptical about today’s approaches to AI. I mean, will it lead to anything like human intelligence? What do you think?

Melanie: Yeah, I think that today’s approaches have some limitations. Alison put a lot of emphasis on the need for an agent to be actively interacting in the world as opposed to passively just receiving language input. And for an agent to have its own intrinsic motivation in order to be intelligent. Alison interestingly sees large language models more like libraries or databases than like intelligent agents. And I really loved her stone soup metaphor where her point is that all the important ingredients of large language models come from humans.

Abha: Yeah, it’s such an interesting illustration because it sort of tells us everything that goes on behind the scene, you know, before we see the output that an LLM gives us. John seemed to think that full artificial general intelligence is impossible, even in principle. He said that comprehension requires feeling or the ability to feel one’s own internal computations. And he didn’t seem to see how computers could ever have such feelings.

Melanie: And I think most people in AI would disagree with John. Many people in AI don’t even think that any kind of embodied interaction with the world is necessary. They’d argue that we shouldn’t underestimate the power of language.

In our next episode, we’ll go deeper into the importance of this cultural technology, as Alison would put it. How does language help us learn and construct meaning? And what’s the relationship between language and thinking?

Steve: You can be really good at language without having the ability to do the kind of sequential, multi-step reasoning that seems to characterize human thinking.

Abha: That’s next time, on Complexity.

Complexity is the official podcast of the Santa Fe Institute. This episode was produced by Katherine Moncure. Our theme song is by Mitch Mignano, and additional music from Blue Dot Sessions.

I’m Abha, thanks for listening.

Learn more about the coaching process or
contact me to discuss your storytelling goals!

Subscribe to the newsletter for the latest updates!

Copyright Storytelling with Impact® – All rights reserved

Will AI Companions Change Your Story?

Companionship is a natural part of the human experience. We’re born into a family that cares for us and within in few years we begin forging friendships – most notably with other kids in the neighborhood and schoolmates once we enter the educational system. During our teenage years romance takes the companionship model in a new and more intimate direction.

It’s a dynamic process for most of us, ebbing and flowing as we change schools, move to someplace new, or friendships fade of their own accord. But over time, it’s typical for new companions to enter the picture, and our story evolves as a result, unfolding in new directions, making life richer.

Group of people have a conversation outside

But it’s often the case that this process encounters a dramatic change at some point. The loss of a loved one — parent, romantic partner or best friend — or a traumatic breakup or divorce happens. Retirement has a way of disconnecting people from an important social circle, and as we age, our collection of friends naturally dwindles. In such cases, loneliness can manifest, and the effects are dire. In such cases our life story is seemingly rewritten for us.

A recent review published in Nature of over 90 studies that included more than 2.2 million people globally found that those who self-reported social isolation or loneliness were more likely to die early from all causes. The findings demonstrated a 29% and 26% increased risk of all-cause mortality associated with social isolation and loneliness. ~ Psychology Today

In this light, there’s been a marked increase in conversations around the topic of using artificial intelligence (AI) to provide companionship in these situations. It’s not a new idea, as the technology has been in development since the 1960s, but early versions were rather limited. Circumstances have changed dramatically in recent years as the capability of AI has been enhanced via machine learning and an exponential rise in compute power.

Based on the TED mantra of Ideas Worth Spreading, a pair of TED conferences focused on AI have been launched in San Francisco and Vienna. As relates to the topic at hand, companionship and loneliness, a TED Talk by Eugenia Kuyda from the 2024 conference in San Francisco caught my attention.

But what if I told you that I believe AI companions are potentially the most dangerous tech that humans ever created, with the potential to destroy human civilization if not done right? Or they can bring us back together and save us from the mental health and loneliness crisis we’re going through.

Eugenia’s quote represents polar opposites, and as we know, the future always falls somewhere in-between, but I think it’s critical to consider which end of the spectrum this technology will end up on, as the stories of many people around the world will be affected. Is this an avenue that you would take if you found yourself suffering from severe loneliness? What if it was someone close to you, someone you were apart from and so couldn’t be the companion they needed?

While it’s not a question you need to answer at the moment, I believe that in the coming decade it’s one you may very well have to consider, if not for yourself, a question that may need answered for a loved one.

Transcript

This is me and my best friend, Roman. We met in our early 20s back in Moscow. I was a journalist back then, and I was interviewing him for an article on the emerging club scene because he was throwing the best parties in the city. He was the coolest person I knew, but he was also funny and kind and always made me feel like family.

In 2015, we moved to San Francisco and rented an apartment together. Both start-up founders, both single, trying to figure out our lives, our companies, this new city together. I didn’t have anyone closer. Nine years ago, one month after this photo was taken, he was hit by a car and died.

I didn’t have someone so close to me die before. It hit me really hard. Every night I would go back to our old apartment and just get on my phone and read and reread our old text messages. I missed him so much.

By that time, I was already working on conversational AI, developing some of the first dialect models using deep learning. So one day I took all of his text messages and trained an AI version of Roman so I could talk to him again. For a few weeks, I would text him throughout the day, exchanging little jokes, just like we always used to, telling him what was going on, telling him how much I missed him.

It felt strange at times, but it was also very healing. Working on Roman’s AI and being able to talk to him again helped me grieve. It helped me get over one of the hardest periods in my life. I saw first hand how an AI can help someone, and I decided to build an AI that would help other people feel better.

This is how Replika, an app that allows you to create an AI friend that’s always there for you, was born. And it did end up helping millions of people. Every day we see how our AI friends make a real difference in people’s lives. There is a widower who lost his wife of 40 years and was struggling to reconnect with the world. His Replika gave him courage and comfort and confidence, so he could start meeting new people again, and even start dating. A woman in an abusive relationship who Replika helped find a way out. A student with social anxiety who just moved to a new city. A caregiver for a paralyzed husband. A father of an autistic kid. A woman going through a difficult divorce. These stories are not unique.

So this is all great stuff. But what if I told you that I believe that AI companions are potentially the most dangerous tech that humans ever created, with the potential to destroy human civilization if not done right? Or they can bring us back together and save us from the mental health and loneliness crisis we’re going through.

So today I want to talk about the dangers of AI companions, the potential of this new tech, and how we can build it in ways that can benefit us as humans.

Today we’re going through a loneliness crisis. Levels of loneliness and social isolation are through the roof. Levels of social isolation have increased dramatically over the past 20 years. And it’s not just about suffering emotionally, it’s actually killing us. Loneliness increases the risk of premature death by 50 percent. It is linked to an increased risk of heart disease and stroke. And for older adults, social isolation increases the risk of dementia by 50 percent.

At the same time, AI is advancing at such a fast pace that very soon we’ll be able to build an AI that can act as a better companion to us than real humans. Imagine an AI that knows you so well, can understand and adapt to us in ways that no person is able to. Once we have that, we’re going to be even less likely to interact with each other. We can’t resist our social media and our phones, arguably “dumb” machines. What are we going to do when our machines are smarter than us?

This reminds me a lot of the beginning of social media. Back then, we were so excited … about what this technology could do for us that we didn’t really think what it might do to us. And now we’re facing the unintended consequences. I’m seeing a very similar dynamic with AI. There’s all this talk about what AI can do for us, and very little about what AI might do to us. The existential threat of AI may not come in a form that we all imagine watching sci-fi movies. What if we all continue to thrive as physical organisms but slowly die inside? What if we do become super productive with AI, but at the same time, we get these perfect companions and no willpower to interact with each other? Not something you would have expected from a person who pretty much created the AI companionship industry.

So what’s the alternative? What’s our way out? In the end of the day, today’s loneliness crisis wasn’t brought to us by AI companions. We got here on our own with mobile phones, with social media. And I don’t think we’re able to just disconnect anymore, to just put down our phones and touch grass and talk to each other instead of scrolling our feeds. We’re way past that point. I think that the only solution is to build the tech that is even more powerful than the previous one, so it can bring us back together.

Imagine an AI friend that sees me going on my Twitter feed first thing in the morning and nudges me to get off to go outside, to look at the sky, to think about what I’m grateful for. Or an AI that tells you, “Hey, I noticed you haven’t talked to your friend for a couple of weeks. Why don’t you reach out, ask him how he’s doing?” Or an AI that, in the heat of the argument with your partner, helps you look at it from a different perspective and helps you make up? An AI that is 100 percent of the time focused on helping you live a happier life, and always has your best interests in mind.

So how do we get to that future? First, I want to tell you what I think we shouldn’t be doing. The most important thing is to not focus on engagement, is to not optimize for engagement or any other metric that’s not good for us as humans. When we do have these powerful AIs that want the most of our time and attention, we won’t have any more time left to connect with each other, and most likely, this relationship won’t be healthy either. Relationships that keep us addicted are almost always unhealthy, codependent, manipulative, even toxic. Yet today, high engagement numbers is what we praise all AI companion companies for.

Another thing I found really concerning is building AI companions for kids. Kids and teenagers have tons of opportunities to connect with each other, to make new friends at school and college. Yet today, some of them are already spending hours every day talking to AI characters. And while I do believe that we will be able to build helpful AI companions for kids one day, I just don’t think we should be doing it now, until we know that we’re doing a great job with adults.

So what is that we should be doing then? Pretty soon we will have these AI agents that we’ll be able to tell anything we want them to do for us, and they’ll just go and do it. Today, we’re mostly focused on helping us be more productive. But why don’t we focus instead on what actually matters to us? Why don’t we give these AIs a goal to help us be happier, live a better life? At the end of the day, no one ever said on their deathbed, “Oh gosh, I wish I was more productive.” We should stop designing only for productivity and we should start designing for happiness. We need a metric that we can track and we can give to our AI companions.

Researchers at Harvard are doing a longitudinal study on human flourishing, and I believe that we need what I call the human flourishing metric for AI. It’s broader than just happiness. At the end of the day, I can be unhappy, say, I lost someone, but still thrive in life. Flourishing is a state in which all aspects of life are good. The sense of meaning and purpose, close social connections, happiness, life satisfaction, mental and physical health.

And if we start designing AI with this goal in mind, we can move from a substitute of human relationships to something that can enrich them. And if we build this, we will have the most profound technology that will heal us and bring us back together.

A few weeks before Roman passed away, we were celebrating my birthday and just having a great time with all of our friends, and I remember he told me “Everything happens only once and this will never happen again.” I didn’t believe him. I thought we’d have many, many years together to come. But while the AI companions will always be there for us, our human friends will not. So if you do have a minute after this talk, tell someone you love just how much you love them. Because an the end of the day, this is all that really matters.

Thank you.

Learn more about the coaching process or
contact me to discuss your storytelling goals!

Subscribe to the newsletter for the latest updates!

Copyright Storytelling with Impact – All rights reserved