Curbing AI creativity: How to provide effective guardrails

AI models can generate astonishingly creative content. However, their outputs can become cliched, unpredictable, and problematic without proper guardrails. How can we harness their potential while maintaining control? In this article, we’ll show you what you can do to provide guardrails for your AI chatbot. Thanks to these techniques, you can ensure its creative outputs align with your specific needs and objectives.

Understanding the need for guardrails

As AI continues to evolve, so do its capabilities to generate creative content. Generative AI can do everything, from writing articles and creating marketing copy to composing music and generating artwork. However, this comes with great responsibilities. Unchecked creativity in AI can lead to various challenges and risks. It’s very important to implement guardrails.

What is AI creativity?

Generative AI refers to the ability of models to generate new content. This can include text, images, music, and other forms of media. AI models like GPT-4, for instance, can write poetry, draft emails, create fictional stories, and even generate code. At Yoast, we use it to power the AI title and meta description generator in Yoast SEO. There are various ways to determine how creative the chatbot or AI system can get while generating that content. For instance, various AI tools like Copilot and Gemini have options to make the output more or less adventurous.

Where AI gets its creativity from

AI models, particularly Large Language Models (LLMs) like GPT-4, exhibit creativity through their ability to generate content. But where does this creativity come from? The answer lies at the intersection of training data, deep learning architectures, and fine-tuned parameters.

Diverse training data

The foundation of AI creativity is the huge datasets used during training. These datasets contain a range of text sources, including books, articles, websites, and other forms of written content. Exposure to a wide variety helps the model learn patterns, styles, and contextual nuances across different genres and topics. Diversity helps AI generate content that is not only coherent but also varied and imaginative.

Deep neural networks

At the heart of LLMs are deep neural networks, specifically transformer architectures. These consist of multiple layers of attention mechanisms. These layers allow the model to understand and generate complex language structures by focusing on the relationships between words and their context. With billions of parameters fine-tuned during training, these models can produce human-like text that mirrors the creativity found in their training data.

Predictive text generation

LLMs’ predictive text generation capabilities also drive creativity. The models generate text one token (word or subword) at a time, predicting the next token based on the preceding context. This token-by-token generation, influenced by probability distributions, allows the AI to craft coherent and contextually relevant content that can surprise and engage readers.

Influence of parameters

Parameters like temperature and top_p are crucial in modulating the model’s output. Temperature controls the randomness of predictions, with higher values leading to more diverse and “creative” outputs, while lower values result in more deterministic and focused text. Top_p, or nucleus sampling, controls the diversity of the output by sampling from a subset of probable tokens. By fine-tuning these parameters, users can balance creativity with coherence — more on this later. These are helpful tools to guide the AI in producing content that meets your needs.

Pattern recognition and replication

Ultimately, the AI’s creativity stems from its ability to recognize and replicate patterns from its training data. By mimicking the linguistic and stylistic patterns it has learned, the model can generate content that feels original and inspired. This pattern recognition allows LLMs to compose poetry, write stories, create marketing copy, and generate artistic descriptions that resonate with human creativity.

AI creativity is a product of training on diverse datasets, neural network architectures, and calibrated parameters. Understanding these components helps harness AI’s creativity while ensuring the content aligns with your objectives.

Human creativity vs. AI creativity

Various forms of creativity often produce similar outputs but from very different backgrounds. Human creativity is rooted in personal experiences, emotions, and conscious thought. This allows people to create art, literature, and innovations that resonate emotionally and culturally. It involves intuition, inspiration, and the ability to make abstract connections that are uniquely human.

In contrast, AI creativity consists of processing data and recognizing patterns within that data. AI generates new content based on learned patterns and statistical probabilities, not personal experiences or emotions. While AI can mimic human creativity and make coherent and relevant content, it lacks human understanding and emotional depth. Fusing human and AI creativity can lead to interesting results, but it’s crucial to recognize and appreciate each’s distinct nature.

Letting the AI run wild

While AI’s creative capabilities are impressive, they come with inherent risks. With proper guardrails, the outputs can become predictable and manageable.

AI can produce off-topic, irrelevant, or even inappropriate content without proper constraints. As a result, businesses and content creators might get hurt. For instance, an AI writing tool might generate marketing copy that is in the wrong tone or even offensive, which can damage a brand’s reputation.

Controlled creativity can generate content that aligns differently with the brand’s voice or message. The end goal, of course, is clarity and consistency.

Guardrails are critical for generative AI

Given these risks, it’s clear that guardrails help control AI’s creative potential. Here’s why guardrails are crucial:

  • Maintaining relevance and focus:
    • Guardrails help keep the AI’s outputs focused on the intended topic, preventing deviations that can dilute the message.
  • Ensuring appropriateness:
    • Guardrails protect your brand’s reputation and ensure that the content suits your audience by filtering out inappropriate or offensive content.
  • Aligning with brand voice:
    • Guardrails ensure that AI-generated content is consistent with your brand’s voice and tone, maintaining coherence in your messaging.
  • Enhancing credibility:
    • By preventing factual inaccuracies, guardrails enhance the credibility and reliability of AI-generated content, especially in fields that require precision.
  • Optimizing user experience:
    • Well-implemented guardrails contribute to a better user experience by ensuring the content is engaging, relevant, and valuable to the audience.

The following sections will explore practical techniques for providing these guardrails to manage AI creativity effectively.

Techniques for providing guardrails

Effective guardrails for AI are strategies that can help control the output, ensuring it meets specific requirements and aligns with your objectives.

Keyword filtering

Without limiting what the LLM does, it loves to come up with sentences/words like: “In the ever-evolving landscape of…” and “As we stand on the cusp of this new era, the possibilities are as limitless as our imagination.” It uses long-winded sentences with very expressive language, full of cliches. You can curb this by limiting the words or expressions it can use.

Keyword filtering involves setting up filters to exclude specific words, phrases, or types of content deemed inappropriate, irrelevant, or not aligned with your brand’s voice. This technique is useful for maintaining content suitability and relevance.

It’s not hard to implement:

  • Identify keywords: List words or phrases that should be excluded. This can include offensive language, jargon, or off-topic terms.
  • Set up filters: Use AI tools that support keyword filtering. Configure these tools to flag or exclude content containing the identified keywords.
  • Continuous monitoring: Regularly update the list of keywords based on feedback and new requirements.

Try this as an experiment. You’ll notice it’s fairly easy to influence what chatbots use and don’t use.

Write a short piece on the future of content creation with generative AI. Don't use the following words:

Buckle up
Delve
Dive
Elevate
Embark
Embrace
Explore
Discover
Demystified

but do use:

Unleash
Unlocked
Unveiled
Beacon
Bombastic
Competitive digital world

You can also make this process more proficient and scalable using APIs to communicate with LLMs and chatbots.

Prompt engineering

Prompt engineering involves writing prompts to guide the AI in generating content that meets the criteria. Leo S. Lo from the University of New Mexico developed the CLEAR method (context, limitations, examples, audience, requirements), an effective approach to prompt engineering. Of course, there are plenty of other ways to write great prompts for your content.

A practical example of using the CLEAR framework

Imagine we are creating content for a travel blog. Using the CLEAR framework, we devised the following prompt to inspire the AI chatbot to create a blog post about Kyoto, Japan.

Prompt: “Describe a day in the life of a local in Kyoto, Japan. Focus on their morning routine, interactions with neighbors, and favorite spots in the city. Use a descriptive and engaging tone to captivate travel enthusiasts. Include at least two historical landmarks and one local cuisine.”

  1. Clear: The instructions are straightforward to understand. We specifically ask for a description of a day in the life of a local in Kyoto, including particular elements like their morning routine, interactions, and favorite spots.
  2. Logical: The prompt is logically structured. It begins with a general description of a day in the life and then narrows down to specific details such as the morning routine, interactions with neighbors, and favorite spots. This logical flow helps generate a coherent and comprehensive piece of content.
  3. Engaging: The tone is described as “descriptive and engaging,” which is crucial for captivating travel enthusiasts. The prompt invites the writer to create a vivid and relatable narrative by focusing on personal interactions and favorite spots.
  4. Accurate: The prompt asks for at least two historical landmarks and one local cuisine. This ensures that the description is rooted in Kyoto’s actual cultural and historical elements.
  5. Relevant: The topic is highly relevant to travel enthusiasts interested in different places’ cultural and daily life aspects. The prompt taps into a subject of high interest by focusing on Kyoto, a city known for its rich history and cultural landmarks.
Enhanced prompt

To refine it even further, you can add a few more specific guidelines to enhance clarity and completeness:

“Describe a day in the life of a local in Kyoto, Japan. Focus on their morning routine, interactions with neighbors, and favorite spots in the city. Use a descriptive and engaging tone to captivate travel enthusiasts. Include at least two historical landmarks (e.g., Kinkaku-ji, Fushimi Inari Taisha) and one local cuisine (e.g., yudofu, kaiseki). Ensure the narrative captures the essence of Kyoto’s culture and daily life.”

Why these improvements work:
  • Clear: Specific examples such as Kinkaku-ji and yudofu provide clarity.
  • Logical: The flow from morning routine to interactions and favorite spots remains logical.
  • Engaging: The descriptive and engaging tone is maintained.
  • Accurate: Named landmarks and cuisines ensure accuracy.
  • Relevant: Provides a detailed, culturally rich experience relevant to travel enthusiasts.

Now, the prompt is well-crafted and aligns with the CLEAR framework, and the enhanced version provides additional guidance and specificity.

Template usage

Templates provide a structured framework the AI chatbot can follow, ensuring consistency and completeness in the generated content. Templates can be particularly useful for recurring content types like blog posts, reports, product descriptions, etc. Using templates, you can maintain a uniform structure across different pieces of content. As a result, all necessary elements are included and appropriately organized.

  • Identify common content types: Determine the types of content you frequently generate, such as blog posts, product descriptions, social media posts, etc.
  • Create templates: Develop templates for each content type. These templates should include sections and prompts for each part of the content.
  • Provide clear instructions: Include detailed instructions within each template section to guide the AI. This can involve specifying the tone, style, length, and key points to cover.
  • Consistent use: Use these templates consistently to maintain uniformity across all generated content. Review and update the templates regularly to reflect new requirements or insights.

Parameter tuning

Adjusting parameters like temperature and top_p can control the randomness and creativity of the AI’s output. This might seem like it controls creativity, but that’s not actually the case. Instead, it fine-tunes how the model balances creativity with coherence. Temperature affects the variability of the generated content, while top_p controls the diversity by sampling from a subset of probable tokens.

Understanding temperature and top_p in LLMs

Imagine you’re baking cookies, and you want to experiment with different flavors. You have a big jar of various ingredients (chocolate chips, nuts, dried fruits, etc.), and you can either stick to the classic recipe or get a bit adventurous.

Temperature:
Think of temperature as the level of adventurousness in your cookie recipe.

  • Low temperature (e.g., 0.2): You’re playing it safe. You mostly stick to the classic ingredients like chocolate chips and maybe a few nuts. Your cookies are predictable but reliably good.
  • High temperature (e.g., 0.8): You’re feeling adventurous! You start throwing in various ingredients, like mango bits, chili flakes, and marshmallows. The cookies are more unpredictable — some might be amazing, while others might be too wild.

In AI text generation, a lower temperature means the model plays it safe and chooses more predictable words. A higher temperature allows for more creativity and variety but with the risk of less coherence.

Top_p (Nucleus sampling):
Now, imagine you have a friend who helps you pick the ingredients. Top_p is like telling your friend only to consider the most popular ingredients but with a twist.

  • Low top_p (e.g., 0.1): Your friend only picks the top 10% of frequently used ingredients. You end up with a very standard and safe mix.
  • High top_p (e.g., 0.9): Your friend considers a wider variety of ingredients, maybe the top 90%. This allows for more interesting and diverse combinations but still within a reasonable limit, so the cookies don’t turn out too strange.

In AI text generation, a lower top_p value means the model selects from a smaller set of high-probability words. This makes the output more predictable. A higher top_p value lets the model choose from a larger set of words, increasing the output’s diversity and “creativity” while maintaining coherence.

Adjusting temperature and top_p controls how adventurous or safe the AI is in generating text. This is much like how you control the ingredients in your cookie recipe.

A misconception

As we’ve mentioned, the temperature and top_p control the randomness and diversity of AI-generated text. However, they do not create or improve creativity. Instead, they manage how the AI explores different word choices. True creativity in AI comes from the model’s ability to generate new content based on the patterns it has learned from its training data.

Experimenting with and fine-tuning these parameters helps you guide the AI. These tools help it produce imaginative and relevant content without veering off into incoherence or irrelevance.

Generative AI tools like TypingMind let you carefully control the performance of various language models

Combining techniques

Combining the above techniques can provide a more robust framework for controlling AI creativity. Each technique complements the others, creating a comprehensive system of guardrails.

An integrated approach combines keyword filtering, prompt engineering, template usage, and parameter tuning to create a multi-layered control system. You can support this using a feedback loop that considers all aspects of the content generation process, from initial prompts to final outputs.

Conclusion to creativity in AI

It’s important to maintain control while still harnessing AI’s creative potential. Use guardrails such as keyword filtering, prompt engineering with frameworks, template usage, and parameter tuning to help the AI produce relevant, high-quality content that aligns with your objectives.

Remember that parameters like temperature and top_p do not define creativity; they merely influence the randomness and diversity of the output. True creativity in AI is limited and cannot be replicated without outside help from real people.

With some help from these techniques, we can purposefully use generative AI’s creative capabilities. Whether generating blog posts, marketing copy, or educational content, these strategies help the AI to add value and meet desired standards.

Coming up next!


3 Responses to Curbing AI creativity: How to provide effective guardrails