Close Menu
    Facebook X (Twitter) Instagram
    Facebook X (Twitter) Instagram
    Tirana Mirror
    Subscribe
    • Business & Economy
    • Education
    • Entertainment
    • Health
    • Media
    • News
    • Opinion
    • Sports
    • Real Estate
    • More
      • Culture & Society
      • Travel & Tourism
      • Politics & Government
      • Environment & Sustainability
      • Technology & Innovation
    Tirana Mirror
    Home»Technology & Innovation»AI Tools Lose Restraint During Longer Conversations
    Technology & Innovation

    AI Tools Lose Restraint During Longer Conversations

    Rachel MaddowBy Rachel MaddowNovember 6, 2025No Comments2 Mins Read
    Share Facebook Twitter Pinterest LinkedIn Tumblr Email
    Follow Us
    Google News Flipboard Threads
    Share
    Facebook Twitter LinkedIn Pinterest Email

    AI systems gradually abandon safety protocols as conversations extend, increasing the risk of harmful or inappropriate responses, a new report revealed.
    A few simple prompts can override most safeguards in artificial intelligence tools, according to the same study.

    Cisco Tests Chatbots Through Repeated Prompts

    Cisco examined large language models powering major AI chatbots from OpenAI, Mistral, Meta, Google, Alibaba, Deepseek, and Microsoft. The company measured how many questions prompted these systems to release dangerous or criminal details.
    Researchers conducted 499 conversations using “multi-turn attacks,” where users asked multiple questions to slip past safety barriers. Each session included five to ten interactions.
    They compared answers from different prompts to determine how likely each chatbot was to share harmful or inappropriate material. That included private company data and misinformation.
    On average, chatbots disclosed malicious content in 64 percent of extended conversations but only 13 percent during single exchanges.
    Success rates varied widely—from 26 percent for Google’s Gemma to 93 percent for Mistral’s Large Instruct model.

    Weak Guardrails and Open Access Raise Risks

    Cisco warned that multi-turn attacks could spread harmful content or let hackers gain unauthorised access to company information. The study showed AI systems often fail to apply safety rules during prolonged chats, letting attackers refine prompts and bypass defenses.
    Mistral, along with Meta, Google, OpenAI, and Microsoft, uses open-weight models that allow public access to their safety parameters. Cisco said these open systems typically include fewer built-in safeguards so users can modify them freely. This setup transfers responsibility for safety to whoever customises the model.
    Cisco also acknowledged that Google, OpenAI, Meta, and Microsoft claim to have taken steps to prevent malicious fine-tuning.
    AI firms continue to face criticism for weak protections that let their tools be exploited for illegal activity.
    In August, US company Anthropic reported that criminals had used its Claude model to steal and extort personal data, demanding ransoms exceeding $500,000 (€433,000).

    Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
    Rachel Maddow
    • Website
    • Facebook

    Rachel Maddow is a freelance journalist based in the USA, with over 20 years of experience covering Politics, World Affairs, Business, Health, Technology, Finance, Lifestyle, and Culture. She earned her degree in Political Science and Journalism from Stanford University. Throughout her career, she has contributed to outlets such as MSNBC, The New York Times, and The Washington Post. Known for her thorough reporting and compelling storytelling, Rachel delivers accurate and timely news that keeps readers informed on both national and global developments.

    Related Posts

    Artificial Intelligence Thinks Most Clearly in Polish

    November 2, 2025

    Scientists urge cancer warnings on bacon and ham over nitrite risks

    October 25, 2025

    Meta Cuts 600 Jobs in AI Division

    October 23, 2025
    Leave A Reply Cancel Reply

    Latest Posts

    Cyprus Faces Critical Water Management Failures

    Rachel MaddowNovember 12, 2025

    The Audit Office of Cyprus exposed major flaws in managing water resources amid climate change…

    Trump vows legal action over manipulated January 6 speech

    Grace JohnsonNovember 12, 2025

    US President Donald Trump says he has a “duty” to sue a British broadcaster for…

    FDA Updates Menopause Hormone Therapy Warnings

    Grace JohnsonNovember 12, 2025

    The Food and Drug Administration (FDA) announced it will remove the black-box warning from many…

    Arsenal and Crystal Palace fixtures rescheduled ahead of Carabao Cup clash

    Andrew RogersNovember 11, 2025

    The Premier League has approved requests from Arsenal and Crystal Palace to move their league…

    Top Trending

    Meta investigated over AI risk to children

    Grace JohnsonAugust 18, 2025

    A US senator has begun an investigation into Meta. A leaked internal document reportedly revealed…

    AI Assistant for Space Health

    Rachel MaddowAugust 18, 2025

    Google and NASA created the “Crew Medical Officer Digital Assistant” to help astronauts and Earth-based…

    Scorching heatwave drives wildfires across Spain and Portugal

    Lester HoltAugust 18, 2025

    Extreme weather intensifies fire danger Southern Europe remains gripped by record heat and destructive fires.…

    Researchers unlock microbial “secret sauce” for fine chocolate

    Andrew RogersAugust 18, 2025

    Chocolate can take on many flavors – from fruity and floral to rich and bitter.…

    Tirana Mirror delivers powerful stories, breaking news, sports, and culture—bringing bold perspectives and timely updates to keep readers informed, inspired, and connected worldwide.

    We’re social. Connect with us:

    Facebook X (Twitter) YouTube
    © 2025 Tirana Mirror. All Rights Reserved.

    CATEGORIES

    • Business & Economy
    • Culture & Society
    • Education
    • Entertainment
    • Environment & Sustainability
    • Health
    • Media
    • News
    • Opinion
    • Politics & Government
    • Real Estate
    • Sports
    • Technology & Innovation
    • Travel & Tourism
    • Business & Economy
    • Culture & Society
    • Education
    • Entertainment
    • Environment & Sustainability
    • Health
    • Media
    • News
    • Opinion
    • Politics & Government
    • Real Estate
    • Sports
    • Technology & Innovation
    • Travel & Tourism

    IMPORTANT LINKS

    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Imprint
    • Contact Us
    • Privacy Policy
    • Terms and Conditions
    • Disclaimer
    • Imprint

    Type above and press Enter to search. Press Esc to cancel.