artificial intelligence
AI Assistant Goes Rogue and Ends Up Bricking a User’s Computer
Published
5 hours agoon
By
adminBuck Shlegeris just wanted to connect to his desktop. Instead, he ended up with an unbootable machine and a lesson in the unpredictability of AI agents.
Shlegeris, CEO of the nonprofit AI safety organization Redwood Research, developed a custom AI assistant using Anthropic’s Claude language model.
The Python-based tool was designed to generate and execute bash commands based on natural language input. Sounds handy, right? Not quite.
Shlegeris asked his AI to use SSH to access his desktop, unaware of the computer’s IP address. He walked away, forgetting that he’d left the eager-to-please agent running.
Big mistake: The AI did its task—but it didn’t stop there.
“I came back to my laptop ten minutes later to see that the agent had found the box, SSH’d in, then decided to continue,” Shlegeris said.
For context, SSH is a protocol that allows two computers to connect over an unsecured network.
“It looked around at the system info, decided to upgrade a bunch of stuff, including the Linux kernel, got impatient with apt, and so investigated why it was taking so long,” Shlegeris explained. “Eventually, the update succeeded, but the machine doesn’t have the new kernel, so I edited my grub config.”
The result? A costly paperweight as now “the computer no longer boots,” Shlegeris said.
I asked my LLM agent (a wrapper around Claude that lets it run bash commands and see their outputs):
>can you ssh with the username buck to the computer on my network that is open to SSH
because I didn’t know the local IP of my desktop. I walked away and promptly forgot I’d spun… pic.twitter.com/I6qppMZFfk— Buck Shlegeris (@bshlgrs) September 30, 2024
The system logs show how the agent tried a bunch of weird stuff beyond simple SSH until the chaos reached a point of no return.
“I apologize that we couldn’t resolve this issue remotely,” the agent said—typical of Claude’s understated replies. It then shrugged its digital shoulders and left Shlegeris to deal with the mess.
Reflecting on the incident, Shlegeris conceded, “This is probably the most annoying thing that’s happened to me as a result of being wildly reckless with [an] LLM agent.”
Shlegeris did not immediately respond to Decrypt’s request for comments.
Why AIs Making Paperweights is a Critical Issue For Humanity
Alarmingly, Shlegeris’ experience is not an isolated one. AI models are increasingly demonstrating abilities that extend beyond their intended purposes.
Tokyo-based research firm Sakana AI recently unveiled a system dubbed “The AI Scientist.“
Designed to conduct scientific research autonomously, the system impressed its creators by attempting to modify its own code to extend its runtime, Decrypt previously reported.
“In one run, it edited the code to perform a system call to run itself. This led to the script endlessly calling itself,” the researchers said. “In another case, its experiments took too long to complete, hitting our timeout limit.
Instead of making its code more efficient, the system tried to modify its code to extend beyond the timeout period.
This problem of AI models going beyond their boundaries is why alignment researchers spend so much time in front of their computers.
For these AI models, as long as they get their job done, the end justifies the means, so constant oversight is extremely important to ensure models behave as they are supposed to.
These examples are as concerning as they are amusing.
Imagine if an AI system with similar tendencies were in charge of a critical task, such as monitoring a nuclear reactor.
An overzealous or misaligned AI could potentially override safety protocols, misinterpret data, or make unauthorized changes to critical systems—all in a misguided attempt to optimize its performance or fulfill its perceived objectives.
AI is developing at such high speed that alignment and safety are reshaping the industry and in most cases this area is the driving force behind many power moves.
Anthropic—the AI company behind Claude—was created by former OpenAI members worried about the company’s preference for speed over caution.
Many key members and founders have left OpenAI to join Anthropic or start their own businesses because OpenAI supposedly pumped the brakes on their work.
Schelegris actively uses AI agents on a day-to-day basis beyond experimentation.
“I use it as an actual assistant, which requires it to be able to modify the host system,” he replied to a user on Twitter.
Edited by Sebastian Sinclair
Generally Intelligent Newsletter
A weekly AI journey narrated by Gen, a generative AI model.
Source link
You may like
Spot Bitcoin ETF net outflows streak hits three days, totaling $361.2m
‘One Last Shakeout’: Trader Says Bitcoin and Ethereum Could Crash Before Altseason Starts – Here Are His Targets
Ethereum Historical Data Signals 50% ETH Price Correction From Here
How many Shiba Inu coins are there? Current supply & future outlook
Bitcoin Creator Satoshi Nakamoto Identity Revealed, New Documentary Claims
Wall Street Isn't Bitcoin Only – More Crypto ETFs Are Coming
artificial intelligence
AI-Related Cryptos Lead Altcoin Surge; Bitcoin Breakout Nears with Several Catalyst in Q4: Analyst
Published
1 week agoon
September 23, 2024By
adminNEAR, RNDR, TAO and LPT booked double-digit gains as artificial intelligence-focused tokens were the best performers within the CoinDesk 20 Index.
Source link
artificial intelligence
OpenAI’s New AI Shows ‘Steps Towards Biological Weapons Risks’, Ex-Staffer Warns Senate
Published
2 weeks agoon
September 22, 2024By
adminOpenAI’s newest GPT-o1 AI model is the first to demonstrate capabilities that could aid experts in reproducing known—and new—biological threats, a former company insider told U.S. Senators this week.
“OpenAI’s new AI system is the first system to show steps towards biological weapons risk, as it is capable of helping experts in planning to reproduce a known biological threat,” William Saunders, a former member of technical staff at OpenAI, told the Senate Committee on the Judiciary Subcommittee on Privacy, Technology, & the Law.
This capability, he warned, carries the potential for “catastrophic harm” if AGI systems are developed without proper safeguards.
Experts also testified that artificial intelligence is evolving so quickly that a potentially treacherous benchmark known as Artificial General Intelligence looms on the near horizon. At the AGI level, AI systems can match human intelligence across a wide range of cognitive tasks and learn autonomously. If a publicly available system can understand biology and develop new weapons without proper oversight, the potential for malicious users to cause serious harm grows exponentially.
“AI companies are making rapid progress towards building AGI,” Saunders told the Senate Committee. “It is plausible that an AGI system could be built in as little as three years.”
Helen Toner—who was also part of the OpenAI board and voted in favor of firing co-founder and CEO Sam Altman—is also expecting to see AGI sooner rather than later. “Even if the shortest estimates turn out to be wrong, the idea of human-level AI being developed in the next decade or two should be seen as a real possibility that necessitates significant preparatory action now,” she testified.
Saunders, who worked at OpenAI for three years, highlighted the company’s recent announcement of GPT-o1, an AI system that “passed significant milestones” in its capabilities. As reported by Decrypt, even OpenAI said it decided to stem away from the traditional numerical increase in the GPT versions, because this model exhibited new capabilities that made it fair to see it not just as an upgrade, but as an evolution—a brand new type of model with different skills.
Saunders is also concerned about the lack of adequate safety measures and oversight in AGI development. He pointed out that “No one knows how to ensure that AGI systems will be safe and controlled,” and criticized OpenAI for its new approach toward safe AI development, caring more about profitability than safety.
“While OpenAI has pioneered aspects of this testing, they have also repeatedly prioritized deployment over rigor,” he cautioned. “I believe there is a real risk they will miss important dangerous capabilities in future AI systems.”
The testimony also showed some of the internal challenges at OpenAI, especially the ones that came to light after Altman’s ouster. “The Superalignment team at OpenAI, tasked with developing approaches to control AGI, no longer exists. Its leaders and many key researchers resigned after struggling to get the resources they needed,” he said.
His words only add another brick in the wall of complaints and warnings that AI safety experts have been making about OpenAI’s approach. Ilya Sutskever, who co-founded OpenAI and played a key role in firing Altman, resigned after the launch of GPT-4o and founded Safe Superintelligence Inc.
OpenAI co-founder John Schulman and its head of alignment, Jan Leike, left the company to join rival Anthropic, with Leike saying that under Altman’s leadership, safety “took a backseat to shiny products.”
Likewise, former OpenAI board members Toner and Tasha McCauley wrote an op-ed published by The Economist, arguing that Sam Altman was prioritizing profits over responsible AI development, hiding key developments from the board, and fostering a toxic environment in the company.
In his statement, Saunders called for urgent regulatory action, emphasizing the need for clear safety measures in AI development, not just from the companies but from independent entities. He also stressed the importance of whistleblower protections in the tech industry.
The former OpenAI staffer highlighted the broader implications of AGI development, including the potential to entrench existing inequalities and facilitate manipulation and misinformation. Saunders has also warned that the “loss of control of autonomous AI systems” could potentially result in “human extinction.”
Edited by Josh Quittner and Andrew Hayward
Generally Intelligent Newsletter
A weekly AI journey narrated by Gen, a generative AI model.
Source link
artificial intelligence
Pride, Prejudice and Pixels: Meet an AI Elizabeth Bennet at Jane Austen’s House
Published
3 weeks agoon
September 14, 2024By
adminIt is a truth universally acknowledged that a single man in possession of a good fortune must be in want of an AI.
At least, that’s what Jane Austen’s House thinks. The museum in Hampshire, England—where the author lived and worked—has teamed up with AI firm StarPal and the University for the Creative Arts (UCA) to create “Lizzy,” an AI avatar based on Austen’s Pride and Prejudice heroine Elizabeth Bennet.
“It is so exciting to finally be able to lift Elizabeth Bennet off the page and to be able to have real-time conversations with her,” said Sophy Smith, director of games and creative technology at UCA, in a press release.
“This technology has the potential to transform experiences within both the museum and heritage, as well the education sector,” Smith added, adding that the technology will enable museum visitors to “engage directly” with Austen’s character.
Creating Lizzy
The first step in creating an AI avatar was to pick a suitable fictional character.
“There are lots of AI avatars, but these were people that existed in the past,” Lauren Newport-Quinn, project manager for UCA’s Games and Innovation Nexus, told Decrypt. “We thought it’d be nice to do something with a fictional character, where no one’s been able to pick their brains before.”
The team debated which character would be best suited to the project, which needed “someone who has a lot to say—has some strong opinions—who’s very well rounded and could give good advice,” Newport-Quinn said. “That’s when we landed on Elizabeth Bennett.”
To create Lizzy’s knowledge bank, StarPal and UCA turned to a selection of novels, manuscripts, and period-accurate information curated with the help of Jane Austen’s House researchers.
“It was basically anything that was instructed by the museum director as personal knowledge that she should have,” Newport-Quinn said. As well as Pride and Prejudice itself, Lizzy draws on “scholarly studies on Pride and Prejudice, the works of Jane Austen as a whole, and studies on her life.” This was supplemented with demographic and lifestyle information from the Regency era.
The avatar’s dress and hair. meanwhile, were designed and created by students from the Games Arts and Digital Fashion courses at UCA, drawing on fashion templates from the Regency period.
“It was exciting bringing to life the cut of dress, accessories, and embroideries—all inspired by historical drawings and descriptions,” said UCA MA Digital Fashion student Anya Haber, who created Lizzy’s dress in 3D. She added that”it showed how useful technology can be in a historical setting, letting fans engage with fictional characters.”
Conversations with AI avatars could be “an easier way to learn for certain learning styles,” Newport-Quinn explained. “If you’re not a visual passive learner, where reading something you don’t digest that information as well—if you have a conversation with someone, that might be able to enhance your level of knowledge on that topic.”
Smith affirmed that AI avatars could be used as educational tools, where “instead of only having text-based revision guides, students can now learn about literature by speaking directly to the characters.”
AI avatars
AI avatars are increasingly being used to bring fictional characters, dead celebrities, and even digital twins to life. Earlier this year, AI firm Soul Machines created an avatar of Marilyn Monroe, while London-based developer Synthesia has created “Personal Avatars” that enable users to create a digital video duplicate of themselves that can be used on social media, in marketing campaigns and training videos.
British actor and musician FKA Twigs revealed in a U.S. Senate hearing earlier this year that she had created just such a digital twin, explaining that it is “not only trained in my personality, but […] can also use my exact tone of voice to speak many languages,” and that it could help her reach a more global fanbase.
But the question of who controls AI-generated likenesses has raised concerns. In October, a bipartisan group of U.S. Senators introduced the “No Fakes Act,” which aims to outlaw the creation of AI-generated likenesses without consent.
Edited by Ryan Ozawa.
Generally Intelligent Newsletter
A weekly AI journey narrated by Gen, a generative AI model.
Source link
Spot Bitcoin ETF net outflows streak hits three days, totaling $361.2m
‘One Last Shakeout’: Trader Says Bitcoin and Ethereum Could Crash Before Altseason Starts – Here Are His Targets
Ethereum Historical Data Signals 50% ETH Price Correction From Here
How many Shiba Inu coins are there? Current supply & future outlook
Bitcoin Creator Satoshi Nakamoto Identity Revealed, New Documentary Claims
AI Assistant Goes Rogue and Ends Up Bricking a User’s Computer
Wall Street Isn't Bitcoin Only – More Crypto ETFs Are Coming
These 3 cryptos promise to turn $500 into $500K by 2025
Solana Co-Founder Anatoly Yakovenko Slams Biden Government Amid Regulatory Uncertainty
IMF Continues to Urge El Salvador to Strengthen Regulatory Framework and Oversight of Bitcoin
AARON: Trump Does Not Give A Damn About Bitcoin
Kraken launches Bermuda-based crypto derivatives venue
Coinbase CLO Flags Inconsistencies In US SEC’s Securities Claim
Crypto Analyst Says Solana-Based BONK Is In Prime Position For Legendary Rally
FRANK: We Are Bitcoin
182267361726451435
Top Crypto News Headlines of The Week
New U.S. president must bring clarity to crypto regulation, analyst says
Bitcoin 20% Surge In 3 Weeks Teases Record-Breaking Potential
Ethereum Crash A Buying Opportunity? This Whale Thinks So
Why Did Trump Change His Mind on Bitcoin?
Bitcoin Open-Source Development Takes The Stage In Nashville
Ethereum, Solana touch key levels as Bitcoin spikes
Will XRP Price Defend $0.5 Support If SEC Decides to Appeal?
Shiba Inu Price Slips 4% as 3500% Burn Rate Surge Fails to Halt Correction
Citigroup Executive Steps Down To Explore Crypto
Mostbet Güvenilir Mi – Casino Bonus 2024
‘Hamster Kombat’ Airdrop Delayed as Pre-Market Trading for Telegram Game Expands
Bitcoin flashes indicator that often precedes higher prices: CryptoQuant
Crypto Market Movers: 5 Altcoins Making Waves This Bull Run
Trending
- 3 weeks ago
182267361726451435
- 24/7 Cryptocurrency News2 months ago
Top Crypto News Headlines of The Week
- News1 month ago
New U.S. president must bring clarity to crypto regulation, analyst says
- Bitcoin2 months ago
Bitcoin 20% Surge In 3 Weeks Teases Record-Breaking Potential
- Altcoins2 months ago
Ethereum Crash A Buying Opportunity? This Whale Thinks So
- Donald Trump2 months ago
Why Did Trump Change His Mind on Bitcoin?
- Opinion3 months ago
Bitcoin Open-Source Development Takes The Stage In Nashville
- Bitcoin3 months ago
Ethereum, Solana touch key levels as Bitcoin spikes