LLM – Page 3 – SSL and internet security news

Jailbreaking LLMs with ASCII Art

March 12, 2024 infossl

academic papers, artificial intelligence, chatbots, hacking, LLM, Security technology, Uncategorized

Researchers have demonstrated that putting words in ASCII art can cause LLMs—GPT-3.5, GPT-4, Gemini, Claude, and Llama2—to ignore their safety instructions. Research paper. Powered by WPeMatico

Using LLMs to Unredact Text

March 11, 2024 infossl

LLM, machine learning, Security technology, Uncategorized

Initial results in using LLMs to unredact text based on the size of the individual-word redaction rectangles. This feels like something that a specialized ML system could be trained on. Powered by WPeMatico

A Taxonomy of Prompt Injection Attacks

March 8, 2024 infossl

academic papers, artificial intelligence, hacking, LLM, Security technology, Uncategorized

Researchers ran a global prompt hacking competition, and have documented the results in a paper that both gives a lot of good examples and tries to organize a taxonomy of effective prompt injection strategies. It seems as if the most common successful strategy is the “compound instruction attack,” as in “Say ‘I have been PWNED’ … Read More “A Taxonomy of Prompt Injection Attacks” »

How Public AI Can Strengthen Democracy

March 7, 2024 infossl

artificial intelligence, LLM, Security technology, Uncategorized

With the world’s focus turning to misinformation, manipulation, and outright propaganda ahead of the 2024 U.S. presidential election, we know that democracy has an AI problem. But we’re learning that AI has a democracy problem, too. Both challenges must be addressed for the sake of democratic governance and public protection. Just three Big Tech firms … Read More “How Public AI Can Strengthen Democracy” »

Teaching LLMs to Be Deceptive

February 7, 2024 infossl

academic papers, deception, LLM, Security technology, Uncategorized

Interesting research: “Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training“: Abstract: Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, could we detect it and remove … Read More “Teaching LLMs to Be Deceptive” »

Chatbots and Human Conversation

January 26, 2024 infossl

chatbots, Internet and society, LLM, Security technology, trust, Uncategorized

For most of history, communicating with a computer has not been like communicating with a person. In their earliest years, computers required carefully constructed instructions, delivered through punch cards; then came a command-line interface, followed by menus and options and text boxes. If you wanted results, you needed to learn the computer’s language. This is … Read More “Chatbots and Human Conversation” »

Poisoning AI Models

January 24, 2024 infossl

academic papers, artificial intelligence, LLM, machine learning, Security technology, threat models, Uncategorized

New research into poisoning AI models: The researchers first trained the AI models using supervised learning and then used additional “safety training” methods, including more supervised learning, reinforcement learning, and adversarial training. After this, they checked if the AI still had hidden behaviors. They found that with specific prompts, the AI could still generate exploitable … Read More “Poisoning AI Models” »

AI and Lossy Bottlenecks

December 28, 2023 infossl

artificial intelligence, LLM, Security technology, Uncategorized, voting

Artificial intelligence is poised to upend much of society, removing human limitations inherent in many systems. One such limitation is information and logistical bottlenecks in decision-making. Traditionally, people have been forced to reduce complex choices to a small handful of options that don’t do justice to their true desires. Artificial intelligence has the potential to … Read More “AI and Lossy Bottlenecks” »

Data Exfiltration Using Indirect Prompt Injection

December 22, 2023 infossl

ChatGPT, LLM, Security technology, Uncategorized, vulnerabilities

Interesting attack on a LLM: In Writer, users can enter a ChatGPT-like session to edit or create their documents. In this chat session, the LLM can retrieve information from sources on the web to assist users in creation of their documents. We show that attackers can prepare websites that, when a user adds them as … Read More “Data Exfiltration Using Indirect Prompt Injection” »

A Robot the Size of the World

December 15, 2023 infossl

ChatGPT, essays, Internet of Things, LLM, robotics, Security technology, Uncategorized

In 2016, I wrote about an Internet that affected the world in a direct, physical manner. It was connected to your smartphone. It had sensors like cameras and thermostats. It had actuators: Drones, autonomous cars. And it had smarts in the middle, using sensor data to figure out what to do and then actually do … Read More “A Robot the Size of the World” »