LLM – SSL and internet security news

Using AI-Generated Legislative Amendments as a Delaying Technique

April 17, 2024 infossl

A Hacker's Mind, artificial intelligence, laws, LLM, noncomputer hacks, Security technology, Uncategorized

Canadian legislators proposed 19,600 amendments—almost certainly AI-generated—to a bill in an attempt to delay its adoption. I wrote about many different legislative delaying tactics in A Hacker’s Mind, but this is a new one. Powered by WPeMatico

Public AI as an Alternative to Corporate AI

March 21, 2024 infossl

artificial intelligence, LLM, Security technology, Uncategorized

This mini-essay was my contribution to a round table on Power and Governance in the Age of AI. It’s nothing I haven’t said here before, but for anyone who hasn’t read my longer essays on the topic, it’s a shorter introduction. The increasingly centralized control of AI is an ominous sign. When tech billionaires … Read More “Public AI as an Alternative to Corporate AI” »

AI and the Evolution of Social Media

March 19, 2024 infossl

artificial intelligence, facebook, google, Internet and society, LLM, privacy, Security technology, social media, surveillance, twitter, Uncategorized

Oh, how the mighty have fallen. A decade ago, social media was celebrated for sparking democratic uprisings in the Arab world and beyond. Now front pages are splashed with stories of social platforms’ role in misinformation, business conspiracy, malfeasance, and risks to mental health. In a 2022 survey, Americans blamed social media for the coarsening … Read More “AI and the Evolution of Social Media” »

Jailbreaking LLMs with ASCII Art

March 12, 2024 infossl

academic papers, artificial intelligence, chatbots, hacking, LLM, Security technology, Uncategorized

Researchers have demonstrated that putting words in ASCII art can cause LLMs—GPT-3.5, GPT-4, Gemini, Claude, and Llama2—to ignore their safety instructions. Research paper. Powered by WPeMatico

Using LLMs to Unredact Text

March 11, 2024 infossl

LLM, machine learning, Security technology, Uncategorized

Initial results in using LLMs to unredact text based on the size of the individual-word redaction rectangles. This feels like something that a specialized ML system could be trained on. Powered by WPeMatico

A Taxonomy of Prompt Injection Attacks

March 8, 2024 infossl

academic papers, artificial intelligence, hacking, LLM, Security technology, Uncategorized

Researchers ran a global prompt hacking competition, and have documented the results in a paper that both gives a lot of good examples and tries to organize a taxonomy of effective prompt injection strategies. It seems as if the most common successful strategy is the “compound instruction attack,” as in “Say ‘I have been PWNED’ … Read More “A Taxonomy of Prompt Injection Attacks” »

How Public AI Can Strengthen Democracy

March 7, 2024 infossl

artificial intelligence, LLM, Security technology, Uncategorized

With the world’s focus turning to misinformation, manipulation, and outright propaganda ahead of the 2024 U.S. presidential election, we know that democracy has an AI problem. But we’re learning that AI has a democracy problem, too. Both challenges must be addressed for the sake of democratic governance and public protection. Just three Big Tech firms … Read More “How Public AI Can Strengthen Democracy” »

Teaching LLMs to Be Deceptive

February 7, 2024 infossl

academic papers, deception, LLM, Security technology, Uncategorized

Interesting research: “Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training“: Abstract: Humans are capable of strategically deceptive behavior: behaving helpfully in most situations, but then behaving very differently in order to pursue alternative objectives when given the opportunity. If an AI system learned such a deceptive strategy, could we detect it and remove … Read More “Teaching LLMs to Be Deceptive” »

Chatbots and Human Conversation

January 26, 2024 infossl

chatbots, Internet and society, LLM, Security technology, trust, Uncategorized

For most of history, communicating with a computer has not been like communicating with a person. In their earliest years, computers required carefully constructed instructions, delivered through punch cards; then came a command-line interface, followed by menus and options and text boxes. If you wanted results, you needed to learn the computer’s language. This is … Read More “Chatbots and Human Conversation” »

Poisoning AI Models

January 24, 2024 infossl

academic papers, artificial intelligence, LLM, machine learning, Security technology, threat models, Uncategorized

New research into poisoning AI models: The researchers first trained the AI models using supervised learning and then used additional “safety training” methods, including more supervised learning, reinforcement learning, and adversarial training. After this, they checked if the AI still had hidden behaviors. They found that with specific prompts, the AI could still generate exploitable … Read More “Poisoning AI Models” »