'Dangerous' AI generates words that don't exist

The algorithm was developed by an ex-Instagram engineer who helped build the app's recommendation algorithm

Thursday 14 May 2020 16:26 BST

Language has continually always evolved naturally over time and these are some of the latest mutations (Rex)

Your support helps us to tell the story

From reproductive rights to climate change to Big Tech, The Independent is on the ground when the story is developing. Whether it's investigating the financials of Elon Musk's pro-Trump PAC or producing our latest documentary, 'The A Word', which shines a light on the American women fighting for reproductive rights, we know how important it is to parse out the facts from the messaging.

At such a critical moment in US history, we need reporters on the ground. Your donation allows us to keep sending journalists to speak to both sides of the story.

The Independent is trusted by Americans across the entire political spectrum. And unlike many other quality news outlets, we choose not to lock Americans out of our reporting and analysis with paywalls. We believe quality journalism should be available to everyone, paid for by those who can afford it.

Your support makes all the difference.

A new AI has been created to generate words that do not exist.

The one-shot website develops new, artificially-generated definitions for the non-existent words.

ThisWordDoesNotExist.com generates new words such as “wacamole” (a single serving of waffle batter made with a sweet cornmeal mixture), “pileset” (form a mass of, or make a shape about, something), or “prayman” (the principal or leading men in a society or enterprise).

Users click a button on the site, and a new word is made.

The website was developed by San Francisco-based developer Thomas Dimson, an engineer who used to work for the Facebook-owned Instagram developing its recommendations algorithm.

The actual artificial intelligence that creates new words is based on the natural language processing algorithm Transformers and the language framework GPT-2 - an algorithm that can be fed a piece of text and use the information to predict the words that can come next and create writing that can be near-indistinguishable from that written by a human.

GPT-2 gained notoriety for being “too dangerous to release” but the researchers have since made it available for use.

The site works by looking through a database of eight million webpages, taken from the most upvoted content on the social media site Reddit. Algorithms are able to detect when one word appears next to another word, and using that information (and replicating it enough times) means it can generate new words and sentences.

Like every other artificial intelligence, the system is not perfect. A small disclaimer at the bottom of the site says that “words are not reviewed and may reflect bias in the training set”.

Artificial intelligence systems have often been been criticised when it is unregulated or used for warfare but also has many benefits.

Data-driven platforms have been promoted as ways of helping disrupt illegal wildlife trades or translating thoughts into text directly from your brain.

Join our commenting forum

Join thought-provoking conversations, follow other Independent readers and see their replies

0Comments

Thank you for registering

'Dangerous' AI generates words that don't exist

The algorithm was developed by an ex-Instagram engineer who helped build the app's recommendation algorithm

Join our commenting forum

Thank you for registering