OpenAI Whisper prone to hallucinations, researchers say

Researchers have discovered that Whisper, OpenAI's AI-powered transcription tool used across industries including healthcare, is prone to fabricating words or entire sentences, a phenomenon known as hallucination. According to AP interviews with software engineers, developers, and academic researchers, these hallucinations can include problematic content such as racial commentary, violent rhetoric, and imagined medical treatments. … Read more

How easy it is to fool an AI

Google’s NotebookLM can be fooled by manipulated websites. Developer Ted Benson demonstrated this by presenting his website to Google’s AI crawler with a made-up story about a trip to the moon with a bicycle, balloons, and scuba gear, while human visitors saw the regular page. He warns that this method of feeding LLMs with targeted … Read more

Figma withdraws AI tool “Make Designs”

Figma has temporarily withdrawn its new AI tool “Make Designs” after it created designs for a weather app that were strikingly similar to Apple’s version. This raises the question of whether the models from OpenAI and Amazon used by Figma were trained on Apple’s designs. Figma has taken responsibility for the incident and intends to … Read more

Why ChatGPT & Co. sometimes fail spectacularly at certain tasks

In a previous Smart Content Report, I featured a funny illustrated guide generated by ChatGPT's DALL-E. I find such "failures" interesting because they can reveal fundamental problems: for example, we are still a long way from an AI that actually understands the world around it (a "General World Model"). At the moment, these tools … Read more

Even advanced AI still struggles as an agent

A new benchmark test from Sierra shows that even advanced language models such as GPT-4o still struggle with more complicated tasks in everyday scenarios, achieving a success rate of less than 50 percent. The test, called TAU-bench, is designed to help developers evaluate the performance of AI agents in realistic situations, taking into account factors … Read more

Google’s “AI Overviews” stumble

The recently introduced "AI Overviews" in Google Search have produced some strange results – some embarrassing, some ridiculous, some dangerous. This can be seen as an example of what various experts already know and preach: don't let your AI work unsupervised. For example, one of Google AI's recommendations was that cheese sticks better to pizza … Read more

Google embarrasses itself with Gemini’s political correctness

We reported on Google’s AI offensive under the “Gemini” banner, but soon after, it was the integrated image generator that made the headlines: it had apparently been steered too heavily toward diversity. What is generally a good idea makes no sense if, for example, you want a picture of the “founding fathers” of … Read more

Air Canada has to answer for incorrect information provided by its chatbot

Air Canada’s chatbot gave a customer incorrect information about the airline’s refund policy. In court, the airline argued that the chatbot itself was responsible for what it said, not Air Canada. The court disagreed, and the company had to pay up. Source: The Guardian