AI research

Google researchers make voice a solid smartphone interface

Summary Until now, AI has had a hard time controlling smartphone interfaces. But Google researchers seem to have found a way. To improve voice-based interaction with mobile user interfaces, researchers at Google Research have been investigating the use of large language models (LLMs). Current mobile intelligent assistants are limited in conversational interactions because they cannot …

HumanRF enables photorealistic 3D avatars

Summary HumanRF brings high-resolution 3D avatars to NeRFs. Behind it is an AI startup for synthetic media. Neural Radiance Fields (NeRFs) learn 3D representations from photos or videos and can render individual objects or entire scenes. Some variants specialize in moving scenes or objects, others experiment with editing capabilities, and others attempt to render people …

StarCoder is a performant open-source model for copyright-compliant code

Summary BigCode, a joint initiative of Hugging Face and ServiceNow, introduces StarCoder and StarCoderBase, two large open-source code language models. The researchers place special emphasis on transparent and copyright-compliant data selection. The 15.5 billion parameter StarCoder models can generate code in 86 programming languages. In a novel approach, the researchers used a method called “multi-query …

Between dietary advice and surveillance dystopia

Summary DetGPT gives a preview of the AI applications that will be possible with multimodal models in the future – and not just the good ones. At the GPT-4 launch, OpenAI demonstrated some multimodal capabilities, including converting a photographed and scribbled web design into code or the ability to answer questions about images, which is …

OpenAI tests whether GPT-4 can explain how AI works

Summary Can OpenAI’s GPT-4 help make AI safer? The company’s large language model tried to explain GPT-2 neurons. In a recent paper, OpenAI shows how AI can help interpret the internal workings of large language models. The team used GPT-4 to generate and evaluate explanations for neurons from its older predecessor, GPT-2. The work is …

Meta’s new open-source model combines six data types

Summary Meta’s ImageBind is a new multimodal model that combines six data types. Meta is releasing it as open source. ImageBind makes the metaverse seem a little less like a distant vision of the future: In addition to text, the AI model understands audio, visual, motion sensor, thermal, and depth data. At least in theory, …

Shap-E is OpenAI’s fastest text-to-3D model to date

Summary OpenAI dominates the media with ChatGPT, but the company is also researching other generative AI models. A new paper shows a text-to-3D model. In late 2022, OpenAI unveiled Point-E, a generative AI model for text-to-3D that received little attention given the enormous success of ChatGPT that same month. In part, this was because Point-E …

AI’s most puzzling aspect has just been challenged

Summary Emergent abilities in large language models have generated both excitement and concern. Now, Stanford researchers suggest that these abilities may be more of a metric-induced mirage than a real phenomenon. The sudden emergence of new abilities while scaling large language models is a fascinating topic and a reason for and against further scaling of …
