Top News
OpenAI announces SearchGPT, its AI-powered search engine
OpenAI has announced its entry into the search market with SearchGPT, an AI-powered search engine that organizes and makes sense of search results rather than just providing a list of links. The search engine, which is currently in prototype stage, is powered by the GPT-4 family of models and will initially be accessible to 10,000 test users. OpenAI is working with third-party partners and using direct content feeds to build its search results, with the aim of integrating the search features directly into ChatGPT. The company has collaborated with various news partners, including The Wall Street Journal, The Associated Press, and Vox Media, to develop SearchGPT, and publishers will have the option to manage how they appear in OpenAI search features.
OpenAI’s Search Tool Has Already Made a Mistake
Mistral AI Unveils Mistral Large 2, Beats Llama 3.1 on Code and Math
Mistral AI has launched Mistral Large 2, a new generation of its flagship model, boasting 123 billion parameters and a 128k context window. The model supports over 80 coding languages and multiple languages, including French, German, Spanish, and Chinese, making it a versatile tool for diverse linguistic needs. It outperforms leading models like GPT-4o and Llama 3, achieving 84.0% accuracy on the MMLU benchmark. The model is designed for research and non-commercial use, with a focus on reducing hallucinations and enhancing reasoning and problem-solving skills. It is available through partnerships with Google Cloud Platform, Azure AI Studio, Amazon Bedrock, and IBM watsonx.ai, and can be accessed via la Plateforme under the name mistral-large-2407.
Silicon Valley shaken as open-source AI models Llama 3.1 and Mistral Large 2 match industry leaders
Google Is the Only Search Engine That Works on Reddit Now Thanks to AI Deal
Google has become the exclusive search engine capable of surfacing results from Reddit, one of the internet’s most significant sources of user-generated content. This development means that alternative search engines like Bing, DuckDuckGo, Mojeek, and Qwant, which do not rely on Google’s indexing, will no longer display recent results from Reddit. The change follows Reddit’s decision to restrict access to its site to prevent companies from scraping it for AI training data, a privilege now only granted to Google due to a multi-million dollar agreement. This situation highlights Google’s near-monopoly on search, which is increasingly seen as a barrier to competition, especially as the tech giant faces criticism over the quality of its search results.
Other News
Tools
Adobe rolls out more generative AI features to Illustrator and Photoshop – Adobe introduces new generative AI features to Illustrator and Photoshop, including tools like Generative Shape Fill and Text to Pattern in Illustrator, and Generate Image and Enhance Detail in Photoshop, aiming to speed up creative workflows.
Apple takes on Meta with new open-source AI model — here’s why it matters – Apple has released a new open-source AI model with 7B parameters, aiming to contribute to the wider AI ecosystem and provide a truly open-source model for researchers and companies to use and adapt.
Google gives free Gemini users access to its faster, lighter 1.5 Flash AI model – Google is enhancing its Gemini AI with the faster and more efficient 1.5 Flash model, expanding its capabilities for users and making it more accessible across various platforms and languages.
Stability AI Open-Sources Stable Audio Open: An Audio Generation Model with Variable-Length (up to 47s) Stereo Audio at 44.1kHz from Text Prompts – Open-weight text-to-audio model with Creative Commons data offers high-quality audio synthesis, ethical data use, and openness, setting new standards for the industry.
Business
Self-driving car startups Pony.ai and WeRide ready to go public – Self-driving car startups Pony.ai and WeRide are preparing for initial public offerings in the US, amid declining interest in autonomous vehicle companies in the stock market.
AI startups raised $41.5 billion worldwide in five years – AI startups have raised $41.5 billion worldwide in the past five years, surpassing other industries and indicating a significant role for AI in the future development and modernization of various sectors.
Elon Musk will ‘discuss’ Tesla investing $5 billion in his private AI company – Elon Musk plans to discuss investing $5 billion in his private AI company xAI with Tesla, despite facing a lawsuit for breach of fiduciary duty and potential pushback from the board.
AMD says its new laptop chips can beat Apple — but still has to prove it – AMD claims its new laptop chips can outperform Apple’s MacBook and Qualcomm’s Snapdragon, but has yet to provide concrete evidence to support these claims.
Tech’s splurge on AI chips has companies in ‘arms race’ that’s forcing more spending – He’s not the only one expressing that sentiment. On Alphabet’s earnings call on Wednesday, CEO Sundar Pichai said his company may well be spending too much money on AI infrastructure, which largely consists of Nvidia’s graphics processing units (GPUs). But he sees little choice.
OpenAI could be on the brink of bankruptcy in under 12 months, with projections of $5 billion in losses – OpenAI faces potential bankruptcy with projected $5 billion losses due to high operational costs and insufficient revenue from its AI ventures, despite its significant role in the AI landscape.
Research
AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? – Web agents struggle to perform realistic and time-consuming tasks on the open web, as demonstrated by the limitations exposed in a new benchmark called AssistantBench, with current systems failing to reach high accuracy.
AI achieves silver-medal standard solving International Mathematical Olympiad problems – AI systems AlphaProof and AlphaGeometry 2 achieve silver-medal standard by solving four out of six problems from the International Mathematical Olympiad, demonstrating advanced mathematical reasoning capabilities.
MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens – Introducing MINT-1T, the largest open-source multimodal dataset to date, featuring one trillion text tokens and three billion images, and rivaling the performance of models trained on the previous leading dataset.
DeepMind makes big jump toward interpreting LLMs with sparse autoencoders – DeepMind introduces JumpReLU SAE, a new architecture that improves the performance and interpretability of sparse autoencoders for large language models, providing a more accurate and efficient way to understand and steer LLM behavior.
MovieDreamer: Hierarchical Generation for Coherent Long Visual Sequence – A novel hierarchical framework called MovieDreamer integrates autoregressive models with diffusion-based rendering to pioneer long-duration video generation with intricate plot progressions and high visual fidelity, achieving superior visual and narrative quality and extending the duration of generated content significantly beyond current capabilities.
Learning to Manipulate Anywhere: A Visual Generalizable Framework For Reinforcement Learning – A visual generalizable framework for reinforcement learning, called Maniwhere, enables robots to generalize across multiple visual disturbance types and demonstrate strong visual generalization and sim2real transfer abilities across different hardware platforms.
Shape of Motion: 4D Reconstruction from a Single Video – A new method for reconstructing 4D motion from single videos is introduced, utilizing a compact set of SE3 motion bases and data-driven priors to achieve state-of-the-art performance.
AI models collapse when trained on recursively generated data – AI models trained on recursively generated data, such as language models, are susceptible to model collapse, leading to degraded performance and the introduction of errors in the generated data.
OpenDevin: An Open Platform for AI Software Developers as Generalist Agents – OpenDevin is an open platform for developing powerful and flexible AI agents that interact with the world in ways similar to human developers, allowing for safe interaction with sandboxed environments and coordination between multiple agents.
Concerns
The Backlash Against AI Scraping Is Real and Measurable – AI companies are facing a growing backlash from website owners who are blocking their scraper bots, leading to concerns about the availability of data for AI training and potential limitations on the development of generative AI models.
Video game performers will go on strike over artificial intelligence concerns – Video game performers are going on strike due to concerns over artificial intelligence protections and the definition of who constitutes a “performer” in the industry.
Elon Musk’s X under pressure from regulators over data harvesting for Grok AI – Elon Musk’s X platform is under pressure from data regulators after it emerged that users are consenting to their posts being used to build artificial intelligence systems via a default setting on the app.
Anthropic’s crawler is ignoring websites’ anti-AI scraping policies – Anthropic’s web crawler, ClaudeBot, has been aggressively scraping websites like iFixit, violating their Terms of Use and causing strain on their resources, prompting the affected sites to take action.
Policy
US lawmakers send a letter to OpenAI requesting government access – US lawmakers send a letter to OpenAI requesting government access for pre-deployment testing, review, analysis, and assessment of its next foundation model, amid whistleblower reports of lax safety standards and retaliation against employees.
Meta needs updated rules for sexually explicit deepfakes, Oversight Board says – Meta’s Oversight Board is urging the company to update its rules around sexually explicit deepfakes, as it found the current language to be outdated and may make it more difficult for users to report AI-made explicit images.
FTC is investigating how companies are using AI to base pricing on consumer behavior – FTC investigates how companies use AI to implement surveillance pricing based on consumer behavior and personal data, seeking information from eight major companies.
Source: Read MoreÂ