AI Innovations on the Horizon: Mistral's AI Coder, OpenAI's Moves, Perplexity Pages

Exploring the Latest in AI Tools, Deals, and Developments

đź‘‹ Hey and welcome to AI News Daily. 

Each week, we post AI Tools, Tutorials, News, and practical knowledge aimed at improving your life with AI. 

Read time: 6 minutes

The tech race is accelerating, and staying ahead is crucial. Today, we'll dive into the latest AI advancements, major deals, and groundbreaking releases. Let's get started.👇

Remember Devin, the AI software engineer introduced by Cognition a couple of months ago? Meet his new competitor, Codestral, developed by Mistral, a startup valued at $6 billion. 

source: Codestral

Codestral is designed to assist developers in writing and interacting with code.

With 22 billion parameters and training in over 80 programming languages, including Python, Java, C++, and JavaScript, Codestral performs coding functions, writes tests, completes incomplete code, and answers questions about the codebase. Mistral claims that interacting with AI will help developers improve their skills and reduce the risk of bugs.

Thanks for reading AI News Daily! Subscribe for free to receive new posts and support my work.

Codestral’s Superior Code Generation Performance

Mistral has benchmarked Codestral against other AI models for code generation, and it reportedly outperforms its competitors. The company used four benchmarks: HumanEval pass@1, MBPP sanitized pass@1 for Python code generation, CruxEval for Python output prediction, and RepoBench EM for long-range repository-level code completion.

Codestral is a 22 billion parameter open-weight model licensed under the Mistral AI Non-Production License, allowing use for research and testing. You can try Codestral on HuggingFace. While it's promising that Codestral excels in tests, real-world performance compared to human engineers remains to be seen. Independent comparisons are expected soon.

News Of The Week 🌍

Rapid Updates from OpenAI

OpenAI continues to roll out updates, so let's dive into a quick overview:

  • Custom GPTs, data analytics, vision, and memory features are now available to free ChatGPT users, enhancing one of the best chatbots.

  • OpenAI forms a safety and security committee to oversee critical decisions for all projects.

  • Introducing "OpenAI for Nonprofits," making AI tools more affordable for nonprofit organizations, including discounts on ChatGPT Team and Enterprise.

  • ChatGPT Edu Announcement: A new offering for universities and colleges to implement AI responsibly on campuses.

  • OpenAI is rebooting its robotics team, reflecting the growing investment in AI-powered robotics.

Scale AI has introduced its first leaderboards, ranking AI model performance in specific domains. This new ranking system evaluates advanced LLMs using private, curated datasets to assess their capabilities in common use cases, including coding, instruction following, math, and multilingualism.

Source: Scale AI

Understanding the Leaderboard’s Impact

The Scale AI leaderboard is designed to provide a more nuanced understanding of how different AI models perform in real-world scenarios. By using private, curated datasets, the leaderboard aims to offer a more accurate picture of an AI model's capabilities beyond standard benchmarks. This is particularly useful for developers and businesses looking to implement the most effective AI solutions for their specific needs. The inclusion of diverse domains such as coding, instruction following, and multilingualism highlights the versatility and applicability of these AI models across various sectors.

PricewaterhouseCoopers (PwC), a Big Four accounting firm, will deploy ChatGPT Enterprise to its 75,000 U.S. employees and 26,000 U.K. employees, totaling over 100,000 licenses. The financial terms of the deal remain undisclosed. With the constant focus on generative AI in business, this partnership underscores the growing reliance on AI in professional services.

The Implications for Professional Services

PwC's adoption of ChatGPT Enterprise is a significant step forward for AI integration in professional services. This move not only enhances PwC's capabilities but also sets a precedent for other firms in the industry. By leveraging ChatGPT Enterprise, PwC aims to improve efficiency, streamline workflows, and deliver more insightful analysis to clients. The partnership also reflects a broader trend of AI adoption across various sectors, highlighting the transformative potential of AI in enhancing productivity and innovation.

Perplexity AI has introduced Perplexity Pages, a feature allowing users to create reports, articles, or guides in a visually appealing format. Tools in the library section enable users to tailor content to their audience. Once generated, these pages can be shared via links or found through Google search. This new feature is valuable for both professional and personal use, enhancing the usability of one of my favorite chatbots.

Source: Perplexity AI

Enhancing User Engagement with Perplexity Pages

The addition of Perplexity Pages significantly boosts the platform's functionality. This feature allows users to transform their searches into polished, shareable documents, making it easier to communicate findings and insights. Whether for work or personal projects, Perplexity Pages offers a streamlined way to compile and present information. The ability to share these pages via links or have them indexed by Google further extends their reach and utility, making Perplexity AI an even more valuable tool for researchers, writers, and professionals.

The Pentagon has awarded Palantir a $480 million deal for its Maven Smart System prototype. The U.S. Army aims to use AI tools like Maven for its Combined Joint All-Domain Command and Control (CJADC2) construct, enhancing connectivity and operational efficiency across military platforms. The integration of AI into military operations signifies a major technological advancement.

AI's Role in Military Modernization

Palantir's $480 million contract with the U.S. Army underscores the strategic importance of AI in modern military operations. The Maven Smart System is designed to enhance the U.S. Army's CJADC2 construct, aiming to improve decision-making, operational efficiency, and collaboration across military platforms. This integration of AI into military infrastructure highlights the transformative potential of AI in national defense, providing advanced capabilities for surveillance, data analysis, and strategic planning.

OpenAI has signed new deals with Vox Media and The Atlantic, following previous partnerships with Reddit, Financial Times, and News Corp. This collaboration will allow OpenAI to use licensed content from these media companies to train GPT and share it with users. As AI continues to integrate with major media holdings, it has the potential to become a comprehensive and powerful information resource.

Expanding AI's Content Horizons

OpenAI's partnerships with prominent media companies signify a significant expansion of AI's content capabilities. By gaining access to licensed content from Vox Media, The Atlantic, and others, OpenAI can further enhance the training of its models, resulting in more accurate and comprehensive AI-generated content. This move positions ChatGPT as a robust information resource, capable of delivering high-quality content across various topics. For users, this means more reliable and diverse information at their fingertips, elevating the utility of AI in research, journalism, and everyday inquiries.

Snapshots 🏆

PS: I curate this AI newsletter every week for FREE, your support is what keeps me going. If you find value in your reading, share it with your friends by clicking the share button below!