• Weekly AI News
  • Posts
  • X-AI Grok-2 šŸ’”, Hermes 3: First Fine-Tuned Llama 3.1 405B Modelāš™ļø , Google Vs. OpenAI (Voice Bot) šŸ”„

X-AI Grok-2 šŸ’”, Hermes 3: First Fine-Tuned Llama 3.1 405B Modelāš™ļø , Google Vs. OpenAI (Voice Bot) šŸ”„

Grok-2 Beta Release, Nous Research / Hermes-3-Llama-3.1-8B, Google Voice Bot Vs. ChatGPT Voice Bot, & More...


šŸ”œ Upcoming AI Events

September 18-19, 2024 The AI Conference 2024 | San Francisco, USA Register Now


🌐 Top AI Highlights

Introducing Grok-2: Redefining the Frontier of AI Innovation

The release of Grok-2, a major upgrade from Grok-1.5, marks a significant milestone in AI development, introducing advanced capabilities in chat, coding, and reasoning. Grok-2, along with its smaller variant Grok-2 mini, is currently in beta on the š• platform and will soon be available through an enterprise API, expanding access to its powerful AI features. This new model has outperformed leading competitors like Claude 3.5 Sonnet and GPT-4-Turbo on the LMSYS leaderboard, highlighting its superior language understanding and reasoning skills.

Grok-2 has been rigorously tested by AI Tutors, showcasing improvements in following instructions, providing accurate information, and handling complex tasks. It excels in reasoning with retrieved content, identifying missing information, and filtering irrelevant data. The model has also achieved impressive results in academic benchmarks across diverse domains, including reasoning, math, science, and vision-based tasks, making it a versatile tool for various applications.

The upcoming release of Grok-2 and Grok-2 mini through an enterprise API will provide developers with robust and secure access to these advanced AI models. On the š• platform, Grok-2 enhances user experience with improved text and vision understanding, while Grok-2 mini offers a balance between speed and accuracy. xAI continues to push the boundaries of AI development, with future updates expected to further advance Grok's capabilities, particularly in multimodal understanding and core reasoning functions.

Unveiling Hermes 3: The First Fine-Tuned Llama 3.1 405B Model is on Lambda’s Cloud

Hermes 3 is an advanced AI model built on Meta's Llama 3.1, designed for tasks like reasoning, creativity, and problem-solving. Available in three sizes—8B, 70B, and 405B parameters—it is highly adaptable and optimized for efficient performance on Lambda's cloud infrastructure. The model excels in creative writing, role-playing, and professional decision-making, featuring advanced capabilities like structured output and visual communication.

Architecture: Hermes 3 comes in three versions: 8B, 70B, and 405B parameters, all based on Llama 3.1. The model's sensitivity to system prompts is particularly strong in the 405B version, making it highly adaptable to different personas and tasks.

Agentic Abilities: The model includes advanced features like XML tagging, internal monologues, and diagram generation to enhance multi-step problem-solving and reasoning.

Tool Use: Hermes 3 can invoke tools using the Hermes Function Calling standard, making it suitable for tasks requiring external data or computations.

Supervised Fine-Tuning (SFT): Hermes 3 models were fine-tuned using the AdamW optimizer with specific learning rates optimized for different model sizes.

šŸ˜ Enjoying so far, share it with your friends!

Google Voice Bot Vs. ChatGPT Voice Bot

Google’s Gemini Live and OpenAI’s Advanced Voice Mode are leading advancements in AI voice assistants, each offering unique features. Gemini Live, launched at the 2024 Made by Google event, focuses on fluid, multitasking conversations with real-time voice adaptation and deep integration with Google services. OpenAI’s Advanced Voice Mode emphasizes natural, immersive conversations with native speech processing and emotional understanding, making it ideal for reflective interactions.

Gemini Live: Seamless Multitasking

Gemini Live allows users to multitask effortlessly, continuing conversations even when the phone is locked. Integrated with Google’s suite of apps, it provides a cohesive experience for managing tasks and will soon support multimodal input, making it a versatile tool for daily life.

Advanced Voice Mode: Deep, Reflective Conversations

OpenAI’s Advanced Voice Mode enhances conversational depth with direct speech processing and emotional nuance, perfect for thoughtful dialogue. While it faces challenges in conversational etiquette and tool access, its potential for future improvements makes it a strong contender for personal AI interactions.


šŸš€ Tech Glimpse of the Week

Anthropic’s new Claude prompt caching will save developers a fortune
Anthropic introduced a prompt caching feature on its Claude API, allowing developers to avoid repeating prompts by remembering context between API calls. Available in public beta for Claude 3.5 Sonnet and Claude 3 Haiku, this feature significantly reduces costs and improves speed by storing frequently used contexts. Cached prompts are cheaper, with a 10x cost-saving potential.

Google’s Gemini upgrades put the pressure on OpenAI’s GPT-5
Google's recent upgrades to its Gemini AI have significantly heightened the competition with OpenAI's anticipated GPT-5. At the Pixel 9 event, Google showcased new Gemini features, including advanced reasoning, planning, and memory capabilities, and the ability to perform complex tasks like generating research reports. These innovations put pressure on OpenAI to release its next major ChatGPT update soon. The competition is intensifying, with both companies pushing the boundaries of AI to offer more powerful and versatile tools for users.

China’s AI video rush is a wake-up call for the world
The article discusses China's rapid advancements in AI-generated video content, particularly by companies like ByteDance and Kuaishou. These platforms are leveraging AI to create and distribute highly engaging short videos, posing a challenge to traditional content creators. The rise of AI in content creation is reshaping the media landscape, with China leading in innovation. This shift raises questions about the future of creative industries and the role of AI in shaping cultural content globally.


Elon Musk’s new image generation tool hit by wave of outrage over pictures it produces
Elon Musk's AI startup, xAI, has introduced a tool that allows users to generate AI images directly on Twitter (now X), sparking outrage due to concerns over the potential for misuse. Critics fear this could exacerbate the spread of misinformation and deepfakes, raising ethical and security issues as powerful AI tools become more integrated into social media platforms. This controversy underscores the broader anxieties about AI's impact on public discourse.

Google’s upgraded AI image generator is now available
Google has released Imagen 3, its latest AI text-to-image generator, in the US, offering improved detail and fewer artifacts. Available through Google's AI Test Kitchen and Vertex AI platform, Imagen 3 allows users to create and edit images based on prompts, though it has guardrails to prevent generating images of public figures and copyrighted characters. Despite these restrictions, users have found ways to generate images resembling popular characters and company logos, contrasting with the more permissive Grok AI image generator on Elon Musk’s X platform.

šŸ‘„ Connect & Feedback!

šŸ‘‰ Join Us:

šŸ“§ Advertise In Weekly AI News:

šŸ“§ Contact directly at [email protected]

šŸ˜ Share with your friends!

Reply

or to participate.