- Weekly AI News
- Posts
- OpenAI o1🚀 ,Llama-Omni🦙, Mistral’s Pixtral 12B🖼️
OpenAI o1🚀 ,Llama-Omni🦙, Mistral’s Pixtral 12B🖼️
RPA Platforms, Multimodal AI Models, Real-Time Conversational Systems Shaping the Future of Technology, and More...
Welcome to another exciting edition of AI This Week! Today’s issue highlights a variety of groundbreaking advancements in the AI space, from OpenAI’s powerful reasoning model to the rise of multimodal systems like Mistral’s Pixtral. We also explore the AI tools that are shaking up the job market and meet humanoid robots entering industries like manufacturing. Whether it’s Oracle’s supercomputing breakthrough or Google’s efforts to reduce AI hallucinations, these stories reveal the rapid pace of AI’s evolution and its growing impact across industries.
🌐 Top AI Highlights
Introducing OpenAI o1-preview

OpenAI has introduced a new AI model series, called the o1 series, designed for complex problem-solving across areas like science, coding, and mathematics. These models, including o1-preview and o1-mini, are designed to spend more time "thinking" before responding, improving accuracy and reasoning capabilities. They excel in step-by-step problem-solving, making them suitable for advanced research, coding, and math computations.
The o1-preview model scored 83% on an International Mathematics Olympiad qualifying exam and reached the 89th percentile in Codeforces coding competitions, showcasing significant improvements over GPT-4o. Additionally, the o1 series includes enhanced safety features, using new training methods to reason about safety rules in real-time. The o1-preview model achieved an 84 out of 100 in tests designed to bypass safety rules, compared to GPT-4o’s 22.
These models are available to select users via ChatGPT and API, with limited message allowances. OpenAI plans to expand their functionality with features like web browsing and file uploads. The o1 series signifies a move towards AI models specialized for deeper cognitive tasks, combining precision, safety, and cost-effectiveness for professionals in research, software development, and more.
Mistral AI's Pixtral 12B

French AI startup Mistral has launched Pixtral 12B, a 12-billion-parameter multimodal model capable of processing both text and images. Built on their previous text model, Nemo 12B, Pixtral 12B adds visual capabilities, making it a strong competitor to OpenAI and Anthropic. It stands out with its ability to handle an unlimited number of high-resolution images and has advanced architecture for robust image analysis.
The model is freely available under an Apache 2.0 license on GitHub and Hugging Face, and will soon be testable through Mistral’s chatbot, Le Chat, and API platform, La Plateforme. Mistral’s open-access approach, along with partnerships with Microsoft, AWS, and Snowflake, positions the company as a growing AI force.
Following Pixtral 12B, Mistral continues to develop models like Codestral (for coding) and Mixtral (for math and science), democratizing AI for developers and organizations alike.
LLaMA-Omni: Open-Source AI Rivaling Siri and Alexa

Researchers at the Chinese Academy of Sciences have developed LLaMA-Omni, an AI model that enables real-time voice interactions with large language models (LLMs). Built on Meta's Llama 3.1 8B Instruct model, LLaMA-Omni processes spoken instructions and generates text and speech responses with low latency, as fast as 226 milliseconds, rivaling human conversation speed.
This voice-enabled AI system could transform industries like customer service, healthcare, and education by allowing for natural, responsive voice interactions. LLaMA-Omni, which can be trained in less than three days using just four GPUs, is accessible to startups and smaller companies, democratizing advanced voice AI development. This innovation could level the playing field in a space traditionally dominated by tech giants like Google and Amazon.
While the model currently supports only English and synthesized speech, it has far-reaching implications, especially as the code and model have been open-sourced for further development. LLaMA-Omni's efficient architecture could revolutionize voice-enabled AI across various sectors, offering opportunities for businesses and startups to gain a competitive edge in the rapidly evolving conversational AI market.
😍 Enjoying so far, share it with your friends!
🎓 AI Courses
What you will learn
Web Development Basics
Developing Sites for the Web
Introduction to HTML and CSS
Bringing Websites to Life with JavaScript
Website Testing and Deployment
Develop an Interactive Task List Web Page
Your Future in Web Development: The Job Landscape
⭐⭐⭐
What you will learn
Identify the purpose of web browsers
Describe markup languages and the challenges they can overcome
Explain the structure, functions, and evolution of HTML
Identify improvements that HTML5 introduced
Explain the features and functions of CSS
Explain the features and functions of JavaScript
Identify the ways JavaScript interacts with CSS and HTML
Identify phases in the software development lifecycle (SDLC)
Differentiate between waterfall and agile approaches to development
Highlight the scrum framework
⭐⭐⭐
🚀 RPA DEVELOPERS
LIST OF TOOLS
🤖 UiPath: Comprehensive RPA platform with AI capabilities.
☁️ Automation Anywhere: Cloud-native RPA platform with cognitive automation.
🏛️ Blue Prism: RPA platform with strong governance and AI integration.
🔄 Microsoft Power Automate: Automation platform with AI Builder.
🌐 IBM Robotic Process Automation: AI-driven RPA solutions.
🧩 Kofax RPA: Low-code platform with AI and ML.
📞 NICE Robotic Automation: AI-powered RPA for customer service.
🧠 WorkFusion: AI-driven platform for intelligent automation.
🚀 Tech Glimpse of the Week
OpenAI Starts On-Campus Recruiting
OpenAI has launched its on-campus recruiting program, starting at UC Berkeley, and plans to expand to other top universities. This initiative aims to attract top talent through direct recruitment and the Emerging Talent Tech Community, offering hands-on experience through its Residency Program.
Why it Matters: OpenAI’s push to recruit top university talent underscores its commitment to shaping the next generation of AI innovators.
Practical Insight: Students and recent graduates have new opportunities to enter the AI field, gaining competitive experience with one of the leading AI organizations.
Oracle Unveils Zettascale AI Supercomputer
Oracle has launched the world’s first zettascale AI supercomputer, powered by 131,000 NVIDIA Blackwell GPUs, delivering 2.4 zettaflops for AI workload training. This new development positions Oracle as a leader in high-performance AI computing.
Why it Matters: Oracle’s supercomputer significantly boosts AI training capabilities, enabling faster advancements in research and development.
Practical Insight: Companies requiring heavy computational power for AI can leverage Oracle’s supercomputer for faster model training and deployment.
AI Coding Assistants Replace Junior Developers
AI coding assistants, like GitHub Copilot, are automating tasks traditionally done by junior developers. As these tools become more capable, they reduce the need for entry-level coders, reshaping the software development landscape.
Why it Matters: AI coding assistants are streamlining routine tasks, which could lead to a reduced demand for junior developers.
Practical Insight: Organizations can increase development speed and reduce costs by integrating AI coding assistants, although they may need to rethink junior-level hiring strategies.
Face to Face with Figure’s New Humanoid Robot
Figure’s humanoid robot is designed for labor-intensive industries, offering human-like mobility and dexterity. The robot is set to fill workforce gaps in manufacturing, logistics, and service industries, using advanced AI to interact with its environment.
Why it Matters: Figure’s robot brings human-like capabilities to industries facing labor shortages, potentially transforming how tasks are performed in these sectors.
Practical Insight: Businesses in manufacturing and logistics can explore adopting humanoid robots to boost efficiency and address workforce challenges.
DataGemma: Using Real-World Data to Combat AI Hallucinations
Google’s DataGemma AI model aims to reduce hallucinations in LLMs by grounding them in real-world data from Data Commons. By using RIG and RAG techniques, DataGemma ensures more accurate, factual outputs in decision-making and research tasks.
Why it Matters: DataGemma helps make AI models more reliable, addressing one of the biggest challenges in LLMs—hallucinations.
Practical Insight: Organizations can use DataGemma to ensure more trustworthy AI outputs, especially in critical decision-making processes like research and data analysis.
From revolutionizing how businesses operate to altering the way we engage with AI systems, the developments in this newsletter demonstrate the limitless potential of the AI technology. But as we embrace these changes, the key question is: How can we make AI work for us, ethically and efficiently? Keep following as we uncover the ongoing transformations shaping the future!
👥 Connect & Feedback!
👉 Join Us:
📧 Advertise In Weekly AI News:
📧 Contact directly at [email protected]
😍 Share with your friends!

Reply