Enhancing AI Coding Assistants with Context Using RAG and SEM-RAG
Basic AI coding assistants, while helpful, often fall short of delivering the most relevant and contextually accurate code suggestions due to their reliance on a general understanding of software...
Red Hat Podman ‘Lab’ Gets Developers Started on GenAI
Podman, Red Hat’s desktop tool for managing container pods, has been given an extended duty: providing developers a workspace to build generative AI-based applications. Unlike many tools for...
A Guide to Model Composition
Consider an AI-powered image recognition app designed to identify and classify wildlife photos. You upload a picture taken during a hike, and within moments, the app not only identifies the animal in...
Building Smarter Chatbots With Advanced Language Models
The development of chatbots is evolving rapidly, with new tools and frameworks making it easier and more efficient to build sophisticated systems. But current large language models (LLMs) suffer from...
Inference Is Table Stakes. That’s a Good Thing for Ampere
PARIS — Ampere, the maker of CPUs based on ARM architecture, is making its presence known, using inference as a big hook. AI training is a batch job workflow, but inference is crucial in AI-focused...
KPMG’s CEO Poll: Labor Markets, 4-Day Workweeks, and GenAI
CEOs are responding to tight labor markets in three ways, according to a recent survey from consulting firm KPMG: dropping the college degree requirement for some jobs; upskilling employees; using...
Rafay’s PaaS Now Supports GPU Workloads for AI/ML in the Cloud
Platform as a Service (PaaS) provider Rafay has extended its Kubernetes management platform to better support enterprise AI and ML workloads, with a focus on GPU resource management, democratizing...
Reviewing Code With GPT-4o, OpenAI’s New ‘Omni’ LLM
This week OpenAI gave us access to GPT-4o (“o” for “omni”), which aims to compete better in speech recognition and conversation. It’s almost certainly a stronger LLM, too. But can it do a code...
PyCon US: Simon Willison on Hacking LLMs for Fun and Profit
Simon Willison shared his thoughts about large language models at PyCon US. PITTSBURGH — Simon Willison, co-creator of the widely used Python Django framework, has focused his creative energies on...
How Adaptive Applications Unlock Innovation in a New AI Age
We stand on the verge of a generative AI (GenAI) revolution. Some 98% of organizations have specific GenAI goals for 2024 — accounting for nearly a third of digital modernization spend last year and...
Build an Advanced RAG Application Using MyScaleDB and LlamaIndex
Large language models (LLMs) have brought immense value with their ability to understand and generate human-like text. However, these models also come with notable challenges. They are trained...
Google Wants Developers to Build On-Device AI Applications
Today’s phones and PCs are equipped with new hardware to directly run AI on devices; and at Google I/O this year, Google encouraged coders to take advantage of it. The idea is to run large language...
RAG: Still Relevant in the Era of Long Context Models
Google recently released Gemini 1.5 Pro, a large language model boasting a mammoth one million token context window. This sparked a buzz in the AI community, with some dubbing it the “RAG killer.”...
Transparency and Community: An Open Source Vision for AI
Open source software is a part of everyone’s life. A task as seemingly mundane as sending an email owes its magic to the collaborative contributions of developers, including contributors to...
Microsoft Copilot for Azure: Managing Cloud Ops Through Chat
This week at the Build conference, Microsoft is announcing its usual range of new cloud services and enhancements to existing options, from a new Automatic service for Azure Kubernetes Service to...
What OpenAI CEO Sam Altman Really Expects In AI’s Future
Last week OpenAI launched GPT-4o, the latest voice-interactive version of the powerful chatbot. But just days before its launch, OpenAI CEO Sam Altman had shared some surprisingly candid thoughts in...
MS, GitHub Boost Copilots with Advanced Dev Features and Tools
Copilot automated assistants from Microsoft and GitHub are getting fresh attention as both companies update their Copilot tools with new features aimed at giving developers more ways to...
Why Latency and ‘Total Cost of Ownership’ Matter More in AI Apps
Developer-turned-CEO Lin Qiao foresees a new era in AI, where language models are fine-tuned based on an organization’s own specialized data. This will allow organizations to take...
Lessons From Kubernetes and the Cloud Should Steer the AI Revolution
We’ve seen this story before… Over the past decade, cloud computing and Kubernetes emerged as revolutionary forces by promising scalability, efficiency and operational flexibility. These innovations...
Nutanix Gives an AI Push to End Kubernetes-Adoption Issues
BARCELONA, Spain — Nutanix executives say its AI-assisted processes and tools for developers, administrators and CIOs will represent simplified platforms and tools for user organizations. But now...