Red Hat AI

Look inside Red Hat AI Inference on Amazon EKS to understand its core architectural components and Kubernetes resources.

Discover how to use EvalHub and OCI persistence to make your AI evaluation results immutable, content-addressable, and fully auditable.

Explore the mechanics of gradient synchronization in PyTorch distributed training, focusing on MPI primitives like All-Reduce and core techniques like pipeline parallelism, tensor parallelism, and sharded data parallelism.

Learn how speculative decoding can improve the performance of large language models (LLMs) in production by using a small, fast model to generate tokens speculatively and a large model to verify them.

Add automated AI evaluations to your CI/CD pipeline

William Caban Babilonia +2

June 11, 2026

Learn how to use the EvalHub CLI to automate AI evaluations in your CI/CD pipelines. Install the SDK, configure profiles, and set up a production gate.

Learn how llm-d routes each inference request to the GPU that already has the relevant data cached, cutting down on time-to-first-token, and doubling throughput without changing hardware. Discover how Red Hat's stack packages this neatly into a single Kubernetes resource.

Bring your own evaluation framework to EvalHub

William Caban Babilonia +2

June 9, 2026

Learn how to onboard a custom evaluation framework into EvalHub using one class, one method, and a container image. This guide covers the contract, data structures, and a complete minimal adapter.

Headed to WeAreDevelopers World Congress Europe 2026? Visit the Red Hat Developer booth on-site to speak to our expert technologists.

Understanding evaluation collections in EvalHub

William Caban Babilonia +2

June 4, 2026

Learn how to read an existing system collection, understand its threshold logic, and build your own collection that encodes your actual measurement strategy with thresholds that mean something.

Speculators v0.5.0 introduces DFlash support, enabling single-pass draft token generation with block diffusion for more efficient speculative decoding workflows. The release also adds unified online and offline training through vLLM’s native hidden states extraction system, improving training flexibility, version stability, and production readiness.

Red Hat and DeepLearning.AI have released a free hands-on course on the full LLM

Learn how to use Red Hat OpenShift AI's reusable components to build modular AI pipelines, speed up development, and focus on what differentiates your applications.

Evaluation-driven development with EvalHub

William Caban Babilonia +1

June 2, 2026

Learn how evaluation-driven development (EDD) turns AI optimization from an art into an engineering discipline with EvalHub.

Learn about LogAn, an open source tool designed to overcome the limitations of using LLMs to analyze massive volumes of production logs.

A Llama Stack-dependent backend, or any rapidly-evolving upstream project faces a version-drift problem. Explore our no-cost solution that provides early warnings.

Learn how an expert red-teamed an infrastructure using Red Hat AI, OpenClaw, and abliterated models on Red Hat OpenShift on IBM Cloud.

Learn how to transform a simple chatbot into an enterprise RAG application by applying metadata filtering, hybrid search, and neural reranking using the OGX framework in Red Hat OpenShift AI.

Learn how to prevent GPU waste and financial loss by implementing just-in-time (JIT) checkpointing with Kubeflow Training SDK on OpenShift AI.

Learn about the five primary structural challenges in enterprise AI evaluation and how EvalHub addresses them with a unified foundation for AI evaluation.

Learn how our team implemented CI/CD pipelines for the it-self-service-agent AI quickstart and the benefits of using CI/CD for agentic systems.

Learn how Red Hat AI can help address the security challenges of AI agents in production, from semantic malware to container escapes.

Scale agentic AI with Red Hat’s trusted software factory. Use Policy as Code and SBOMs to strengthen your development pipeline and manage software provenance.

Learn how Red Hat AI 3.4 uses EvalHub to orchestrate AI evaluations on Kubernetes. Scale frameworks like Garak and LightEval with built-in MLflow tracking.

Learn how Kagenti ADK, an open source toolkit, handles the complexities of managing production AI agents. It aligns with the Linux Foundation's Agent2Agent (A2A) protocol and provides a set of runtime services for easier deployment and operation.

Learn about our team's experience implementing a defense-in-depth safety architecture for AI agents using Llama Stack shields.

Red Hat AI

Red Hat AI Inference on Amazon EKS: Exploring the Kubernetes resources

Store immutable AI evaluation records with EvalHub and OCI

MPI-powered gradient synchronization in PyTorch distributed training

How speculative decoding delivers faster LLM inference

Add automated AI evaluations to your CI/CD pipeline

Intelligent inference scheduling with llm-d on Red Hat AI

Bring your own evaluation framework to EvalHub

Red Hat at WeAreDevelopers World Congress Europe 2026

Understanding evaluation collections in EvalHub

Speculators v0.5.0: DFlash support and online training

Learn to optimize, deploy, and benchmark LLMs with vLLM: A New Free Course

Build modular AI pipelines with OpenShift AI and reusable components

Evaluation-driven development with EvalHub

LogAn: Large-scale log analysis with small language models

How we built integration testing for fast-moving AI backend

Testing infrastructure red teaming with abliterated models

Build an enterprise RAG system with OGX

Preventing GPU waste: A guide to JIT checkpointing with Kubeflow Trainer on OpenShift AI

EvalHub: Because "looks good to me" isn't a benchmark

Deploy with confidence: Continuous integration and continuous delivery for agentic AI

Every layer counts: Defense in depth for AI agents with Red Hat AI

Trusted software factory: Building trust in the agentic AI era

How EvalHub manages two-layer Kubernetes control planes

How Kagenti ADK simplifies production AI agent management

Guardrails: Enterprise safety shields with Llama Stack

Platforms

Build

Quicklinks

Communicate

RED HAT DEVELOPER

Red Hat legal and privacy links

Red Hat legal and privacy links