Andrea Pellegrini details building a local LLM inference stack on the AMD Ryzen AI Max+ 395 (Strix Halo) with 128GB unified LPDDR5X memory shared between CPU and GPU.
The article covers running LLMs of up to 122B parameters using backends such as HIP, Vulkan, and ROCm, with benchmarks showing up to 884 tokens/s for Llama 2 7B Q4_0 on Vulkan and 270 tokens/s at pp512 for 120B models. Performance varies with the model, quantization, and build options such as hipBLASLt and WMMA, and the unified memory allows loading models with up to 142B weights.
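A rough back-of-the-envelope check shows why 128GB of unified memory can hold models in the 120B-142B range at 4-bit quantization. The sketch below uses llama.cpp's Q4_0 layout (32 weights per block: 16 bytes of packed 4-bit values plus a 2-byte fp16 scale, i.e. 4.5 bits per weight); the parameter counts are taken from the figures above, and the estimate ignores the KV cache and runtime overhead, which also claim part of the 128GB.

```python
def q4_0_bytes(n_params: int) -> int:
    # Q4_0 packs 32 weights into an 18-byte block:
    # 16 bytes of 4-bit values + 2-byte fp16 scale.
    return n_params * 18 // 32

for n_params in (7_000_000_000, 120_000_000_000, 142_000_000_000):
    gb = q4_0_bytes(n_params) / 1e9
    print(f"{n_params // 10**9}B params -> ~{gb:.1f} GB at Q4_0")
```

A 142B model comes out to roughly 80GB of weights, leaving headroom in the 128GB pool for the KV cache and the OS, which is what makes this class of model practical on Strix Halo in the first place.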