All Stories

From Theory to Practice: Quantization and Dequantization Made Simple

Quantization transforms floating-point values (‘float32’) into lower-precision formats, such as 8-bit integers (‘int8’), while attempting to preserve the numerical range and accuracy of the original data. This reduces memory usage...

Breaking Down Vision Transformers: A Code-Driven Explanation

In this article, I’ll break down the layers of a ViT step by step with code snippets, and a dry run of each layer.

Turn 3D Gaussian Splat Files into Stunning Assets in Unity 6

This guide walks you through the process of loading splat files in Unity 6 using the Gaussian Splatting Playground Unity plugin. By the end of this tutorial, you’ll be able...

Intel GPU Scheduling: Exploring Matrix Addition with SYCL and PyTorch

If you’ve ever worked with GPUs, you know how crucial it is to understand how they manage workloads. Today, we’re diving into Intel’s approach to kernel scheduling, particularly when using...

HLSL Ray Tracing: Crafting Realistic Scenes in Unity, One Ray at a Time

Instead of just slapping textures on polygons, ray tracing lets us simulate how light interacts with surfaces, giving us those jaw-dropping reflections and realistic lighting effects that make you go,...

Harnessing Local Llama to Process Complete Projects: How I use AI for code suggestions and refactoring my Projects

We’ll walk through a Python script that leverages the LangChain framework to process a codebase, embed the data, and perform queries using LLMs.

The Magic of DPAS on Intel's XMX Engines: Cracking Why GPUs are Fast

When you think of multiplying matrices, you probably imagine a lot of numbers flying around and crunching in the background. But what if I told you there’s a super-efficient way...

Understanding Cholesky Decomposition with PyTorch

When dealing with symmetric and positive-definite matrices, Cholesky decomposition emerges as an indispensable tool in numerical computing. This matrix factorization technique not only simplifies complex computations but also finds applications...

Code, Run, Debug on AutoPilot: Let Your Local Llama Do All Your Heavy Lifting!

AutoGen isn’t just another framework; it marks a revolutionary leap in leveraging Large Language Models (LLMs). Built to empower developers, AutoGen excels in orchestrating multi-agent conversations, where these agents are...