Home |

Featured

The Magic of DPAS on Intel's XMX Engines: Cracking Why GPUs are Fast

When you think of multiplying matrices, you probably imagine a lot of numbers flying around and crunching in the background. But what if I told...

Gurwinder
29 Sep 2024

The Simple Path to PyTorch Graphs: Dynamo and AOT Autograd Explained

Graph acquisition in PyTorch refers to the process of creating and managing the computational graph that represents a neural network’s operations. This graph is central...

Gurwinder
06 Apr 2024

All Stories

From Theory to Practice: Quantization and Dequantization Made Simple

Quantization transforms floating-point values (‘float32’) into lower-precision formats, such as 8-bit integers (‘int8’), while attempting to preserve the numerical range and accuracy of the original data. This reduces memory usage...

Gurwinder
05 Jan 2025

Breaking Down Vision Transformers: A Code-Driven Explanation

In this article, I’ll break down the layers of a ViT step by step with code snippets, and a dry run of each layer.

Gurwinder
25 Nov 2024

Turn 3D Gaussian Splat Files into Stunning Assets in Unity 6

This guide walks you through the process of loading splat files in Unity 6 using the Gaussian Splatting Playground Unity plugin. By the end of this tutorial, you’ll be able...

Gurwinder
11 Nov 2024

Intel GPU Scheduling: Exploring Matrix Addition with SYCL and PyTorch

If you’ve ever worked with GPUs, you know how crucial it is to understand how they manage workloads. Today, we’re diving into Intel’s approach to kernel scheduling, particularly when using...

Gurwinder
20 Oct 2024

HLSL Ray Tracing: Crafting Realistic Scenes in Unity, One Ray at a Time

Instead of just slapping textures on polygons, ray tracing lets us simulate how light interacts with surfaces, giving us those jaw-dropping reflections and realistic lighting effects that make you go,...

Gurwinder
11 Oct 2024

Harnessing Local Llama to Process Complete Projects: How I use AI for code suggestions and refactoring my Projects

We’ll walk through a Python script that leverages the LangChain framework to process a codebase, embed the data, and perform queries using LLMs.

Gurwinder
10 Oct 2024

The Magic of DPAS on Intel's XMX Engines: Cracking Why GPUs are Fast

When you think of multiplying matrices, you probably imagine a lot of numbers flying around and crunching in the background. But what if I told you there’s a super-efficient way...

Gurwinder
29 Sep 2024

Understanding Cholesky Decomposition with PyTorch

When dealing with symmetric and positive-definite matrices, Cholesky decomposition emerges as an indispensable tool in numerical computing. This matrix factorization technique not only simplifies complex computations but also finds applications...

Gurwinder
10 Aug 2024

Code, Run, Debug on AutoPilot: Let Your Local Llama Do All Your Heavy Lifting!

AutoGen isn’t just another framework; it marks a revolutionary leap in leveraging Large Language Models (LLMs). Built to empower developers, AutoGen excels in orchestrating multi-agent conversations, where these agents are...

Gurwinder
31 Jul 2024