Craft & Insight

A space for thoughts and insights on technology, engineering, and creation.

Recent Articles

How Do GPUs Perform Machine Learning Computations?

Explore the principles of hardware acceleration from Python code to GPU transistors through JAX and CUDA.

2025-12-05

Pre-training Decoder-based Tiny LLM with JAX and TPU

We dissect the entire process from raw text data being read from disk, tokenized, and reborn as meaningful sentences through hardware called TPU. Let's implement the design of the latest Llama model directly with JAX and transform from a user of the model to a designer of the model.

LLMJAXTPU

2025-12-02

Deploying BERT on Vertex AI

A fully working, end-to-end guide on how to train, fine-tune, and deploy a custom BERT-based text classification model on Google Cloud Vertex AI.

MLOpsVertex AIOnline Prediction

2024-05-22

Browse by Tag

JAX GPU Machine Learning CUDA XLA Hardware