Craft & Insight
A space for thoughts and insights on technology, engineering, and creation.
Recent Articles
View AllHow Do GPUs Perform Machine Learning Computations?
Explore the principles of hardware acceleration from Python code to GPU transistors through JAX and CUDA.
GPUMachine LearningCUDA
2025-12-05
Read More
Pre-training Decoder-based Tiny LLM with JAX and TPU
We dissect the entire process from raw text data being read from disk, tokenized, and reborn as meaningful sentences through hardware called TPU. Let's implement the design of the latest Llama model directly with JAX and transform from a user of the model to a designer of the model.
LLMJAXTPU
2025-12-02
Read More
Deploying BERT on Vertex AI
A fully working, end-to-end guide on how to train, fine-tune, and deploy a custom BERT-based text classification model on Google Cloud Vertex AI.
MLOpsVertex AIOnline Prediction
2024-05-22
Read More