Blog

49 posts

Sort:

Exponential-Min and Gumbel-Max

Exponential-min and Gumbel-max tricks for reformulating sampling from a discrete distribution as argmin and argmax, making the sampling operation differentiable.

January 1, 2019•

From Transformers to ChatGPT

This note provides a high-level summary of the progress in large language models (LLMs) covering major milestones from Transformers to ChatGPT. The note serves as a fast-paced recap for readers to catch up on this field quickly.

December 29, 2022•

List of Advice

March 31, 2024•

Expectation-Maximization Algorithm in 10 minutes

A quick walk-through of Expectation-Maximization (EM) algorithm and its cousins.

December 15, 2017•

Diffusion Models

This is the first post of hopefully a series of post walking through diffusion models. This post will introduce the foundations, focusing on two foundational papers, that many other papers built upon.

June 6, 2023•

Life is short, a reading list

December 9, 2024•

Quoting Ezra

April 13, 2025•

a poem about human suffering

April 5, 2025•

Quoting Kazuo Ishiguro

April 5, 2025•

path to mastery

March 1, 2025•

Inner peace requires external validation

February 17, 2025•

Hume’s law

February 5, 2025•

The real luxuries in life

January 20, 2025•

2024 EOY Reflections

January 1, 2025•

Tweets from @AmuseChimp

December 22, 2024•

Dopamine

There has been a lot of confusing information about dopamine. I finally found a literature review-style article, and here is what I learned.

December 15, 2024•

Notes on Endurance

February 17, 2024•

London

January 10, 2024•

Quoting the BlackBerry Movie

January 10, 2024•

Quoting the history of Rome podcast

October 16, 2023•

Quoting Succession TV show

October 15, 2023•

Quoting Joshua Bach

October 4, 2023•

nanoGPT repo reading notes

May 18, 2023•

Building LLM-powered products

This is a quick note to discuss a few topics below related to building LLM-powered products and applications, such as how to let LLM use tools and become autonomous agents, how to incorporate domain adaptation, and the production hurdles.

April 23, 2023•

How does Auto-GPT work?

In this note, we'll take a look at how Auto-GPT work and discuss LLM's ability to do explicit reasoning and to become an autonomous agent. We'll touch upon a few related works such as WebGPT, Toolformer, and Langchain.

April 9, 2023•

Italy

In October 2021, we spent two weeks traveling to various cities in Italy, including Rome, Cinque Terre, Florence, Tuscany, and Venice. This was our first trip to Italy, and we have documented our journey with a report and photos.

November 10, 2021•

Utah

Trip report (itinerary and photos) from our recent trip to southern Utah (Zion, Arches, Canyonlands and Bryce).

August 15, 2021•

Recent Progress in Language Modeling

This page is a high-level summary / notes of various recent results in language modeling with little explanations

October 9, 2018•

NLP Starter Resources

A list of starter resources for Natural Language Processing (NLP), mostly with deep learning.

June 30, 2018•

Quantitative Tech Interview Preparation Guide

A short list of interview preparation resources for Data Scientists, Machine Learning Engineers, Machine Learning Scientists, Quant Developers and Quant Researchers.

May 5, 2018•

Recent Progress in Neural Variational Inference

A literature survey of recent papers on Neural Variational Inference (NVI) and its application in topic modeling.

March 8, 2018•

A Brief Survey of Generative Models

A high-level summary of various generative models including Variational Autoencoders (VAE), Generative Adverserial Networks (GAN), and their notable extentions and generalizations, such as f-GAN, Adversarial Variational Bayes (AVB), Wasserstein GAN, Wasserstein Auto-Encoder (WAE), Cramer GAN and etc

December 20, 2017•

How to measure HRV

April 27, 2025•

TIL: React Server Components

March 23, 2025•

Cello etudes difficulty table

February 16, 2025•

David Popper

February 15, 2025•

Hiragana and Katakana for Chinese speakers

December 6, 2024•

How to add a table of contents in Ghost without editing the site template

March 31, 2023•

How to Add EmailOctopus Form to a React App

EmailOctopus form is a script tag, this post shows how to make that work with React (using useEffect and useRef).

May 28, 2022•

React, Redux and Redux-Saga

A short note about the evolution from React to the need of Redux and some supporting tool chain for it.

December 3, 2021•

Next.js: Firebase Authentication and Middleware for API Routes

Building Next.js app with Firebase authentication on the client-side, as well as using it on the server-side with a middleware pattern similar to Express.js.

February 28, 2021•