
Exponential-Min and Gumbel-Max
Exponential-min and Gumbel-max tricks for reformulating sampling from a discrete distribution as argmin and argmax, making the sampling operation differentiable.
Exponential-min and Gumbel-max tricks for reformulating sampling from a discrete distribution as argmin and argmax, making the sampling operation differentiable.
This note provides a high-level summary of the progress in large language models (LLMs) covering major milestones from Transformers to ChatGPT. The note serves as a fast-paced recap for readers to catch up on this field quickly.
A quick walk-through of Expectation-Maximization (EM) algorithm and its cousins.
This is the first post of hopefully a series of post walking through diffusion models. This post will introduce the foundations, focusing on two foundational papers, that many other papers built upon.
There has been a lot of confusing information about dopamine. I finally found a literature review-style article, and here is what I learned.
nanoGPT repo reading notes
This is a quick note to discuss a few topics below related to building LLM-powered products and applications, such as how to let LLM use tools and become autonomous agents, how to incorporate domain adaptation, and the production hurdles.
In this note, we'll take a look at how Auto-GPT work and discuss LLM's ability to do explicit reasoning and to become an autonomous agent. We'll touch upon a few related works such as WebGPT, Toolformer, and Langchain.
In October 2021, we spent two weeks traveling to various cities in Italy, including Rome, Cinque Terre, Florence, Tuscany, and Venice. This was our first trip to Italy, and we have documented our journey with a report and photos.
Trip report (itinerary and photos) from our recent trip to southern Utah (Zion, Arches, Canyonlands and Bryce).
This page is a high-level summary / notes of various recent results in language modeling with little explanations
A list of starter resources for Natural Language Processing (NLP), mostly with deep learning.
A short list of interview preparation resources for Data Scientists, Machine Learning Engineers, Machine Learning Scientists, Quant Developers and Quant Researchers.
A literature survey of recent papers on Neural Variational Inference (NVI) and its application in topic modeling.
A high-level summary of various generative models including Variational Autoencoders (VAE), Generative Adverserial Networks (GAN), and their notable extentions and generalizations, such as f-GAN, Adversarial Variational Bayes (AVB), Wasserstein GAN, Wasserstein Auto-Encoder (WAE), Cramer GAN and etc
How to add a table of contents in Ghost without editing the site template
EmailOctopus form is a script tag, this post shows how to make that work with React (using useEffect and useRef).
A short note about the evolution from React to the need of Redux and some supporting tool chain for it.
Building Next.js app with Firebase authentication on the client-side, as well as using it on the server-side with a middleware pattern similar to Express.js.
My plan and progress updates on learning web frontend development more or less from scratch. Will be semi-regularly updated.
My takeaways from State of JS 2020 survey.
Automatically add preview / teaser content for Member-Only posts in Ghost.
This is a demo for "How to Enable Preview for Member-Only Content in Ghost"
Local development setup for Ghost themes.
Making featured posts show up first in Ghost Casper theme (instead of the default reverse chronological order).
I recently switched to Ghost to host my blog, Here are some editing tips as I learn to use this platform.
My onboarding experience with Ghost and some wishlist items for future improvements.
End of posts • 49 posts