Machine Learning
-
Attention Mechanisms
May 11, 2023
-
5 min read
-
Autodiff (Forward/Reverse)
May 11, 2023
-
5 min read
-
BatchNorm & LayerNorm
May 11, 2023
-
5 min read
-
Bias, Variance, & Overfitting
May 11, 2023
-
5 min read
-
Building Sharded Transformers
May 11, 2023
-
6 min read
-
Direct Preference Optimization
May 11, 2023
-
5 min read
-
Efficient Transformers
May 11, 2023
-
6 min read
-
Gradient Descent
May 11, 2023
-
5 min read
-
KV Caching & LLM Inference
May 11, 2023
-
6 min read
-
Linear Regression
May 11, 2023
-
4 min read