How HuggingFace tokenizer gets executed through maze of code?I was very much intrigued about hugging face’s core concept of loading models and running it. It does everything under the hood and you…Sep 12Sep 12
Information Retrieval : Chapter 2In this we will talk about vocabulary and postings list. In previous post of chapter 1, I wrote notes about very fundamental process of…Aug 29Aug 29
Information Retrieval : Chapter 1I didn’t think of this, but this started as a series of short notes on information retrieval by Prof. Christopher D. Manning, Prabhakar…Aug 26Aug 26
Mixture of Models vs Dense Transformers?Recently meta has released their “Herd of LLama Models” and highlight of which is 405B parameter model contributing significantly to the…Aug 20Aug 20
Continuous PreTraining or FineTuning? Let’s resolve this —In the coliseum of machine learning, two gladiators face off in an epic battle: Continuous Pre-Training and Fine-Tuning. The crowd roars as…Aug 15Aug 15
Equation for AutoRegressive Models : As explained by Claude sprinkled with info by meThis equation is from the paper : Fine-Tuning Language Models from Human Preferences (https://arxiv.org/pdf/1909.08593)Jul 30Jul 30
ULMFiT Paper Notes: Paper that laid foundation of fine tuningUniversal Language Model Fine-tuning for Text Classification — Excellent paper by Jeremy Howard and Sebastian RuderJul 30Jul 30
Macbook can be a great book — Not just for StarbucksCompiling brief steps on maximising your reading potential and retrieving the crux on demand when you tend to revisit what you read. Below…Oct 7, 2022Oct 7, 2022
Excerpts from Naval’s Podcasts -Recently I stumbled upon Naval Ravikant’s podcast with Joe Rogan which gave me some insights into what life processes can be. He has tried…Oct 2, 2022Oct 2, 2022
Context Switching in Processors — Is there a thing called parallelism?Imagine a scenario where you have time travelled and this is the era of 1903. Ford is coming up with its production line to launch the…Sep 24, 2022Sep 24, 2022