Weekly-#12 summary of LLM acceleration

Product LLM acceleration Almost finish it xiaohongshu start the small business Reading lazy-leadership llm-server ShortMax: fill the space between TikTok and film. HARO shut down: ty...

Nov 19, 2024

Outline of LLM acceleration

Summary Methods There are two main methods to acclerate LLM and another tricky methods low-rank: reduce dimension of matrix block: compute matrix with block trick: update model structure o...

Nov 11, 2024 LLM

Weekly-#11 Copilot-type products

Product LocalPictureCompress Spent whole one day to build LocalPictureCompress, really enjoy the monent when I publish it. Try AI code assistants continue: open source product, supports OpenA...

Nov 10, 2024

Weekly-#10 Preparation for next journey

Product New ideas computer use by local models polish anything by local models YouTube Upload five videos this week Reading Google build-AI challenge OpenAI ask me anything anthro...

Nov 4, 2024

Weekly-#9 Startup of YouTube

Product LLM acceleration read one paper Flash-attention: compute attention by blocks YouTube Upload five videos this week and start to try codeforces problems. Codeforces problems always c...

Oct 27, 2024

Notes of flash-attention

Backgroud There are two common kinds of bound which limited the speed of training in deep learning. Memeory-bound: time spent on memeory-access is bottlenecked Computation-bound: time spent o...

Oct 23, 2024

How to learn knowledge in new fields?

Collect, summary and adjust to get the following tutorial from multi-sources in Reference. How to learn Very quickly identify what the foundational knowledge is Build a personal curriculum to...

Oct 23, 2024 General

Weekly-#8 Start Reading

Product LLM acceleration Matrix Multiplcation: Read more LoRA: start reading paper YouTube Upload four videos this week and receive 5 Subscribers Blog Update current blog to jekyll-theme-chi...

Oct 20, 2024

Notes of LoRA

Introduction Inspiration: the change in weights during model adaptation have a low “intrinsic rank” Description: Change small matrices A and B when fine-tune, adding A * B to weight W, which sign...