Outline of LLM acceleration
Summary Methods There are two main methods to acclerate LLM and another tricky methods low-rank: reduce dimension of matrix block: compute matrix with block trick: update model structure o...
Summary Methods There are two main methods to acclerate LLM and another tricky methods low-rank: reduce dimension of matrix block: compute matrix with block trick: update model structure o...
Collect, summary and adjust to get the following tutorial from multi-sources in Reference. How to learn Very quickly identify what the foundational knowledge is Build a personal curriculum to...
What I do keywods: Jan-May: work Travel: wuhan, huangshan IELTS Products: Chinese blog collection slack to discord IELTS speaking assistant PopTranslate ...
Product Triton Try to debug the Liger-Kernel but found it works well except bfloat type. Learning news about LLM translation and try to realise the whole one in this kind of project, like cut cros...
Product Full Cross Entropy Realising of fast cross entropy Based on previous knowledge on fast cross-entropy, realizing it by triton doesn’t spend too much. There are only 1e-7 difference between...
Product Implementation of Triton I tried to realise better cross entropy loss by Triton, but failed, still trying to find the reason… PopTranslate Update to 3.2, replace self-hosted translation se...
Product Triton-Puzzle This is a project tearch you how to use Triton, which is a new open-source alternative language for cuda. It’s always hard to understand and use it if you are not familiar wit...
Product AutoBuilder Almost spend half of this week on AutoBuider, finish it and use it to build another project, ID_confuser, which programmed totally by AutoBuilder and spend only ten minutes. Su...
Product AutoBuilder I want to build AutoBuilder which can build simple product totally by AI model, like static websites. Compared copilot or cursor, AutoBuilder and see the result base on the curr...
Product Visa Received a possible visa phone call, her tell me that my visa is not valid. All phenomenon show that it’s a fraud call except the offical-email, I cannot iamge that fraud group can st...