LLM acceleration
Summary Methods There are two main methods to accelerate LLMs: low-rank (reduce the dimension of matrices) and block (compute matrices block by block), plus other tricks. Papers already read: 9. Refere...
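A minimal sketch of the low-rank idea mentioned above (my own illustration, not from the post): approximate a large weight matrix W with two thin factors A and B, so a matrix-vector product costs roughly 2*d*r operations instead of d*d. The dimensions and rank below are assumed values chosen for the example.

```python
import numpy as np

d, r = 1024, 16                      # full dimension and an assumed low rank
rng = np.random.default_rng(0)

A = rng.standard_normal((d, r))      # thin factor, d x r
B = rng.standard_normal((r, d))      # thin factor, r x d
x = rng.standard_normal(d)

# Full path: materialize W = A @ B and multiply (O(d*d) work).
W = A @ B
y_full = W @ x

# Low-rank path: apply B then A (O(d*r) work), same result up to float error.
y_lowrank = A @ (B @ x)

print(np.allclose(y_full, y_lowrank))   # True
```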
Collect, summarize, and adjust material from the multiple sources in the References to produce the following tutorial. How to learn: quickly identify what the foundational knowledge is, then build a personal curriculum to...
Product LLM acceleration: almost finished it. Xiaohongshu: start the small business. Reading lazy-leadership, llm-server. ShortMax: fills the space between TikTok and film. HARO shut down: ty...
Product LocalPictureCompress: spent a whole day building LocalPictureCompress; really enjoyed the moment when I published it. Trying AI code assistants: continue, an open-source product that supports OpenA...
Product New ideas: computer use by local models; polish anything by local models. YouTube: uploaded five videos this week. Reading: Google build-AI challenge, OpenAI ask me anything, anthro...
Product LLM acceleration: read one paper, Flash-Attention, which computes attention by blocks (a small sketch follows below). YouTube: uploaded five videos this week and started trying Codeforces problems. Codeforces problems always c...
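A hedged sketch of the "compute attention by blocks" idea from the Flash-Attention paper: stream over key/value blocks with an online softmax instead of materializing the full N x N score matrix. This is my own illustrative NumPy version, not the paper's fused CUDA kernel; shapes and the block size are assumptions.

```python
import numpy as np

def blockwise_attention(Q, K, V, block=64):
    n, d = Q.shape
    scale = 1.0 / np.sqrt(d)
    out = np.zeros_like(Q)
    m = np.full(n, -np.inf)   # running max score per query row
    l = np.zeros(n)           # running softmax denominator per query row
    for start in range(0, K.shape[0], block):
        Kb, Vb = K[start:start + block], V[start:start + block]
        S = (Q @ Kb.T) * scale              # scores against this block only
        m_new = np.maximum(m, S.max(axis=1))
        P = np.exp(S - m_new[:, None])      # block-local softmax numerators
        correction = np.exp(m - m_new)      # rescale previous partial results
        l = l * correction + P.sum(axis=1)
        out = out * correction[:, None] + P @ Vb
        m = m_new
    return out / l[:, None]

# Check against naive attention that builds the full score matrix.
rng = np.random.default_rng(0)
Q, K, V = (rng.standard_normal((128, 32)) for _ in range(3))
S = Q @ K.T / np.sqrt(32)
P = np.exp(S - S.max(axis=1, keepdims=True))
ref = (P / P.sum(axis=1, keepdims=True)) @ V
print(np.allclose(blockwise_attention(Q, K, V), ref))  # True
```

The point of the block-wise form is that only a small score tile ever needs to live in fast memory, which is what makes the fused kernel avoid memory-bound behavior.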
This is an unexpected post written in Chinese, for the reasons below. Timeline: 2023.12 applied to schools; 2024.4 received the offer; 2024.5 submitted the visa application; 2024.8 applied to defer enrollment; 2024.10 opr (visa granted). Visa granted: from submitting the application in May, I was hoping for approval every day, especially in August as the semester approached, when I checked every day how much flight prices had risen, thinking it might come through the very next day. That lasted until I finally submitted the deferral application. At the end of August, I finally passed the IELTS, ...
Background There are two common kinds of bound that limit the speed of training in deep learning. Memory-bound: time spent on memory access is the bottleneck. Computation-bound: time spent o...
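A rough back-of-the-envelope sketch of how to tell the two apart (the hardware numbers below are my own assumed figures, not from the post): an operation is memory-bound when its arithmetic intensity (FLOPs per byte moved) falls below the hardware's FLOPs-to-bandwidth ratio, and compute-bound above it.

```python
def arithmetic_intensity_matmul(m, n, k, bytes_per_el=2):
    """FLOPs per byte for an (m x k) @ (k x n) matmul in fp16."""
    flops = 2 * m * n * k                              # multiply-adds
    bytes_moved = (m * k + k * n + m * n) * bytes_per_el
    return flops / bytes_moved

# Hypothetical accelerator: 300e12 FLOP/s, 1.5e12 B/s => ridge point ~200.
ridge = 300e12 / 1.5e12

for shape in [(4096, 4096, 4096),   # large GEMM (training): compute-bound
              (1, 4096, 4096)]:     # matrix-vector (decoding): memory-bound
    ai = arithmetic_intensity_matmul(*shape)
    kind = "compute-bound" if ai > ridge else "memory-bound"
    print(shape, round(ai, 1), kind)
```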
Product LLM acceleration: Matrix Multiplication: read more. LoRA: started reading the paper. YouTube: uploaded four videos this week and gained 5 subscribers. Blog: updated the current blog to jekyll-theme-chi...
Introduction Inspiration: the change in weights during model adaptation has a low "intrinsic rank". Description: train small matrices A and B during fine-tuning, adding A * B to the weight W, which sign...
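A minimal sketch of the description above: keep W frozen and learn only the small factors A and B, so the adapted layer computes W x + (A @ B) x. The shapes, rank, and zero initialization of B below are my own illustrative choices, not taken from the post.

```python
import numpy as np

d_out, d_in, r = 768, 768, 8
rng = np.random.default_rng(0)

W = rng.standard_normal((d_out, d_in))       # frozen pretrained weight
A = rng.standard_normal((d_out, r)) * 0.01   # trainable thin factor, d_out x r
B = np.zeros((r, d_in))                      # trainable thin factor, starts at
                                             # zero so the update A @ B is 0 at init

def lora_forward(x):
    # Original path plus the low-rank correction; only A and B would receive
    # gradients during fine-tuning.
    return W @ x + A @ (B @ x)

x = rng.standard_normal(d_in)
# At initialization the adapter is a no-op; merging for inference is W + A @ B.
print(np.allclose(lora_forward(x), W @ x))   # True
```

Because only A and B (2 * d * r parameters) are trained instead of the full d * d weight, the number of trainable parameters drops sharply, which is the significance the sentence above points to.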