
Papers I read recently about LLM applications

How much do LLMs memorize?

  • key definitions
    • unintended memorization: information the model stores about a specific dataset
    • generalization (intended memorization): information the model stores about the true data-generating process
    • measured via information entropy and mutual information (see the sketch after this list)
  • double descent appears at the point where unintended memorization gives way to generalization
  • GPT-style models store roughly 3.6 bits of data per parameter
  • per-parameter capacity in float32 is only about 9% higher than in float16
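
The bits-per-parameter numbers come from an information-theoretic view of memorization. A minimal sketch of the idea, assuming we already have per-example negative log-likelihoods from the trained model and from a reference model that only captures the general distribution (function names and numbers are hypothetical, not the paper's exact estimator):

```python
import math

def nats_to_bits(nll_nats: float) -> float:
    """Convert a negative log-likelihood from nats to bits."""
    return nll_nats / math.log(2)

def unintended_memorization_bits(nll_target: float, nll_reference: float) -> float:
    """Bits the target model 'saves' on one example compared to a reference
    model of the general distribution; positive values suggest the target
    stored example-specific information (a rough proxy)."""
    return max(0.0, nats_to_bits(nll_reference) - nats_to_bits(nll_target))

def bits_per_parameter(total_memorized_bits: float, n_params: int) -> float:
    """Capacity estimate in the spirit of the ~3.6 bits/parameter figure."""
    return total_memorized_bits / n_params

# Hypothetical numbers purely for illustration.
print(unintended_memorization_bits(nll_target=120.0, nll_reference=180.0))  # ~86.6 bits
```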

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

  • trade-off between pre-trained model size and test-time compute (inference length)
  • with test-time compute spent optimally, a smaller model can outperform one 14x its size
  • gains are largest on easy and medium problems; questions are judged easy, medium, or hard from the model's pass rate
  • two ways to spend extra test-time compute (see the best-of-N sketch after this list)
    • best-of-N: sample N outputs in parallel and pick the best one with a learned verifier or reward model
    • revision: sequentially revise the original response
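
A minimal best-of-N sketch; `generate` and `score` are hypothetical stand-ins for a sampling call and a learned verifier or reward model:

```python
from typing import Callable, List

def best_of_n(prompt: str,
              generate: Callable[[str], str],
              score: Callable[[str, str], float],
              n: int = 16) -> str:
    """Sample n candidate answers independently (in practice, in parallel)
    and return the one the verifier scores highest."""
    candidates: List[str] = [generate(prompt) for _ in range(n)]
    return max(candidates, key=lambda answer: score(prompt, answer))
```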

Prolonged Reinforcement Learning

  • temperature: increase the sampling temperature to avoid entropy collapse
  • decoupled clipping (a higher upper clip bound) to enlarge the exploration space
  • dynamic sampling: drop prompts whose samples are all correct or all incorrect, since they contribute no gradient signal
  • compute the loss at the token level instead of the sample level (DAPO); see the sketch after this list
  • KL regularization with periodic reference-model resets
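
A rough sketch of the decoupled-clip, token-level objective (assumed tensor shapes and epsilon values; not the released DAPO/ProRL code):

```python
import torch

def token_level_clipped_loss(logp_new: torch.Tensor,    # [batch, seq_len]
                             logp_old: torch.Tensor,    # [batch, seq_len]
                             advantages: torch.Tensor,  # [batch, seq_len]
                             mask: torch.Tensor,        # 1 for response tokens
                             eps_low: float = 0.2,
                             eps_high: float = 0.28) -> torch.Tensor:
    """Decoupled clipping: a larger upper bound (eps_high > eps_low) leaves
    more room to upweight low-probability tokens, i.e. more exploration.
    Token-level averaging: normalize by the total number of response tokens
    in the batch instead of averaging per sample first."""
    ratio = torch.exp(logp_new - logp_old)
    clipped = torch.clamp(ratio, 1.0 - eps_low, 1.0 + eps_high)
    per_token = -torch.minimum(ratio * advantages, clipped * advantages)
    return (per_token * mask).sum() / mask.sum()
```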

The Illusion of Thinking

  • on Tower of Hanoi-style puzzle tasks (see the sketch after this list)
    • on simple questions, reasoning models do worse than general models: they sometimes reach the correct answer while thinking but then talk themselves into a wrong one (overthinking)
    • on medium-difficulty questions, reasoning models do better
    • on hard questions, performance drops to zero
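
For context, the puzzle itself is trivial to solve programmatically, but the optimal solution length grows exponentially with the number of disks, which is roughly where the models fall off. A quick sketch:

```python
def hanoi_moves(n: int, src: str = "A", aux: str = "B", dst: str = "C") -> list:
    """Optimal move sequence for n disks: 2**n - 1 moves."""
    if n == 0:
        return []
    return (hanoi_moves(n - 1, src, dst, aux)   # move n-1 disks out of the way
            + [(src, dst)]                      # move the largest disk
            + hanoi_moves(n - 1, aux, src, dst))  # stack the n-1 disks on top

for n in (3, 7, 10):
    print(n, "disks:", len(hanoi_moves(n)), "moves")  # 7, 127, 1023
```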

Gemini 2.5 tech report

  • dataset
    • ensure dataset quality
    • filter and drop duplicates
  • post-training
    • combine verifiable rewards with model-based generative rewards (see the sketch after this list)
      • verifiable rewards: programmatic checks for tasks with known answers
      • model-based rewards: more sophisticated and scalable feedback signals where no programmatic check exists
    • updated learning-rate method to improve training stability
    • result: enables learning in more complex problem spaces
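
A toy sketch of mixing the two reward types; the exact-match check, the judge function, and the weighting are all assumptions for illustration, not the report's recipe:

```python
from typing import Callable

def combined_reward(prompt: str,
                    response: str,
                    reference_answer: str,
                    judge: Callable[[str, str], float],
                    w_verifiable: float = 0.7) -> float:
    """Verifiable reward: a hard 0/1 programmatic check (here, exact match).
    Model-based reward: a learned judge's score in [0, 1], useful where no
    programmatic check exists. The mixing weight is arbitrary."""
    verifiable = 1.0 if response.strip() == reference_answer.strip() else 0.0
    model_based = judge(prompt, response)
    return w_verifiable * verifiable + (1.0 - w_verifiable) * model_based
```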