Ten of the ideas, topics, and commonplaces that have been gaining steam on arXiv during the last few months (explainer)
Most (not all!) of them are related to AI; reasonable if unexciting. But each of these lists filters out the previous ones (it's not a leaderboard), so perhaps future installments will surface other things as these are cleared from the deck. On the other hand, if there's one thing the LLM industry has been great at, it's coming up with new biggest-things-ever at quite a quick pace.
1. GRPO: Group Relative Policy Optimization, an RL algorithm that, in the DeepSeek style, focuses not just on increasing performance but also on reducing training costs: instead of training a separate value (critic) model, it baselines each sampled output against the other outputs in its group. Used in DeepSeekMath (see paper).
Some recent articles:
- Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
- When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
- Effective Reinforcement Learning for Reasoning in Language Models
- On the Design of KL-Regularized Policy Gradient Algorithms for LLM Reasoning
- InfLVG: Reinforce Inference-Time Consistent Long Video Generation with GRPO
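The cost-saving idea is easy to sketch. Here is a minimal toy illustration of GRPO's group-relative advantage computation (names and numbers are my own, not from the DeepSeekMath implementation): sample a group of completions for one prompt, score them, and normalize each reward against the group's mean and standard deviation, so no learned critic is needed.

```python
# Toy sketch of GRPO's group-relative advantages (illustrative only).
# For one prompt, we sample G completions, score each with a scalar
# reward, and use the group's own statistics as the baseline instead
# of a learned value model.

from statistics import mean, pstdev

def group_relative_advantages(rewards, eps=1e-8):
    """Normalize each completion's reward against its group mean/std."""
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# One prompt, four sampled completions with rewards 1/0/0/1:
advs = group_relative_advantages([1.0, 0.0, 0.0, 1.0])
# Above-average completions get positive advantages, below-average
# ones get negative advantages; the policy gradient then shifts
# probability mass toward the above-average group members.
```

The point of the exercise: the baseline comes for free from the group, which is where the training-cost savings relative to critic-based PPO come from.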
2. Large Reasoning Models: The more LLMs are deployed in knowledge work, and not just for NLP tasks, the clearer it becomes to stakeholders that talking convincingly about something isn't the same as being able to think about it. So it's not surprising that there's a surge in research into figuring out how to make LLMs reason with reasonable reliability.
Some recent articles:
- When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
- TrimR: Verifier-based Training-Free Thinking Compression for Efficient Test-Time Scaling
- ReasoningShield: Content Safety Detection over Reasoning Traces of Large Reasoning Models
- Language Matters: How Do Multilingual Input and Reasoning Paths Affect Large Reasoning Models?
- Reasoning Meets Personalization: Unleashing the Potential of Large Reasoning Model for Personalized Generation
3. DeepSeek-R1: Still cooking!
Some recent articles:
- TrendFact: A Benchmark for Explainable Hotspot Perception in Fact-Checking with Natural Language Explanation
- RRTL: Red Teaming Reasoning Large Language Models in Tool Learning
- Mitigating Cyber Risk in the Age of Open-Weight LLMs: Policy Gaps and Technical Realities
- Foundation Models for Geospatial Reasoning: Assessing Capabilities of Large Language Models in Understanding Geometries and Topological Spatial Relations
- Amplify Adjacent Token Differences: Enhancing Long Chain-of-Thought Reasoning with Shift-FFN
4. Verifiable rewards: The term is picking up; look out for Reinforcement Learning with Verifiable Rewards (RLVR) as the relevant acronym. It's a slightly fancier way of saying "reward functions you can be certain about" (e.g., when you are training a system to produce answers to problems whose solutions you can check), which, it turns out, can in some cases allow for cheaper or more robust training. See here for a good explainer of how this fits into the DeepSeek-R1 training process.
Some recent articles:
- ManipLVM-R1: Reinforcement Learning for Reasoning in Embodied Manipulation with Large Vision-Language Models
- SHARP: Synthesizing High-quality Aligned Reasoning Problems for Large Reasoning Models Reinforcement Learning
- VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models
- Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning
- Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings
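In its simplest form a verifiable reward is just an exact check against a known answer. The following toy example (my own illustration, not taken from any specific RLVR paper) rewards a completion only if its boxed final answer matches the ground truth, so there is no learned reward model for the policy to game:

```python
# Toy verifiable reward for math-style answers (illustrative only).
# Reward is 1.0 iff the completion's \boxed{...} answer exactly
# matches the known ground truth; otherwise 0.0.

import re

def verifiable_reward(completion: str, ground_truth: str) -> float:
    """Return 1.0 iff the boxed final answer equals the ground truth."""
    match = re.search(r"\\boxed\{([^}]*)\}", completion)
    if match is None:
        return 0.0  # no parseable final answer, so no reward
    return 1.0 if match.group(1).strip() == ground_truth else 0.0

# e.g. verifiable_reward(r"... so the answer is \boxed{42}", "42")
```

Because the check is deterministic, the reward signal is exactly as trustworthy as the answer key, which is what makes this kind of training cheaper and harder to reward-hack than using a learned preference model.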
5. Small Language Models: Sometimes useful for research, sometimes the only thing you can build with the data you have. Sometimes they do work better.
Some recent articles:
- DEL-ToM: Inference-Time Scaling for Theory-of-Mind Reasoning via Dynamic Epistemic Logic
- Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model
- ReflectEvo: Improving Meta Introspection of Small LLMs by Learning Self-Reflection
- Smaller, Smarter, Closer: The Edge of Collaborative Generative AI
- Edge-First Language Model Inference: Models, Metrics, and Tradeoffs
6. SemEval-2025 task: From the workshop's page: SemEval is a series of international natural language processing (NLP) research workshops whose mission is to advance the current state of the art in semantic analysis and to help create high-quality annotated datasets in a range of increasingly challenging problems in natural language semantics. Each year's workshop features a collection of shared tasks in which computational semantic analysis systems designed by different teams are presented and compared.
Some recent articles:
- keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection
- Duluth at SemEval-2025 Task 7: TF-IDF with Optimized Vector Dimensions for Multilingual Fact-Checked Claim Retrieval
- JNLP at SemEval-2025 Task 11: Cross-Lingual Multi-Label Emotion Detection Using Generative Models
- NCL-UoR at SemEval-2025 Task 3: Detecting Multilingual Hallucination and Related Observable Overgeneration Text Spans with Modified RefChecker and Modified SelfCheckGPT
- UoR-NCL at SemEval-2025 Task 1: Using Generative LLMs and CLIP Models for Multilingual Multimodal Idiomaticity Representation
7. Intermediate reasoning steps: See Discourse on the Method of Rightly Conducting One's Reason and of Seeking Truth in the Sciences (Descartes, 1637).
Some recent articles:
- LeTS: Learning to Think-and-Search via Process-and-Outcome Reward Hybridization
- FutureSightDrive: Thinking Visually with Spatio-Temporal CoT for Autonomous Driving
- CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays
- CoT Information: Improved Sample Complexity under Chain-of-Thought Supervision
- NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning
8. Interpretability: Because we can't always tell what the AI is doing.
Some recent articles:
- Q-Insight: Understanding Image Quality via Visual Reinforcement Learning
- Physics-Guided Learning of Meteorological Dynamics for Weather Downscaling and Forecasting
- SurvUnc: A Meta-Model Based Uncertainty Quantification Framework for Survival Analysis
- Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models
- iLOCO: Distribution-Free Inference for Feature Interactions
9. DESI: As usual, a data release from one of the big astrophysics surveys comes with a host of interesting papers based on it. Here it's Data Release 1 of the Dark Energy Spectroscopic Instrument, tasked with minor questions of scientific detail such as understanding dark energy, which by the appropriate measure makes up around two-thirds of everything in the universe.
Some recent articles:
- A Morphological Model to Separate Resolved--unresolved Sources in the DESI Legacy Surveys: Application in the LS4 Alert Stream
- The Backup Program of the Dark Energy Spectroscopic Instrument's Milky Way Survey
- DESI Data Release 1: Stellar Catalogue
- Model-Independent Measurement of the Matter-Radiation Equality Scale in DESI 2024
- Confirming HSC strong lens candidates with DESI Spectroscopy. I. Project Overview
10. Vietnamese: There's been particular growth in LLM work related directly or indirectly to Vietnamese. I don't really know why (hints welcome), but it's always good news when language technologies expand their cultural scope.
Some recent articles:
- Position of Uncertainty: A Cross-Linguistic Study of Positional Bias in Large Language Models
- WriteViT: Handwritten Text Generation with Vision Transformer
- ViTextVQA: A Large-Scale Visual Question Answering Dataset for Evaluating Vietnamese Text Comprehension in Images
- Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model
- MultiMed: Multilingual Medical Speech Recognition via Attention Encoder Decoder