Rising on arXiv - 2025-05-30

2025-06-02 RisingOnArXiv

Ten of the ideas, topics, and commonplaces that have been gaining steam on arXiv during the last few months (explainer)

1. Claude 3.7 Sonnet: The top 3 terms in the list were AI models. I'm keeping the first one as a token (pun not intended) of intellectual honesty but, come on.

Some recent articles:

2. Solar Terrestrial Relations Observatory: Solar physics using data from the STEREO (alas, no longer appropriately named) mission.

Some recent articles:

3. Large Deviation Properties: The Wikipedia page is clearer than I could be, and includes this fantastic quote:

Any large deviation is done in the least unlikely of all the unlikely ways! — Frank den Hollander, Large Deviations, p. 10

Some recent articles:

4. Mu-SHROOM: A shared task for people working in the problem of hallucination detection in generative AI (the general *SHROOM* naming convention is a good pun).

Some recent articles:

5. Open-Source Solver: Probably a random spike in how often the term is used, but it's a good idea to browse those papers. The most powerful trick in computing — in a real sense the whole trick of computing — isn't "get more data" but "translate your problem into something we already have a good compiler or solver for."

Some recent articles:

6. Humanoid Locomotion: The topic gains renewed interest every now and then. There's been a lot of progress! Still, we don't quite have the software or hardware for it yet. Don't let anybody tell you that anything in robotics is easy or "just needs AI"; the moment you leave a controlled laboratory/factory environment you're literally in a world of trouble.

Some recent articles:

7. Kling: Alright, fine, another model, this one for video generation. Glancing at the abstracts, the papers seem to use it as one of various reference models for benchmarks, to test tools, etc.

Some recent articles:

8. GUI Agents: Four terms above we referred to ongoing efforts to try and figure out how to detect when LLMs hallucinate, something they do with alarming regularity. Here's a set of papers on the increasingly relevant (read: proposed as the technology's killer app) task of asking them to do things for you by tapping on apps, filling up forms, etc.

Some recent articles:

9. Textual Semantics: Not unrelated to the issues and use cases above.

Some recent articles:

10. Spatial Reasoning: You know what, I give up. Maybe I'll just modify my tool to filter out the whole LLM field next time.

Some recent articles:

Thank you to arXiv for use of its open access interoperability.