Detecting AI slop on Reddit in three easy steps

2026-06-14

We like reading stories on Reddit, sometimes. Since the advent of large language models, people have posted more and more AI-generated stories. Here is how to identify them.

This is a draft. Come back later for the final version.

Paper review: SSRL - Self-Search Reinforcement Learning

2025-11-04

Using LLMs is often intertwined with search engine calls, so it makes sense to teach LLMs how to make these calls during training. Search engine API can be expensive, so people came up with the idea of simulating search engine responses with a language model. It’s cheaper than the real thing if you are using a small self-hosted model for the purpose.

Stop the genocide

2025-05-19

The year is 2025. Israel, a part of the Western World, is performing genocide on Palestinians, slaughtering and starving them to death by tens of thousands. The Israelis are doing to Palestinians exactly what the Nazis did to the Jews. The oppressed became the oppressors. Never again turned out to be an empty slogan. Apparently it’s OK to Israelis if it’s them who do the mass killings.

Llama 4 smells bad

2025-04-17

Meta has distinguished itself positively by releasing three generations of Llama, a semi-open LLM with weights available if you ask nicely (and provide your full legal name, date of birth, and full organization name with all corporate identifiers). So no, it’s not open source. Anyway, on Saturday (!) May the 5th, Cinco de Mayo, Meta released Llama 4.

SEIZED BY DOGE

2025-04-01

SEIZED BY DOGE

Beware of Musk’s chatbot, Grok

2025-03-31

Elon Musk bought Twitter and used it to help elect Trump. He also controls a popular chatbot, which he might use to exert political influence when he needs to.

They are selling dollars for 69 cents now

2025-03-12

Back in October 2024 we posted about the US presidential election probabilities and if someone took the gamble, it backfired. To make up for it, here’s a new tip.

Paper review: X-Sample Contrastive Loss

2024-11-04

The paper, X-Sample Contrastive Loss: Improving Contrastive Learning with Sample Similarity Graphs, with Yann LeCun as the last author, is about similarity signals between samples, typically images. The point is to learn good representations (embeddings) by contrasting similar and dissimilar images.

They are selling a dollar coin flip for 36 cents

2024-10-22

Today the implied probability of Kamala Harris being elected as the president of the United States reached 36 percent on Polymarket. This is interesting because it’s the result of a concerted effort to manipulate the odds, and in wider context, to buy the election. While it is sad, it also presents an opportunity. Should you attempt to take some of the that money?

Large language models in 2024

2024-10-09

This is a high-level review of what’s been happening with large language models, to which we’ll also refer as the models, or chatbots. We focus on how expanding the context length allowed the models to rely more on data in prompts and less on the knowledge stored in their weights, resulting in fewer hallucinations.

← Older Contents

FastML

Machine learning made easy

Detecting AI slop on Reddit in three easy steps

Paper review: SSRL - Self-Search Reinforcement Learning

Stop the genocide

Llama 4 smells bad

SEIZED BY DOGE

Beware of Musk’s chatbot, Grok

They are selling dollars for 69 cents now

Paper review: X-Sample Contrastive Loss

They are selling a dollar coin flip for 36 cents

Large language models in 2024