How Reddit's ranking factors decide which mentions get cited in ChatGPT

A buyer asks ChatGPT to recommend an analytics tool. The model surfaces three names. All three come from the same Reddit thread, posted nine months ago, on a subreddit with about forty thousand members. Twenty other threads on roughly the same category question, on bigger subreddits, with more upvotes, get cited zero times.

When this comes up in client conversations, the question is always some version of: why that thread? It is not the biggest. It is not the newest. It is not the most upvoted. Something about it caused the AI model to reach for it instead of the alternatives. Understanding what that something is, in practice, is the difference between a Reddit program that gets cited and one that does not.

The short answer is that there are two layers of filtering involved. Reddit’s own ranking layer decides which threads it surfaces inside its product. The AI model’s citation-selection layer decides which of the surfaced threads to read from when generating an answer. Both layers use signals that overlap but are not identical, and threads that pass both filters are rare. Threads that pass both are the ones that get cited.

Layer one: Reddit’s ranking

Reddit’s ranking algorithm has gone through enough variations over the last decade that a long explanation gets out of date quickly. The current shape, as of 2026, weights a thread’s position on a subreddit’s “hot” feed using a combination of vote score, vote velocity, comment count, comment quality, post age, and a series of less-public quality signals that have been added over the past three years to suppress brigading and astroturfing.

For most B2B research threads, the visible signals that move ranking are:

Upvote score, weighted heavily but with diminishing returns. A thread with one hundred upvotes ranks much higher than a thread with twenty. A thread with one thousand upvotes ranks somewhat higher than one with five hundred. A thread with ten thousand upvotes ranks only slightly higher than one with five thousand. The curve flattens past a few hundred votes, which is why a popular thread in a medium-size subreddit can outrank a viral thread in a huge subreddit.

Comment count, weighted alongside vote score but with its own decay. Comments are read as signal of discussion quality, not just volume. A thread with twelve substantive replies outranks a thread with one hundred one-line replies, even when the latter has more upvotes. The exact weighting on substantive-versus-shallow is one of the less-public parts of the algorithm, but moderators and frequent posters have a reasonable sense of which threads “look real” to the ranking layer.

Post age, which decays the thread’s ranking power over the first seven days and then more slowly afterward. A thread is generally not surfaced beyond its own subreddit’s recent feed after about thirty days, although search and external links can keep it active much longer. For citation purposes, post age matters less than it does for in-product ranking.

Vote velocity, meaning how fast the upvotes arrive in the first few hours after posting. Fast early voting creates the “rising” effect that lands a thread on the front page of its subreddit. Slow steady voting still works for ranking but does not produce the visibility spike that drives the most reach.

The quality-signal layer, which is the part Reddit has been adding to suppress manipulation. Threads with vote patterns that look coordinated get downweighted. Threads with reply patterns that look manufactured get downweighted. Threads with mod-team intervention (sticky, lock, remove) get downweighted automatically if the action is negative.

A thread that scores well on all of those gets surfaced inside Reddit. A thread that does not get surfaced inside Reddit also does not get surfaced inside AI tools, almost always, because the AI tools are reading from the same data Reddit ranks, weighted in part by Reddit’s own ranking signals.

Layer two: AI citation selection

This is where things get more interesting and less public.

When ChatGPT (or Claude or Gemini or Perplexity) generates an answer to a category-research question, the model has access to a much larger pool of training data than what gets surfaced. It also has access to retrieval pipes that can pull in real-time content from licensed sources. Reddit is in those pipes by design, for the OpenAI and Google models at minimum, because of the licensing deals signed in 2024.

But not every Reddit thread the model could cite gets cited. The citation-selection layer applies its own quality filter, and the signals it uses are not the same signals Reddit uses for ranking.

The first AI signal is specificity. A thread that answers a specific buyer question with specific details about specific products gets cited more often than a thread that discusses the category broadly. The AI tools weight specificity heavily because the model is trying to generate a useful answer to a specific question, and the most useful source is one that already answers that question with specifics.

The second AI signal is signs-of-life discussion. A thread with multiple participants who add to each other’s points, who disagree productively, who update their initial takes after reading replies, gets weighted higher than a thread with one post and three nodding replies. The quality filter is reading the discussion shape, not just the vote count. A thread with twelve substantive back-and-forth comments looks more reliable to the citation layer than a thread with two hundred upvotes and no real discussion.

The third AI signal is source-context. The AI tools have learned which subreddits tend to produce reliable category discussion and which tend to produce noise. A thread on a vertically-specific subreddit with a strong moderator community gets weighted higher than a thread on a noisy generalist subreddit, even when the noisy subreddit has more users. This is the part of the citation layer that rewards middle-band subreddit selection over head-subreddit volume.

The fourth AI signal is recency-weighted authority. Citations skew toward threads from the last twelve to eighteen months, with the weighting decaying for older content unless the older content keeps accumulating links or external mentions. A two-year-old thread that is still getting linked to or quoted in other places retains citation weight much longer than a two-year-old thread that has gone quiet.

The fifth signal is anti-manipulation. Threads that pattern-match to undisclosed promotion get weighted down hard. The exact heuristics are not public, but the effect is real: a thread that looks like marketing gets cited less, even when it would otherwise qualify on the other signals. This is why FTC-noncompliant Reddit work tends to fail the AI-citation test even when the underlying account never gets banned.

Why a thread with twelve comments can outrank a thread with two hundred upvotes

Here is the practical pattern. Both filters care about quality more than they care about volume past a certain threshold. A thread with two hundred upvotes and no discussion looks like a meme thread or a popularity spike. A thread with one hundred upvotes and twelve substantive back-and-forth comments looks like a real conversation about a real question.

Reddit’s own ranking treats both layers as input. The AI citation layer treats discussion quality as more important than vote count once both threads clear a credibility threshold.

The implication is that brands trying to optimize for AI citation should not chase upvote counts. The thread you want to be in is the one with twelve substantive comments, not the one with two hundred drive-by upvotes. The thread you want to start is the one that invites discussion, not the one that asks for a poll.

This is also why pure paid-promotion approaches fail. A paid poster’s job is usually to get a high vote count. The vote count, in isolation, does not produce the citation outcome the brand wanted. The thread that gets cited is the one where the discussion was real.

Contribution timing

Once you know how both layers work, the question of when to post becomes operationally important.

For threads you are joining, the right time is when the discussion is active and substantive. A thread that has accumulated three or four thoughtful replies in the first six hours is in the window where adding a useful comment can compound. A thread that has been quiet for a week is mostly done from a citation standpoint, and contributing to it does not move the needle.

For threads you are starting, the right time depends on the subreddit’s posting pattern. Most subreddits have a window of higher-quality engagement during weekday business hours in the dominant time zone of their audience. Posting outside that window can still work, but the thread is more likely to get drive-by upvotes without the substantive discussion that produces citation weight.

The biggest mistake I see is teams posting on a schedule that suits the team rather than a schedule that suits the subreddit. The team posts during Friday afternoon when nobody is paying attention, the thread accumulates a few quiet upvotes, and the team treats it as “completed.” The thread did not get the discussion quality that would have produced the citation outcome. The work was done. The leverage was missed.

What this means for measurement

If you are running a Reddit program and your dashboard tracks upvote totals as the primary metric, you are measuring the wrong thing. The signal that correlates with AI citation is discussion quality, not vote volume. The signal that correlates with Google ranking on the same thread is similar: discussion quality and external link accumulation, not vote volume.

The metric that actually matters is something like “number of threads where the brand is mentioned in a substantive discussion.” That is a hand-counted metric. It does not automate well. It also predicts the outcome the brand cares about more reliably than any of the visible dashboard numbers. Most Reddit-marketing tools that try to automate the measurement end up tracking the easy signal, not the predictive one.

This is the reporting layer that takes me longer to assemble each month than any other piece of the work. It is also the layer that lets clients tell whether the program is moving the needle on AI citations and Google discussion rankings, which is the outcome they hired for.

Where to start

If your program is currently optimizing for upvotes or post volume, the right move is to switch the measurement to discussion-quality threads, then redirect the contribution work to threads that pass both layers of filtering. The ongoing discussion ops and AI citation tracking engagement is built around this kind of measurement as the standing reporting cadence, not a one-off audit.

If you are evaluating the citation surface for the first time and you want to know where your brand currently sits in both layers (Reddit-ranking visibility plus AI-citation surfacing), the AI SEO baseline includes both as part of the diagnostic.

The brands that get cited in ChatGPT, Claude, Gemini, and Perplexity are the ones that figured out the two-layer filter exists. The ones that did not figure it out are the ones still trying to buy upvotes.