It seems that a rather primitive method is still the de facto standard…
The easiest good default is:
- map negative, neutral, positive to -1, 0, +1
- then take the probability-weighted average
- for 3 classes, that simplifies to polarity = P(positive) - P(negative). (Statsmodels)
That is usually the best starting point because it is:
- bounded in [-1, 1]
- easy to explain
- sensitive to uncertainty
- naturally pulled toward 0 when the model is unsure or mostly neutral. (Statsmodels)
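Spelled out, the probability-weighted average over the anchors -1, 0, +1 collapses to the two-term formula. A quick sanity check in Python, with made-up probabilities:

```python
# Hypothetical class probabilities for one prediction.
p_neg, p_neu, p_pos = 0.20, 0.50, 0.30

# Probability-weighted average over the anchors -1, 0, +1.
expected = (-1) * p_neg + 0 * p_neu + (+1) * p_pos

# The neutral term contributes 0, so this is just P(pos) - P(neg).
assert abs(expected - (p_pos - p_neg)) < 1e-12
```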
What models actually output
Transformer sentiment models do not always directly output probabilities. In the standard Hugging Face setup, the model returns logits, which the docs describe as classification scores before SoftMax. The pipeline then turns those into postprocessed scores. (Hugging Face)
So in practice, the usual flow is:
- model outputs logits
- softmax converts them to class probabilities
- you collapse those probabilities into one polarity score. (Hugging Face)
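In code, collapsing a pipeline-style score list into one number can look like this sketch. The label strings here are assumptions: real checkpoints vary (some emit "LABEL_0", "LABEL_1", "LABEL_2"), so map your model's labels explicitly.

```python
def polarity_from_scores(scores):
    # scores: a list of {"label": ..., "score": ...} dicts, one per class,
    # as returned when a text-classification pipeline gives all class scores.
    by_label = {d["label"].lower(): d["score"] for d in scores}
    return by_label.get("positive", 0.0) - by_label.get("negative", 0.0)

scores = [
    {"label": "negative", "score": 0.10},
    {"label": "neutral",  "score": 0.25},
    {"label": "positive", "score": 0.65},
]
polarity_from_scores(scores)  # approx. 0.55
```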
The recommended formula
For a 3-class sentiment model, use:

polarity = P(pos) - P(neg)
That is the same as taking the expected value of the scale [-1, 0, +1]. Neutral does not disappear conceptually. It just has value 0, so it automatically shrinks the score toward the center. (Statsmodels)
Quick examples
neg=0.80, neu=0.10, pos=0.10 → polarity -0.70
neg=0.20, neu=0.60, pos=0.20 → polarity 0.00
neg=0.05, neu=0.15, pos=0.80 → polarity 0.75
That matches the intuition most people want. More positive mass pushes right. More negative mass pushes left. More neutral mass pulls inward. This follows directly from the ordered-label interpretation. (Statsmodels)
Why this works
This is just an expected-value calculation.
If the classes are ordered, you can assign each class a location on a sentiment line and take the weighted average using the class probabilities. For the common 3-class case, the simplest anchors are -1, 0, +1. (Statsmodels)
That said, there is an important warning: ordered labels do not automatically come with a built-in numeric distance. The statsmodels ordinal-model docs state that ordinal labels are ordered, but the labels have no numeric interpretation besides the ordering. So the move from labels to -1, 0, +1 is a modeling choice. It is usually a sensible one, but it is still a choice. (Statsmodels)
Is this “standard”?
Not in the sense of one official universal rule. There is no general transformer standard that says “all sentiment probabilities must be collapsed this exact way.” Hugging Face documents logits, probabilities, and task setup, but not one canonical polarity-collapse formula. (Hugging Face)
In practice, though, the probability-weighted ordered scale is the cleanest and most defensible approach. For 3-way sentiment, that means P(pos) - P(neg). (Statsmodels)
Binary case
If your model only has negative and positive, then:
- P(pos) - P(neg) still works
- and because the probabilities sum to 1, it is the same as 2 * P(pos) - 1
So the binary case is simpler. Most of the ambiguity comes from handling neutral, or more than three sentiment levels. (Hugging Face)
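The equivalence is easy to check numerically (the probability value is arbitrary):

```python
p_pos = 0.85          # arbitrary example probability
p_neg = 1.0 - p_pos   # the two binary classes sum to 1

# P(pos) - P(neg) equals 2 * P(pos) - 1 whenever the probabilities sum to 1.
assert abs((p_pos - p_neg) - (2 * p_pos - 1)) < 1e-12
```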
More than 3 classes
If your classes are ordered, like:
- very negative
- negative
- neutral
- positive
- very positive
then use the same idea with more anchors, for example -1, -0.5, 0, +0.5, +1, and take the weighted average. (Statsmodels)
This is usually the right extension. Just document your anchors, because equal spacing is an assumption, not something the labels guarantee by themselves. (Statsmodels)
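For example, with equally spaced anchors (a modeling choice, as noted above) and invented probabilities for the five classes:

```python
import numpy as np

# Hypothetical 5-class probabilities: very neg, neg, neutral, pos, very pos.
probs   = [0.05, 0.15, 0.30, 0.35, 0.15]
anchors = [-1.0, -0.5, 0.0, 0.5, 1.0]   # equal spacing is an assumption

# Probability-weighted average of the anchor positions.
polarity = float(np.dot(probs, anchors))  # approx. 0.20
```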
The biggest technical caveat: calibration
This part matters more than most people expect.
Even if P(pos) - P(neg) is the right formula, the final scalar is only as trustworthy as the underlying probabilities. Scikit-learn’s calibration guide explains that models can produce poor probability estimates, and that calibration methods are used to improve them. The current sklearn calibration tools support isotonic, sigmoid, and temperature scaling. (scikit-learn)
This means:
- a model can choose the right top label
- but still be overconfident or underconfident
- and then your polarity magnitude can look more precise than it really is. (scikit-learn)
For multi-class sentiment, temperature scaling is especially relevant because sklearn describes it as a natural way to obtain better calibrated multi-class probabilities, and its CalibratedClassifierCV docs explain how it applies softmax(logits / T). (scikit-learn)
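A minimal numpy sketch of the idea behind temperature scaling (the logits are invented, and in practice T is fitted on validation data rather than hand-picked):

```python
import numpy as np

def softmax(z):
    z = z - z.max()   # shift for numerical stability
    e = np.exp(z)
    return e / e.sum()

logits = np.array([-1.0, 0.5, 2.0])   # hypothetical: neg, neu, pos

# T > 1 flattens the distribution, so the polarity magnitude shrinks.
for T in (1.0, 3.0):
    p = softmax(logits / T)
    print(T, p[2] - p[0])
```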
What to do in practice
Good default
Use:
polarity = P(pos) - P(neg)
This is the best default for a normal 3-class sentiment model. It is easy to read, easy to explain, and usually matches intuition. (Statsmodels)
Better if the score matters a lot
If you will use the score for:
- thresholds
- rankings
- time-series monitoring
- downstream decision-making
then calibrate the probabilities on a held-out validation set first. Scikit-learn’s calibration docs explicitly recommend fitting calibration on data independent of the classifier training data. (scikit-learn)
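A hedged sklearn sketch of that workflow on synthetic data. The dataset, the logistic-regression stand-in, and the method="sigmoid" choice are all illustrative; swap in your own classifier and held-out split.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LogisticRegression
from sklearn.calibration import CalibratedClassifierCV

# Synthetic 3-class stand-in for neg / neu / pos.
X, y = make_classification(n_samples=3000, n_classes=3, n_informative=8,
                           random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

# Internal cross-validation fits each calibrator on folds the classifier
# did not train on, matching the "independent data" recommendation.
clf = CalibratedClassifierCV(LogisticRegression(max_iter=1000),
                             method="sigmoid", cv=3).fit(X_train, y_train)

proba = clf.predict_proba(X_test)     # columns follow clf.classes_
polarity = proba[:, 2] - proba[:, 0]  # assuming class order neg, neu, pos
```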
Best if you truly want a continuous target
If the real task is inherently scalar, Hugging Face supports regression-style sequence-classification settings as well. In other words, if you truly want one continuous polarity output, it can be cleaner to train a regression model instead of forcing a classifier into post-hoc scalar conversion. The model-output docs note that sequence-classification outputs can also be used for regression when num_labels == 1. (Hugging Face)
One alternative people use
Sometimes people use:
(P(pos) - P(neg)) / (P(pos) + P(neg))
This measures sentiment direction conditional on being non-neutral.
It can be useful, but it is not the best default. It tends to exaggerate tiny scraps of positive or negative evidence when most probability mass is neutral. That is why the simple P(pos) - P(neg) score is usually safer and easier to interpret. This is an inference from the formulas and the ordered-scale setup, rather than a separate official rule. (Statsmodels)
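The exaggeration is easy to see with a mostly-neutral example (numbers invented):

```python
p_neg, p_neu, p_pos = 0.01, 0.95, 0.04   # almost entirely neutral

default     = p_pos - p_neg                      # approx. 0.03: stays near 0
conditional = (p_pos - p_neg) / (p_pos + p_neg)  # approx. 0.60: looks strongly positive
```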
Simple code
```python
def polarity_from_probs(p_neg: float, p_neu: float, p_pos: float) -> float:
    # Expected value over the anchors -1, 0, +1; the neutral term is 0.
    return p_pos - p_neg
```
General ordered-class version:
```python
import numpy as np

def ordered_polarity(probs, anchors):
    # Probability-weighted average of the anchor positions.
    probs = np.asarray(probs, dtype=float)
    anchors = np.asarray(anchors, dtype=float)
    return float(np.dot(probs, anchors))
```
These implementations are just the expected-value idea applied directly to ordered sentiment classes. (Statsmodels)
Bottom line
Use this unless you have a strong reason not to:
polarity = P(positive) - P(negative)
That is the clearest recommended mapping for negative / neutral / positive probabilities into one score in [-1, 1]. It is not a universal official standard, but it is the most natural and defensible default. The two main warnings are:
- your labels must really be ordered
- your probabilities may need calibration before the magnitude is trustworthy. (Statsmodels)
Best references
- Hugging Face model outputs: logits are scores before SoftMax. (Hugging Face)
- Hugging Face course and docs for sequence classification mechanics. (Hugging Face)
- Statsmodels OrderedModel: labels are ordered, but not inherently numeric. (Statsmodels)
- Scikit-learn probability calibration guide. (scikit-learn)
- Scikit-learn calibration API and temperature scaling support. (scikit-learn)
- Scikit-learn 3-class calibration example. (scikit-learn)