Aggregation Provenance
Aggregation is the most subtle feature of ProvSQL’s query rewriter: unlike selection or join, an aggregate function fundamentally combines values rather than just propagating them, so the provenance of an aggregate result cannot be a plain semiring value of the same type as a tuple’s provenance. This chapter explains the data model ProvSQL uses to represent aggregate provenance, the gate types involved, and how the rewriter glues everything together. The final section is a step-by-step guide for adding a new aggregate function in C++.
For the user-facing semantics see Aggregation and Grouping; for the formal semantics over the extended relational algebra see the ICDE 2026 paper [Sen et al., 2026].
The Semimodule Picture
The semiring framework for positive relational algebra (selection,
projection, join, union) annotates tuples: each row of a
K-relation carries a single element of a commutative semiring K.
Aggregation does not fit that picture, because the result of
SUM(salary) is a value, not a tuple, and the question “where
does this value come from?” has to track both the contributing
rows and the payload each row contributed. Amsterdamer, Deutch,
and Tannen ([Amsterdamer et al., 2011]) resolved this
by annotating tuples and values simultaneously using a
K-semimodule.
A K-semimodule is a commutative monoid \((M, +, 0)\) together
with a scalar multiplication \(\cdot : K \times M \to M\) that
distributes over the semiring operations on K and over the monoid
addition on M. The intuition is that elements of M are
“aggregate values” and elements of K are “provenance
annotations”; scaling m by k produces an M-element
“tagged” with the provenance k.
The specific K-semimodule the paper builds for aggregate
provenance is the tensor product \(K \star M\), whose
elements are finite formal sums
\(k_1 \star m_1 + k_2 \star m_2 + \cdots + k_n \star m_n\)
of pairs (“simple tensors”) \(k \star m\), modulo the
equivalences that make \(\star\) bilinear
(\((k \oplus k') \star m = k \star m + k' \star m\), etc.).
Note how this mixes three different operations: \(\oplus\)
is the ⊕ of the semiring K, \(+\) is the monoid addition of
the semimodule \(K \star M\), and \(\star\) is the
tensor product that glues a K-element to an M-element. A simple
tensor \(k \star m\) should be read as “the tuple with
provenance k contributed the value m”. The whole
\(K \star M\) is itself a K-semimodule, with scalar
multiplication \(k' \cdot (k \star m) = (k' \otimes k) \star m\).
Concretely, for a query like
SELECT city, SUM(salary) FROM personnel GROUP BY city;
the provenance of the aggregate value in one group is
where k_i is the semiring annotation of the i-th input tuple
in the group and m_i is its salary payload (lifted into M).
The sum is not collapsed into a single number – it carries,
for every contributing tuple, both its provenance and the value
it contributed, and concrete semirings (Counting, Formula, …)
are free to specialise the aggregation over \(K \star M\)
into a meaningful scalar only at evaluation time.
Strictly speaking, the Amsterdamer et al. construction requires
\((M, +, 0)\) to be a commutative monoid, which rules out
aggregate operators that are non-associative or non-commutative
(ARRAY_AGG with a user-supplied order, for instance).
ProvSQL therefore reuses the semimodule framework – the
tensor product \(K \star M\), the semimod / agg /
value gate types, the circuit shape – but does not bake
the monoid axioms into it. An agg gate is treated as an
abstract formal sum of simple tensors, and it is up to each
concrete accumulator to decide what “combining” those tensors
means. Aggregates whose operator genuinely is a commutative
monoid (SUM, COUNT, MIN, MAX, AND, OR,
…) open the door to optimisations – reordering, partial
folding during traversal, and so on – that accumulators for
order-sensitive or non-associative aggregates cannot use. The
distinction is a property of the accumulator, not of the gate
representation.
ProvSQL realises this construction directly as a circuit built
from three gate types in gate_type:
value– a leaf carrying a constant payload \(m\) (stored as a string in the persistent representation; the C++ accumulator parses it into the appropriate native type).semimod– a binary gate representing one simple tensor \(k \star m\). Its two children are the tuple’s provenance (a sub-circuit evaluating to \(k \in K\)) and avaluegate carrying \(m\).agg– the aggregate root, the formal sum \(\sum_i (k_i \star m_i)\) taken over all tuples in the group. Its annotations (info1/info2fromget_infos) record the PostgreSQL OID of the aggregate function and the OID of its result type, so the evaluator can later instantiate the right C++ accumulator.
Row-level provenance and the δ operator
An agg gate gives the provenance of the aggregate value in
one column of one output row, but the row itself also has a
provenance – “this row exists in the result” – that the
downstream evaluators need. That row-level provenance is the ⊕
of the per-tuple tokens of the group, just like for a plain
GROUP BY without aggregates. However, summing raw tuple
annotations can over-count existence: a row that exists in
multiplicity \(n\) in the input would naively have a
row-level provenance of
\(1_K \oplus 1_K \oplus \cdots \oplus 1_K\) (n times),
whereas what we want is a single “it exists”.
The δ operator (Amsterdamer, Deutch, Tannen
[Amsterdamer et al., 2011]) solves this. A
δ-semiring is a semiring together with a unary operation δ
satisfying \(\delta(\mathbb{0}) = \mathbb{0}\) and
\(\delta(\mathbb{1} \oplus \cdots \oplus \mathbb{1}) = \mathbb{1}\)
regardless of the number of \(\mathbb{1}\) s. Intuitively, δ collapses “any positive number of
witnesses” to a single “exists”. The rewriter emits a delta
gate wrapping the row-level ⊕ whenever the query has aggregation
and no HAVING clause (see
make_provenance_expression). HAVING queries do not
add a δ: a HAVING predicate constrains existence in a way
that depends on the aggregate value, so the row-level token has
to carry more information than a flat “exists”, and the rewriter
instead routes the result through cmp gates that
having_semantics.hpp evaluates ahead of the main
traversal (see Optional Methods).
The agg_token Type
PostgreSQL evaluates aggregates using its own machinery, which
expects them to return ordinary scalar types (an INT, a
FLOAT…). But the rewritten query needs the aggregate
result to carry both the scalar value and the provenance gate
that captures how it was computed – otherwise downstream
references to provenance() would have nothing to return.
ProvSQL solves this by introducing agg_token, a
composite SQL type wrapping a UUID (the root agg gate) and a
string form of the aggregate value. The rewriter wraps every
Aggref in a call to provenance_aggregate whose
return type is agg_token. When the executor processes the
result, every aggregate column carries an agg_token and
both pieces of information flow downstream:
provenanceextracts the UUID for the user.Cast operators on
agg_token(defined inagg_token.c) extract the scalar value when the user treats the result as a number, e.g. for further arithmetic.
A consequence of this design is that arithmetic on aggregate
results loses provenance: once you write
SUM(x) * 2, the multiplication operates on the scalar side of
the agg_token and the result is a plain FLOAT
without any associated gate. ProvSQL emits a warning in this
case (controlled by insert_agg_token_casts in the
rewriter) but the loss is unavoidable: the result is no longer a
direct combination of base tuples and we cannot honestly attach
a circuit to it. Comparisons of aggregate results, on the other
hand, are routed through cmp gates so that HAVING
predicates do preserve provenance – see Semiring Evaluation
for the gory details of how that pre-evaluation works.
What the Rewriter Builds
The generic rewriting pipeline in Query Rewriting Pipeline covers
aggregation at the pipeline level: Step 4 calls
rewrite_agg_distinct to lift COUNT(DISTINCT ...)
into an inner GROUP BY, Step 8 calls
replace_aggregations_by_provenance_aggregate followed
by migrate_aggtoken_quals_to_having and
insert_agg_token_casts, and Step 9 fuses the row-level
tokens with provenance_plus(array_agg(...)) and wraps the
result in provenance_delta. This section only documents the
aggregation-specific structures those passes produce – the
pieces that the generic pipeline description is too terse to spell
out.
The call that make_aggregation_expression synthesises to
replace an Aggref is a FuncExpr for
provsql.provenance_aggregate whose arguments are:
the OID of the aggregate function;
the OID of its result type;
the original
Aggrefitself, so PostgreSQL still computes the scalar value (this is what ends up inside theagg_token);an
ARRAY[...]of per-tupleprovenance_semimod(arg, t)calls – onesemimodgate per row of the input, glueing the row’s provenancetto the row’s contributed valuearg.
That fourth argument is where the semimodule construction of
The Semimodule Picture is actually assembled: every
semimod gate is one simple tensor \(k \star m\), and the
agg gate at the root of the provenance_aggregate call is
their formal sum \(\sum_i (k_i \star m_i)\).
The row-level side of the rewrite is much simpler. It reuses the
ordinary get_provenance_attributes collection, combines the
per-row tokens with provenance_plus(array_agg(...)), and – for
queries with aggregation and no HAVING – wraps the result in
provenance_delta. The row-level token therefore has no
semimodule content: it only records which input rows witness the
output row’s existence.
The end result is that each row of the aggregation output carries:
per-aggregate-column
agg_tokenvalues whose UUID points to anagggate combiningsemimodper-tuple contributions;a row-level
provsqltoken whose root is adeltagate over the ⊕ of the group’s row tokens.
These two pieces are independent: an evaluator that asks for the
provenance of a value in the result reaches an agg gate; an
evaluator that asks for the provenance of the row itself reaches
a delta gate.
Currently Supported Aggregates
The AggregationOperator enum in Aggregation.h
lists the operators currently implemented in C++: COUNT
(normalised to SUM over INT), SUM, MIN, MAX,
AVG, AND, OR, CHOOSE, and ARRAY_AGG. Adding
to this list is the topic of the next section.
Step-by-Step: Adding a New Aggregate
As a running example, assume we want to add bit_and – the
bitwise-AND of all non-null integer values in a group. The
mechanics are independent of which aggregate you are adding; the
only design point is the C++ accumulator that knows how to
combine the values incrementally.
Add an enum value. In
Aggregation.h, extendAggregationOperator:enum class AggregationOperator { COUNT, SUM, MIN, MAX, AVG, AND, OR, CHOOSE, ARRAY_AGG, BIT_AND, // new NONE, };
Map the PostgreSQL function name to the enum. In
Aggregation.cpp, extendgetAggregationOperatorwith a case matching the PostgreSQL function name returned byget_func_name():} else if(func_name == "bit_and") { op = AggregationOperator::BIT_AND;
Each PostgreSQL aggregate function with distinct semantics needs its own enum value and accumulator; aliases with identical semantics (e.g.
stddev/stddev_samp) can share one.Implement the accumulator. Add a templated
Aggregatorsubclass inAggregation.cppnext to the existing ones (SumAgg,MinAgg,AvgAgg, etc.). The class implements four virtual methods:template <typename T> struct BitAndAgg : Aggregator { T acc = ~T{0}; // all-ones identity bool has = false; void add(const AggValue& x) override { if (x.getType() == ValueType::NONE) return; acc &= std::get<T>(x.v); has = true; } AggValue finalize() const override { return has ? AggValue{acc} : AggValue{}; } AggregationOperator op() const override { return AggregationOperator::BIT_AND; } ValueType inputType() const override { return ValueType::INT; } ValueType resultType() const override { return ValueType::INT; } };
Instantiate the accumulator. Extend
makeAggregatorinAggregation.cppwith a case that creates the right template instantiation for each supported input type:case AggregationOperator::BIT_AND: switch (t) { case ValueType::INT: return std::make_unique<BitAndAgg<long>>(); default: throw std::runtime_error("BIT_AND not supported for this type"); }
Add a regression test. Create
test/sql/agg_bit_and.sqland its expected output, following the pattern of the existing aggregation tests. Reference it fromtest/schedule.common(see Testing and Build System for the schedule conventions).Update the user guide. Mention the new aggregate in Aggregation and Grouping, and add it to the list of currently supported operators in the “Currently Supported Aggregates” section above.
Nothing else needs to change: the query rewriter, the
provenance_aggregate SQL function, the agg_token
composite type, and the aggregation_evaluate SQL
dispatcher all operate on OIDs and metadata, so they pick up new
aggregates automatically once steps 1–4 are in place.