Specify which aggregate computations are supported #116

csharrison · 2021-03-08T20:16:39Z

In the meeting on 03-08-2021 we went over some example computations the aggregate API could support (slides) that satisfy differential privacy. These included:

Fixed domain vector aggregation (e.g. 1M aggregation keys)
- Example MPC: Prio, Distributed point functions
Hierarchical domains (possibly with multiple queries), to prune a larger domain smaller in some flexible way. Thresholding can be used to make the computations more efficient, though may not be strictly required by DP.
- Example MPC: hierarchical DPFs
"Sparse vector" techniques to handle truly massive domains (e.g. 2^64 or 2^128 entries from hashing a string), which requires thresholding to preserve DP, but will never report on a key that wasn't present (see this doc)
- Example MPC: something like what we documented in private_histograms_mpc.md, although more work is needed to evaluate these techniques.

These techniques have different pros and cons (and these techniques are obviously not exhaustive). I'm filing this issue to solicit more feedback. Some evaluation criteria:

Developer ergonomics (especially with regard to figuring out a dense encoding of aggregation keys)
Utility of output (e.g. bias introduced by thresholding)
MPC simplicity
MPC security guarantees (zero-knowledge, etc.)
MPC computation / communication costs
Privacy of output (e.g. smaller domain sizes can encode less information about users)

csharrison · 2021-04-05T03:17:27Z

Update here: Google has open sourced a C++ implementation of the distributed point functions functionality. It can be found here:
https://github.com/google/distributed_point_functions

csharrison · 2022-12-13T21:28:59Z

Closing, for now. The choice we made currently is specified in https://github.com/WICG/attribution-reporting-api/blob/main/AGGREGATION_SERVICE_TEE.md#pre-declaring-aggregation-buckets

However, there is an open issue to consider other mechanisms (#583)

csharrison mentioned this issue Mar 25, 2021

Undocumented proposals and statements csharrison/aggregate-reporting-api#20

Closed

csharrison mentioned this issue May 5, 2021

ML challenge inspired from the aggregate reporting API proposal #137

Closed

csharrison mentioned this issue Jun 7, 2021

aggregate reporting helper service threat model #157

Open

csharrison closed this as completed Dec 13, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Specify which aggregate computations are supported #116

Specify which aggregate computations are supported #116

csharrison commented Mar 8, 2021

csharrison commented Apr 5, 2021

csharrison commented Dec 13, 2022

Specify which aggregate computations are supported #116

Specify which aggregate computations are supported #116

Comments

csharrison commented Mar 8, 2021

csharrison commented Apr 5, 2021

csharrison commented Dec 13, 2022