What the study found
PANDAExpress is a new algorithm that the authors say is both simpler and faster than PANDA. It removes the polylogarithmic factor hidden in PANDA’s runtime while retaining PANDA’s ability to handle conjunctive queries, disjunctive datalog rules, free variables, and arbitrary input degree constraints.
Why the authors say this matters
The authors say this matters because PANDA’s hidden polylogarithmic factor makes it impractical and slower than specialized algorithms. They conclude that PANDAExpress matches the runtimes of more intricate specialized algorithms while preserving PANDA’s broader generality.
What the researchers tested
The paper studies conjunctive queries and disjunctive datalog rules under input degree constraints, which are statistics and integrity constraints used in relational database management systems. The authors first prove a new probabilistic inequality for bounding the output size of disjunctive datalog rules under arbitrary degree constraints, and then derive the PANDAExpress algorithm from that proof.
What worked and what didn't
The new inequality provides the basis for PANDAExpress. The algorithm uses a new partitioning scheme with arbitrary hyperplane cuts, rather than the axis-parallel hyperplanes used in PANDA, and these cuts are dynamically constructed using data-skewness statistics tracked during execution. The abstract reports that this removes the polylogarithmic factor from PANDA’s runtime and that PANDAExpress can also handle ℓp-norm constraints, which generalize degree constraints.
What to keep in mind
The abstract does not describe experimental evaluation, so no empirical performance results are provided in the available summary. It also does not state limitations beyond noting that the work extends the setting from degree constraints to ℓp-norm constraints.
Key points
- PANDAExpress is presented as a simpler and faster version of PANDA.
- The authors say it removes the polylogarithmic runtime factor hidden in PANDA’s O~ notation.
- The algorithm keeps PANDA’s generality for conjunctive queries, disjunctive datalog rules, free variables, and arbitrary degree constraints.
- A new probabilistic inequality for disjunctive datalog rules leads directly to the algorithm.
- PANDAExpress uses dynamically constructed arbitrary hyperplane cuts and can handle ℓp-norm constraints.
Disclosure
- Research title:
- PANDAExpress removes PANDA’s polylogarithmic runtime factor
Get the weekly research newsletter
Stay current with peer-reviewed research without reading academic papers — one filtered digest, every Friday.


