We found a match
Your institution may have access to this item. Find your institution then sign in to continue.
- Title
Probability Density Estimation through Nonparametric Adaptive Partitioning and Stitching.
- Authors
Merino, Zach D.; Farmer, Jenny; Jacobs, Donald J.
- Abstract
We present a novel nonparametric adaptive partitioning and stitching (NAPS) algorithm to estimate a probability density function (PDF) of a single variable. Sampled data is partitioned into blocks using a branching tree algorithm that minimizes deviations from a uniform density within blocks of various sample sizes arranged in a staggered format. The block sizes are constructed to balance the load in parallel computing as the PDF for each block is independently estimated using the nonparametric maximum entropy method (NMEM) previously developed for automated high throughput analysis. Once all block PDFs are calculated, they are stitched together to provide a smooth estimate throughout the sample range. Each stitch is an averaging process over weight factors based on the estimated cumulative distribution function (CDF) and a complementary CDF that characterize how data from flanking blocks overlap. Benchmarks on synthetic data show that our PDF estimates are fast and accurate for sample sizes ranging from 2 9 to 2 27 , across a diverse set of distributions that account for single and multi-modal distributions with heavy tails or singularities. We also generate estimates by replacing NMEM with kernel density estimation (KDE) within blocks. Our results indicate that NAPS(NMEM) is the best-performing method overall, while NAPS(KDE) improves estimates near boundaries compared to standard KDE.
- Subjects
CUMULATIVE distribution function; NONPARAMETRIC estimation; MAXIMUM entropy method; PROBABILITY density function; PROBABILITY theory; TREE branches
- Publication
Algorithms, 2023, Vol 16, Issue 7, p310
- ISSN
1999-4893
- Publication type
Article
- DOI
10.3390/a16070310