romSpline is an easy-to-use Python code for compressing and interpolating 1d data using reduced-order modeling and statistical learning methods.
romSpline uses a greedy algorithm to find a nearly optimal subset of data samples that recovers the original data, to the requested tolerance, with a univariate spline interpolant. The output is the optimal subset of data and the corresponding reduced-order spline interpolant.
romSpline also contains code to estimate the prediction errors at new samples due to interpolation. These methods are largely based on Monte Carlo K-fold cross-validation studies. The mean of the resulting validation errors can be used as a global and useful upper bound on the interpolation errors.
This code should be useful for downsampling or compressing large data arrays to their essential components from which the original information can be constructed and new data predicted through interpolation. The degree of downsampling can be significant (e.g., orders of magnitude) for relatively smooth data. Furthermore, the distribution of the resulting reduced dataset provides information into features and structures of the data that might otherwise not be readily observed.
Future releases of romSpline will provide locally adapted interpolation error estimations based on more refined cross-validation studies. In addition, enhancements to the greedy algorithm will incorporate possible additional information about data quality (such as uncertainties in the data values being compressed and interpolated, if available).
If you use this code for academic work then please cite the following paper describing the method and algorithm:
- C. R. Galley and P. Schmidt, "Fast and efficient evaluation of gravitational waveforms via reduced-order spline interpolation" arXiv:1611.07529
romspline is available through PyPI:
pip install romspline
romspline is available on conda-forge:
conda install -c conda-forge romspline
git clone https://github.com/crgalley/romspline.git
cd romspline
python setup.py install
If you do not have root permissions, replace the last step with
python setup.py install --user
. Instead of using setup.py
manually, you can also replace the last step with pip install .
or
pip install --user .
.
Alternatively, you can download or clone this repository and add the download path to your PYTHONPATH variable.
As another alternative, include the following lines in your Python code:
import sys
sys.path.append(<path to romspline>)
import romspline
All of these can be installed through pip or conda.
romSpline requires NumPy, SciPy, and H5py, which come with most Python distributions. For parallelization, which is useful but not necessary for some of the cross-validation routines, romSpline currently uses the concurrent.futures module. If you are using Python 2 and do not have concurrent.futures installed you may install it using pip:
pip install futures
Future versions of romSpline will not use concurrent.futures.
See the accompanying IPython notebooks (romSpline_example.ipynb and errors_example.ipynb) for simple tutorials on using the code and estimating errors of the reduced-order spline interpolant for predicting new values.
Copyright (C) 2015 Chad Galley (crgalley "at" tapir "dot" caltech "dot" edu). Released under the MIT/X Consortium license. Comments and requests welcome.