filesampler: File Sampler
A collection of utilities for reading subsamples of flat text files by line in a reasonably efficient manner. The package samples as it scans the input file: each line is randomly either written to an external temporary file or skipped. The temporary file is then read back into R. For (aggressive) downsampling, this is a very effective strategy; for resampling, you are much better off reading the full dataset into memory.
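The strategy described above can be sketched in a few lines. The following is only an illustration of the general technique (keep each line with some probability while scanning, spill the kept lines to a temporary file, then parse that file). It is written in Python, not R, and it is not the package's implementation or API: the function names `sample_lines_to_tempfile` and `read_sample`, the parameters `p`, `header`, and `seed`, and the assumption that the input is a CSV file with one record per line are all hypothetical choices made for this example.

```python
import csv
import os
import random
import tempfile


def sample_lines_to_tempfile(path, p, header=True, seed=None):
    """Scan `path` once, keeping each data line with probability `p`.

    Hypothetical helper for illustration. Returns the path of a temporary
    file that holds the (optional) header line plus the kept lines.
    """
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be between 0 and 1")
    rng = random.Random(seed)
    fd, tmp_path = tempfile.mkstemp(suffix=".csv", text=True)
    # newline="" on both files copies line endings through unchanged.
    with os.fdopen(fd, "w", newline="") as out, open(path, "r", newline="") as src:
        if header:
            # The header is always kept, never sampled.
            out.write(src.readline())
        for line in src:
            # Independent coin flip per line: only kept lines are written,
            # so memory use does not grow with the size of the input.
            if rng.random() < p:
                out.write(line)
    return tmp_path


def read_sample(path, p, header=True, seed=None):
    """Sample lines into a temporary file, parse it, then delete it."""
    tmp_path = sample_lines_to_tempfile(path, p, header=header, seed=seed)
    try:
        with open(tmp_path, newline="") as f:
            return list(csv.reader(f))
    finally:
        os.remove(tmp_path)
```

With a small `p`, only a small fraction of the input is ever parsed, which is why this approach suits aggressive downsampling. Note that the number of rows returned is random: on average it is `p` times the number of data lines, not an exact count.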
Source: filesampler_0.4-0.tar.gz Archive
Author | Drew Schmidt [aut, cre], Daniel Lemire [ctb] (vectorized line counter) |
---|---|
Maintainer | Drew Schmidt <wrathematics@gmail.com> |
Version | 0.4-0 |
License | BSD 2-clause License + file LICENSE |
Depends | R (>= 3.5.0) |
Imports | utils |
Compiled? | yes |
URL | https://github.com/wrathematics/filesampler |
BugReports | https://github.com/wrathematics/filesampler/issues |